Preface


This is a textbook on classical electricity and magnetism, designed for 
an undergraduate course at the junior or senior level. It can be covered 
comfortably in two semesters, maybe even with room to spare for special 
topics (AC circuits, numerical methods, plasma physics, transmission lines, 
antenna theory, etc.). A one-semester course could reasonably stop after 
Chapter 7. Unlike quantum mechanics or thermal physics (for example), 
there is a fairly general consensus with respect to the teaching of
electrodynamics; the subjects to be included, and even their order of presentation,
are not particularly controversial, and textbooks differ mainly in style and 
tone. My approach is perhaps less formal than most; I think this makes 
difficult ideas more interesting and accessible. 

For this new edition I have made a number of small changes, in the 
interests of clarity and grace. In a few places I have corrected errors or 
expanded the discussion. I have added some new problems, including a 
few (flagged by a mouse in the margin) that will require a computer. It 
seemed worthwhile to provide at least some minimal guidance to those 
wanting to plot a given field (in the form of representative arrows), to draw 
a field line (starting from a particular point), or to graph the trajectory of a 
charged particle moving through a specified electric and/or magnetic field 
(starting at a particular point with a given velocity). These problems are 
keyed to Mathematica (specifically, version 13.1), but instructors should 
have no trouble converting them to other languages, if they prefer. I have 
added some new commentary on subtle issues: ambiguities in the definition 
of polarization in crystals, problematic aspects of electric and (especially) 
magnetic field lines, the awkward role of intrinsic spin (a strictly quantum 
phenomenon) in a classical discussion of magnetic materials. 

And I have included more references to the accessible literature
(particularly the American Journal of Physics). I realize, of course, that most
readers will not have the time or inclination to consult these resources, but 
I think it is worthwhile anyway, if only to emphasize that electrodynamics, 
notwithstanding its venerable age, is very much alive, and intriguing new 
discoveries are being made all the time. I hope that occasionally a problem 
will pique your curiosity, and you will be inspired to look up the references — 
some of them are real gems. 

I have maintained three items of unorthodox notation: 


• The Cartesian unit vectors are written x̂, ŷ, and ẑ (and, in general, all unit
vectors inherit the letter of the corresponding coordinate).

[Figure: the separation vector 𝓻, drawn from the source point r′ to the field point r.]

• The distance from the z-axis in cylindrical coordinates¹ is designated by
s, to avoid confusion with r (the distance from the origin, and the radial
coordinate in the spherical system).

• The script letter 𝓻 denotes the vector from a source point r′ to the field
point r, as in the figure above. Some authors prefer the more explicit
(r − r′). But this makes many equations distractingly cumbersome, especially
when the unit vector 𝓻̂ is involved. I realize that unwary readers
are tempted to interpret 𝓻 as r; it certainly makes the integrals easier!
Please take note: 𝓻 = (r − r′), which is not the same as r. I think it’s good
notation, but it does have to be handled with care.²


As in previous editions, I distinguish two kinds of problems. Some 
have a specific pedagogical purpose, and should be worked immediately 
after reading the section to which they pertain; these I have placed at the 
pertinent point within the chapter. (In a few cases the solution to a problem 
is used later in the text; these are indicated by a bullet (•) in the left margin.)
Longer problems, or those of a more general nature, will be found at the 
end of each chapter. When I teach the subject, I assign some of these, and 
work a few of them in class. Unusually challenging problems are flagged by 
an exclamation point (!) in the margin. Many readers have asked that the 
answers to problems be provided at the back of the book; unfortunately, 
just as many are strenuously opposed. I have compromised, supplying 
answers when this seems particularly appropriate. A complete solution 
manual is available (to instructors) from the publisher; go to the Cambridge 
University Press website (www.cambridge.org/electrodynamics5e) to order 
a copy. 

I have benefitted from the comments of many colleagues. I cannot list 
them all here, but I would like to thank the following people for especially 
useful contributions to this edition: Catherine Crouch (Swarthmore), Ted 
Jacobson (Maryland), Don Koks (Adelaide), T. R. Lemberger (Ohio State),
Kirk McDonald³ (Princeton), Jim McTavish (Liverpool), Mahdiyar Noorbala
(Tehran), Darrell Schroeter (Reed), Herschel Snodgrass (Lewis and
Clark), Larry Tankersley (Naval Academy), and above all Joel Franklin
(Reed). Practically everything I know about electrodynamics, and certainly
about teaching electrodynamics, I owe to Edward Purcell.


1 D. Pontius (Phys. Teach. 55, 388, 2017) suggests we call it the abaxial coordinate.

2 In MS Word, 𝓻 is “Kaufmann font,” but this can be very difficult to install in TeX. TeX
users can download a pretty good facsimile from my website: www.reed.edu/physics/faculty/
griffiths/.

3 Kirk’s website, http://kirkmcd.princeton.edu/examples/, is a fantastic resource, with clever
explanations, nifty problems, and useful references.
Advertisement

WHAT IS ELECTRODYNAMICS, AND HOW DOES IT FIT INTO THE GENERAL
SCHEME OF PHYSICS?


Four Realms of Mechanics 


In the diagram below, I have sketched out the four great realms of
mechanics:

    Classical Mechanics      |  Quantum Mechanics
    (Newton)                 |  (Bohr, Heisenberg, Schrödinger, et al.)
    -------------------------+--------------------------------------------
    Special Relativity       |  Quantum Field Theory
    (Einstein)               |  (Dirac, Pauli, Feynman, Schwinger, et al.)


Newtonian mechanics is adequate for most purposes in “everyday life,” 
but for objects moving at high speeds (near the velocity of light) it is 
incorrect, and must be replaced by special relativity (introduced by Einstein 
in 1905); for objects that are extremely small (near the size of atoms) 
it fails for different reasons, and is superseded by quantum mechanics 
(developed by Bohr, Heisenberg, Schrödinger, and many others, in the 
1920s, mostly). For objects that are both very fast and very small (as is 
common in modern particle physics), a mechanics that combines relativity 
and quantum principles is in order; this relativistic quantum mechanics is 
known as quantum field theory — it was worked out in the thirties and 
forties, but even today it cannot claim to be a completely satisfactory 
system. In this book, save for the last chapter, we shall work exclusively 
in the domain of classical mechanics, although electrodynamics extends 
with unique simplicity to the other three realms. (In fact, the theory is in 
most respects automatically consistent with special relativity, for which it 
was, historically, the main stimulus.) But please remember, this is a book 
about classical (and in Chapter 12 relativistic) electrodynamics; there is 
no quantum mechanics here, and you can’t expect the theory to explain 
everything. As we’ll see in Chapter 6, the classical theory struggles to 
provide even a qualitative account of magnetic materials. And there is no 
mention of the photon (well ... that’s not quite true: the photon does make 
a brief appearance in Chapter 12, as an example of a massless particle, but 
never in its more significant role as the quantum of the electromagnetic
field). Classical electrodynamics describes a possible world, but it is not in 
every detail the world we actually inhabit; there’s more to the story. 


Four Kinds of Forces 


Mechanics tells us how a system will behave when subjected to a given force. 
There are just four fundamental forces known (presently) to physics; I list 
them in the order of decreasing strength: 


(1) Strong
(2) Electromagnetic
(3) Weak
(4) Gravitational


The brevity of this list may surprise you. Where is friction? Where is the 
“normal” force that keeps you from falling through the floor? Where are the 
chemical forces that bind molecules together? Where is the force of impact 
between two colliding billiard balls? The answer is that all these forces are
electromagnetic. Indeed, it is scarcely an exaggeration to say that we live in 
an electromagnetic world — practically every force we experience in everyday 
life, with the exception of gravity, is electromagnetic at its core. 

The strong forces, which hold protons and neutrons together in the 
atomic nucleus, have extremely short range, so we do not “feel” them, in 
spite of the fact that they are a hundred times more powerful than electrical 
forces. The weak forces, which account for certain kinds of radioactive 
decay, are also of short range, and they are far weaker than electromagnetic 
forces. As for gravity, it is so pitifully feeble (compared to all of the others) 
that it is only by virtue of huge mass concentrations (like the Earth and 
the Sun) that we ever notice it at all. The electrical repulsion between two 
electrons is 10^42 times as large as their gravitational attraction, and if atoms
were held together by gravitational (instead of electrical) forces, a single 
hydrogen atom would be much larger than the known Universe. 
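
The size of the electron-electron ratio quoted above can be checked in a few lines. The following is only a rough sketch: the constants are rounded SI values supplied here as assumptions (the text does not quote them), and the function name is made up for illustration. Because both forces fall off as the inverse square of the separation, the distance cancels and never appears.

```python
import math

# Rounded SI values, assumed for this sketch (not quoted in the text)
K_COULOMB = 8.9875517923e9      # 1/(4*pi*eps0), N m^2 / C^2
G_NEWTON = 6.67430e-11          # gravitational constant, N m^2 / kg^2
E_CHARGE = 1.602176634e-19      # elementary charge, C
M_ELECTRON = 9.1093837015e-31   # electron mass, kg


def electric_to_gravity_ratio(charge, mass):
    """Ratio F_electric / F_gravity for two identical particles.

    Both forces go as 1/r^2, so the separation drops out of the ratio.
    """
    return (K_COULOMB * charge**2) / (G_NEWTON * mass**2)


ratio = electric_to_gravity_ratio(E_CHARGE, M_ELECTRON)
print(f"F_electric / F_gravity = {ratio:.2e}")
print(f"order of magnitude: 10^{math.floor(math.log10(ratio))}")
```

With these inputs the ratio comes out a little above 4 × 10^42, consistent with the order of magnitude in the text.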

Not only are electromagnetic forces overwhelmingly dominant in everyday
life, they are also, at present, the only ones that are completely understood.
There is, of course, a classical theory of gravity (Newton’s law of
universal gravitation) and a relativistic one (Einstein’s general relativity), 
but no entirely satisfactory quantum mechanical theory of gravity has been 
constructed (though many people are working on it). At the present time 
there is a very successful (if cumbersome) theory for the weak interactions, 
and a strikingly attractive candidate (called chromodynamics) for the strong 
interactions. All these theories draw their inspiration from electrodynamics; 
none can claim conclusive experimental verification at this stage. So electrodynamics,
a beautifully complete and successful theory, has become a kind
of paradigm for physicists: an ideal model that other theories emulate. 

The laws of classical electrodynamics were discovered in bits and pieces
by Franklin, Coulomb, Ampère, Faraday, and others, but the person who
completed the job, and packaged⁴ it all in the compact and consistent
form it has today, was James Clerk Maxwell. The theory is now about
150 years old.


The Unification of Physical Theories 


In the beginning, electricity and magnetism were entirely separate subjects. 
The one dealt with glass rods and cat’s fur, pith balls, batteries, currents, 
electrolysis, and lightning; the other with bar magnets, iron filings, compass 
needles, and the North Pole. But in 1820, Oersted noticed that an electric 
current could deflect a magnetic compass needle. Soon afterward, Ampère 
postulated that all magnetic phenomena are due to electric charges in 
motion. Then, in 1831, Faraday discovered that a moving magnet generates 
an electric current. By the time Maxwell and Lorentz put the finishing 
touches on the theory, electricity and magnetism were inextricably intertwined.
They could no longer be regarded as separate subjects, but rather
as two aspects of a single subject: electromagnetism. 

Faraday speculated that light, too, is electrical in nature. Maxwell’s 
theory provided spectacular justification for this hypothesis, and soon 
optics — the study of lenses, mirrors, prisms, interference, and diffraction — 
was incorporated into electromagnetism. Hertz, who presented the decisive 
experimental confirmation for Maxwell’s theory in 1888, put it this way: 
“The connection between light and electricity is now established .... In every 
flame, in every luminous particle, we see an electrical process .... Thus, the 
domain of electricity extends over the whole of nature. It even affects 
ourselves intimately: we perceive that we possess ... an electrical organ — the 
eye.” By 1900, then, three great branches of physics — electricity, magnetism, 
and optics — had merged into a single unified theory. (And it was soon 
apparent that visible light represents only a tiny “window” in the vast 
spectrum of electromagnetic radiation, from radio through microwaves, 
infrared and ultraviolet, to X-rays and gamma rays.) 

Einstein dreamed of a further unification, which would combine gravity 
and electrodynamics, in much the same way as electricity and magnetism 
had been combined a century earlier. His unified field theory was not 
particularly successful, but in recent years the same impulse has spawned a 
hierarchy of increasingly ambitious (and speculative) unification schemes, 
beginning in the 1960s with the electroweak theory of Glashow, Weinberg,
and Salam (which joins the weak and electromagnetic forces), and
culminating in the 1980s with the superstring theory (which, according to its
proponents, incorporates all four forces in a single “theory of everything”).
At each step in this hierarchy, the mathematical difficulties mount, and the
gap between inspired conjecture and experimental test widens; nevertheless,
it is clear that the unification of forces initiated by electromagnetism has
become a major theme in the progress of physics.


4 Actually, the packaging itself was perfected by Oliver Heaviside.


The Field Formulation of Electrodynamics 


The fundamental problem a theory of electromagnetism hopes to solve is 
this: I hold up a bunch of electric charges here (and maybe shake them 
around). What happens to some other charge, over there? The classical solution
takes the form of a field theory: We say that the space around an electric
charge is permeated by electric and magnetic fields (the electromagnetic 
“odor,” as it were, of the charge). A second charge, in the presence of these 
fields, experiences a force; the fields, then, transmit the influence from one 
charge to the other — they “mediate” the interaction. 

When a charge undergoes acceleration, a portion of the field “detaches” 
itself, in a sense, and travels off at the speed of light, carrying with it 
energy, momentum, and angular momentum. We call this electromagnetic 
radiation. Its existence invites (if not compels) us to regard the fields as 
independent dynamical entities in their own right, every bit as “real” as 
atoms or baseballs. Our interest accordingly shifts from the study of forces 
between charges to the theory of the fields themselves. But it takes a charge 
to produce an electromagnetic field, and it takes another charge to detect 
one, so we had best begin by reviewing the essential properties of electric 
charge. 


Electric Charge 


(1) Charge comes in two varieties. We call them “plus” and “minus,” 
because their effects tend to cancel (if you have +q and -q at the same 
point, electrically it is the same as having no charge there at all). This may 
seem too obvious to warrant comment, but I encourage you to contemplate 
other possibilities: What if there were 8 or 10 different species of charge? 
(In chromodynamics there are, in fact, three quantities analogous to electric 
charge, each of which may be positive or negative.) Or what if the two kinds 
did not tend to cancel? The extraordinary fact is that plus and minus charges 
occur in exactly equal amounts, to fantastic precision, in bulk matter, so 
that their effects are almost completely neutralized. Were it not for this, we 
would be subjected to enormous forces: a potato would explode violently 
if the cancellation were imperfect by as little as one part in 10^10.

(2) Charge is conserved. It cannot be created or destroyed: what there
is now has always been. (A plus charge can “annihilate” an equal minus 
charge, but a plus charge cannot simply disappear by itself — something 
must pick up that electric charge.) So the total charge of the Universe is 
fixed for all time. This is called global conservation of charge. Actually, 
I can say something much stronger: Global conservation would allow for a 
charge to disappear in New York and instantly reappear in San Francisco 
(that wouldn’t affect the total), and yet we know this doesn’t happen. If the 
charge was in New York and it went to San Francisco, then it must have 
passed along some continuous path from one to the other. This is called 
local conservation of charge. Later on we’ll see how to formulate a precise 
mathematical law expressing local conservation of charge — it’s called the 
continuity equation. 

(3) Charge is quantized. Although nothing in classical electrodynamics 
requires that it be so, the fact is that electric charge comes only in discrete 
lumps — integer multiples of the basic unit of charge. If we call the charge 
on the proton +e, then the electron carries charge −e; the neutron charge
zero; the pi mesons +e, 0, and −e; the carbon nucleus +6e; and so on (never
7.392e, or even 1/2e).⁵ This fundamental unit of charge is extremely small,
so for practical purposes it is usually appropriate to ignore quantization 
altogether. Water, too, “really” consists of discrete lumps (molecules), and 
yet, if you’re taking a drink (or a bath) it’s not unreasonable to think of it 
as a continuous fluid. This is in fact much closer to Maxwell’s own view; 
he knew nothing of electrons and protons — he must have pictured charge 
as a kind of “jelly” that could be divided up into portions of any size and 
smeared out at will. 


Units 


The subject of electrodynamics is plagued by competing systems of units, 
which sometimes render it difficult for physicists to communicate with one 
another. The problem is far worse than in mechanics, where Neanderthals 
still speak of pounds and feet; in mechanics, at least all equations look 
the same, regardless of the units used to measure quantities. Newton’s 
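second law remains F = ma, whether it is feet-pounds-seconds,
kilograms-meters-seconds, or whatever. But this is not so in electromagnetism, where
Coulomb’s law may appear variously as

$$
\mathbf{F} = \frac{q_1 q_2}{𝓇^2}\,\hat{𝓻} \ \text{(Gaussian)}, \quad\text{or}\quad
\mathbf{F} = \frac{1}{4\pi\epsilon_0}\,\frac{q_1 q_2}{𝓇^2}\,\hat{𝓻} \ \text{(SI)}, \quad\text{or}\quad
\mathbf{F} = \frac{1}{4\pi}\,\frac{q_1 q_2}{𝓇^2}\,\hat{𝓻} \ \text{(HL)}.
$$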


5 Actually, protons and neutrons are composed of three quarks, which carry fractional charges
(±2/3 e and ±1/3 e). However, free quarks do not appear to exist in nature, and in any event, this
does not alter the fact that charge is quantized; it merely reduces the size of the basic unit.

Of the systems in common use, the two most popular are Gaussian (cgs) and
SI (mks). Elementary particle theorists favor yet a third system: Heaviside-Lorentz.
Although Gaussian units offer distinct theoretical advantages,
most undergraduate instructors seem to prefer SI, I suppose because they
incorporate the familiar household units (volts, amperes, and watts). In this
book, therefore, I have used SI units. Appendix C provides a “dictionary” for
converting the main results into Gaussian units.
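
To see that the different-looking forms of Coulomb's law describe the same physics, one can evaluate the force between the same two charges in each system and compare. This is a sketch under stated assumptions: the conversion factors (statcoulombs per coulomb, dynes per newton) and the value of ε0 are standard figures supplied here rather than taken from the text, the Heaviside-Lorentz charge is taken to be √(4π) times the Gaussian charge, and the function names are hypothetical.

```python
import math

# Conversion factors and constants assumed for this sketch
EPS0 = 8.8541878128e-12             # vacuum permittivity, C^2 / (N m^2)
STATC_PER_COULOMB = 2.99792458e9    # 1 C expressed in statcoulombs (esu)
DYNE_PER_NEWTON = 1.0e5
CM_PER_M = 100.0


def coulomb_si(q1, q2, r):
    """Force magnitude in newtons; charges in coulombs, r in meters."""
    return q1 * q2 / (4 * math.pi * EPS0 * r**2)


def coulomb_gaussian(q1, q2, r):
    """Force magnitude in dynes; charges in statcoulombs, r in centimeters."""
    return q1 * q2 / r**2


def coulomb_hl(q1, q2, r):
    """Force magnitude in dynes; Heaviside-Lorentz charges, r in centimeters."""
    return q1 * q2 / (4 * math.pi * r**2)


# Two 1-microcoulomb charges half a meter apart
q_si, r_si = 1.0e-6, 0.5
q_gauss = q_si * STATC_PER_COULOMB
q_hl = math.sqrt(4 * math.pi) * q_gauss
r_cm = r_si * CM_PER_M

f_si = coulomb_si(q_si, q_si, r_si)
f_gauss_newtons = coulomb_gaussian(q_gauss, q_gauss, r_cm) / DYNE_PER_NEWTON
f_hl_newtons = coulomb_hl(q_hl, q_hl, r_cm) / DYNE_PER_NEWTON

print(f"SI:                {f_si:.6e} N")
print(f"Gaussian:          {f_gauss_newtons:.6e} N")
print(f"Heaviside-Lorentz: {f_hl_newtons:.6e} N")
```

The three results agree to within the rounding of the constants; only the bookkeeping of where the factors of 4π and ε0 live is different.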


Vector Analysis 


1.1 Vector Algebra 


1.1.1 Vector Operations 


If you walk 4 miles due north and then 3 miles due east (Fig. 1.1), you 
will have gone a total of 7 miles, but you’re not 7 miles from where you 
set out — only 5. We need an arithmetic to describe quantities like this, 
which evidently do not add in the ordinary way. The reason they don’t, of 
course, is that displacements (straight line segments going from one point 
to another) have direction as well as magnitude (length), and it is essential 
to take both into account when you combine them. Such objects are called 
vectors: velocity, acceleration, force, and momentum are other examples. By 
contrast, quantities that have magnitude but no direction are called scalars: 
examples include mass, charge, density, and temperature. 

I shall use boldface (A, B, and so on) for vectors and ordinary type 
for scalars. The magnitude of a vector A is written |A| or, more simply, 
A. In diagrams, vectors are denoted by arrows: the length of the arrow is 
proportional to the magnitude of the vector, and the arrowhead indicates 
its direction. Minus A (−A) is a vector with the same magnitude as A
but the opposite direction (Fig. 1.2). Note that vectors have magnitude 
and direction but not location: a displacement of 4 miles due north from 
Washington is represented by the same vector as a displacement 4 miles 
north from Baltimore (neglecting, of course, the curvature of the Earth). 
On a diagram, therefore, you can slide the arrow around at will, as long as 
you don’t change its length or direction. 

We define four vector operations: addition and three kinds of
multiplication.

(i) Addition of two vectors. Place the tail of B at the head of A; the sum, 
A + B, is the vector from the tail of A to the head of B (Fig. 1.3). (This 
rule generalizes the obvious procedure for combining two displacements.) 
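Addition is commutative:

$$\mathbf{A} + \mathbf{B} = \mathbf{B} + \mathbf{A};$$

3 miles east followed by 4 miles north gets you to the same place as 4 miles
north followed by 3 miles east. Addition is also associative:

$$(\mathbf{A} + \mathbf{B}) + \mathbf{C} = \mathbf{A} + (\mathbf{B} + \mathbf{C}).$$

To subtract a vector, add its opposite (Fig. 1.4):

$$\mathbf{A} - \mathbf{B} = \mathbf{A} + (-\mathbf{B}).$$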
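
The walking example can be turned into a small numerical illustration of these addition rules. This is a minimal sketch: it stores each displacement as an (east, north) pair, which borrows the idea of components a little ahead of where the text introduces it, and the helper names are invented for illustration.

```python
import math

# Displacements stored as (east, north) pairs in miles. Using components
# is a convenience of this sketch, not something the text has set up yet.
north_4 = (0.0, 4.0)
east_3 = (3.0, 0.0)


def add(a, b):
    """Head-to-tail sum of two displacements."""
    return tuple(x + y for x, y in zip(a, b))


def negate(a):
    """Same magnitude, opposite direction."""
    return tuple(-x for x in a)


def subtract(a, b):
    """A - B, computed as A + (-B)."""
    return add(a, negate(b))


def magnitude(a):
    """Length of the displacement."""
    return math.sqrt(sum(x * x for x in a))


total = add(north_4, east_3)
path_length = magnitude(north_4) + magnitude(east_3)

print("net displacement:", total)
print("distance from start:", magnitude(total))
print("distance walked:", path_length)
print("commutative:", add(north_4, east_3) == add(east_3, north_4))
```

The distance walked (7 miles) and the distance from the starting point (5 miles) differ precisely because magnitudes do not add the way the vectors themselves do.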


(ii) Multiplication by a scalar. Multiplication of a vector by a positive
scalar a multiplies the magnitude but leaves the direction unchanged
(Fig. 1.5). (If a is negative, the direction is reversed.) Scalar multiplication
is distributive:

$$a(\mathbf{A} + \mathbf{B}) = a\mathbf{A} + a\mathbf{B}.$$


(iii) Dot product of two vectors. The dot product of two vectors is 
defined by 


$$\mathbf{A} \cdot \mathbf{B} = AB\cos\theta, \tag{1.1}$$

where θ is the angle they form when placed tail to tail (Fig. 1.6). Note that
A · B is itself a scalar (hence the alternative name scalar product). The dot
product is commutative,

$$\mathbf{A} \cdot \mathbf{B} = \mathbf{B} \cdot \mathbf{A},$$

and distributive,

$$\mathbf{A} \cdot (\mathbf{B} + \mathbf{C}) = \mathbf{A} \cdot \mathbf{B} + \mathbf{A} \cdot \mathbf{C}. \tag{1.2}$$

Geometrically, A · B is the product of A times the projection of B along
A (or the product of B times the projection of A along B). If the two vectors
are parallel, then A · B = AB. In particular, for any vector A,

$$\mathbf{A} \cdot \mathbf{A} = A^2. \tag{1.3}$$

If A and B are perpendicular, then A · B = 0.


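Example 1.1. Let C = A − B (Fig. 1.7), and calculate the dot product
of C with itself.

Solution.

$$\mathbf{C} \cdot \mathbf{C} = (\mathbf{A} - \mathbf{B}) \cdot (\mathbf{A} - \mathbf{B}) = \mathbf{A} \cdot \mathbf{A} - \mathbf{A} \cdot \mathbf{B} - \mathbf{B} \cdot \mathbf{A} + \mathbf{B} \cdot \mathbf{B},$$

or

$$C^2 = A^2 + B^2 - 2AB\cos\theta.$$

This is the law of cosines.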
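
A quick numerical spot check of this result is easy to set up in two dimensions. The sketch below is an illustration rather than a proof: the magnitudes and angles are arbitrary choices, and the vectors are built from polar form so that the angle θ between them is known independently of the component arithmetic.

```python
import math

# Arbitrary magnitudes and directions (assumed values for illustration)
A_mag, B_mag = 3.0, 5.0
alpha = math.radians(20.0)   # direction of A, measured from the x-axis
beta = math.radians(75.0)    # direction of B, measured from the x-axis
theta = beta - alpha         # angle between A and B, tail to tail

A = (A_mag * math.cos(alpha), A_mag * math.sin(alpha))
B = (B_mag * math.cos(beta), B_mag * math.sin(beta))

# C = A - B, formed component by component
C = (A[0] - B[0], A[1] - B[1])
C_squared = C[0] ** 2 + C[1] ** 2

# Right-hand side of the law of cosines
law_of_cosines = A_mag**2 + B_mag**2 - 2 * A_mag * B_mag * math.cos(theta)

print("C . C          =", C_squared)
print("law of cosines =", law_of_cosines)
```

The two printed numbers should agree to floating-point precision for any choice of magnitudes and angles.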


(iv) Cross product of two vectors. The cross product of two vectors is 
defined by 


$$\mathbf{A} \times \mathbf{B} = AB\sin\theta\,\hat{\mathbf{n}}, \tag{1.4}$$

where n̂ is a unit vector (vector of magnitude 1) pointing perpendicular to
the plane of A and B. (I shall use a hat (^) to denote unit vectors.) Of course,
there are two directions perpendicular to any plane: “in” and “out.” The
ambiguity is resolved by the right-hand rule: let your fingers point in the
direction of the first vector and curl around (via the smaller angle) toward
the second; then your thumb indicates the direction of n̂. (In Fig. 1.8, A × B
points into the page; B × A points out of the page.) Note that A × B is itself
a vector (hence the alternative name vector product). The cross product is
distributive,

$$\mathbf{A} \times (\mathbf{B} + \mathbf{C}) = (\mathbf{A} \times \mathbf{B}) + (\mathbf{A} \times \mathbf{C}), \tag{1.5}$$

but not commutative. In fact,

$$(\mathbf{B} \times \mathbf{A}) = -(\mathbf{A} \times \mathbf{B}). \tag{1.6}$$

Geometrically, |A × B| is the area of the parallelogram generated by A
and B (Fig. 1.8). If two vectors are parallel, their cross product is zero. In
particular,

$$\mathbf{A} \times \mathbf{A} = \mathbf{0}$$

for any vector A. (Here 0 is the zero vector, with magnitude 0.)


Problem 1.1. Using the definitions in Eqs. 1.1 and 1.4, and appropriate diagrams, show that the 
dot product and cross product are distributive: 


(a) when the three vectors are coplanar; 
(b) in the general case. 


Problem 1.2. Is the cross product associative?

$$(\mathbf{A} \times \mathbf{B}) \times \mathbf{C} \stackrel{?}{=} \mathbf{A} \times (\mathbf{B} \times \mathbf{C}).$$


If so, prove it; if not, provide a counterexample (the simpler the better). 


1.1.2 Vector Algebra: Component Form 


In the previous section, I defined the four vector operations (addition, scalar 
multiplication, dot product, and cross product) in “abstract” form — that 
is, without reference to any particular coordinate system. In practice, it is 
often easier to set up Cartesian coordinates x, y, z and work with vector
components. Let x̂, ŷ, and ẑ be unit vectors parallel to the x-, y-, and z-axes,
respectively (Fig. 1.9a). An arbitrary vector A can be expressed in terms of
these basis vectors (Fig. 1.9b):

$$\mathbf{A} = A_x\,\hat{\mathbf{x}} + A_y\,\hat{\mathbf{y}} + A_z\,\hat{\mathbf{z}}.$$

The coefficients A_x, A_y, and A_z are the “components” of A; geometrically,
they are the projections of A along the three coordinate axes (A_x = A · x̂,
A_y = A · ŷ, A_z = A · ẑ). We can now reformulate each of the four vector
operations as a rule for manipulating components:


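$$
\begin{aligned}
\mathbf{A} + \mathbf{B} &= (A_x\hat{\mathbf{x}} + A_y\hat{\mathbf{y}} + A_z\hat{\mathbf{z}}) + (B_x\hat{\mathbf{x}} + B_y\hat{\mathbf{y}} + B_z\hat{\mathbf{z}}) \\
&= (A_x + B_x)\hat{\mathbf{x}} + (A_y + B_y)\hat{\mathbf{y}} + (A_z + B_z)\hat{\mathbf{z}}.
\end{aligned}
\tag{1.7}
$$

Rule (i) To add vectors, add like components.

$$a\mathbf{A} = (aA_x)\hat{\mathbf{x}} + (aA_y)\hat{\mathbf{y}} + (aA_z)\hat{\mathbf{z}}. \tag{1.8}$$

Rule (ii) To multiply by a scalar, multiply each component.

Because x̂, ŷ, and ẑ are mutually perpendicular unit vectors,

$$\hat{\mathbf{x}} \cdot \hat{\mathbf{x}} = \hat{\mathbf{y}} \cdot \hat{\mathbf{y}} = \hat{\mathbf{z}} \cdot \hat{\mathbf{z}} = 1, \qquad \hat{\mathbf{x}} \cdot \hat{\mathbf{y}} = \hat{\mathbf{x}} \cdot \hat{\mathbf{z}} = \hat{\mathbf{y}} \cdot \hat{\mathbf{z}} = 0. \tag{1.9}$$

Accordingly,

$$
\begin{aligned}
\mathbf{A} \cdot \mathbf{B} &= (A_x\hat{\mathbf{x}} + A_y\hat{\mathbf{y}} + A_z\hat{\mathbf{z}}) \cdot (B_x\hat{\mathbf{x}} + B_y\hat{\mathbf{y}} + B_z\hat{\mathbf{z}}) \\
&= A_x B_x + A_y B_y + A_z B_z.
\end{aligned}
\tag{1.10}
$$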


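Rule (iii) To calculate the dot product, multiply like components, and add.
In particular,

$$\mathbf{A} \cdot \mathbf{A} = A_x^2 + A_y^2 + A_z^2,$$

so

$$A = \sqrt{A_x^2 + A_y^2 + A_z^2}. \tag{1.11}$$

(This is, if you like, the three-dimensional generalization of the
Pythagorean theorem.)

Similarly,¹

$$
\begin{aligned}
\hat{\mathbf{x}} \times \hat{\mathbf{x}} = \hat{\mathbf{y}} \times \hat{\mathbf{y}} = \hat{\mathbf{z}} \times \hat{\mathbf{z}} &= \mathbf{0}, \\
\hat{\mathbf{x}} \times \hat{\mathbf{y}} = -\hat{\mathbf{y}} \times \hat{\mathbf{x}} &= \hat{\mathbf{z}}, \\
\hat{\mathbf{y}} \times \hat{\mathbf{z}} = -\hat{\mathbf{z}} \times \hat{\mathbf{y}} &= \hat{\mathbf{x}}, \\
\hat{\mathbf{z}} \times \hat{\mathbf{x}} = -\hat{\mathbf{x}} \times \hat{\mathbf{z}} &= \hat{\mathbf{y}}.
\end{aligned}
\tag{1.12}
$$

Therefore,

$$
\begin{aligned}
\mathbf{A} \times \mathbf{B} &= (A_x\hat{\mathbf{x}} + A_y\hat{\mathbf{y}} + A_z\hat{\mathbf{z}}) \times (B_x\hat{\mathbf{x}} + B_y\hat{\mathbf{y}} + B_z\hat{\mathbf{z}}) \\
&= (A_y B_z - A_z B_y)\hat{\mathbf{x}} + (A_z B_x - A_x B_z)\hat{\mathbf{y}} + (A_x B_y - A_y B_x)\hat{\mathbf{z}}.
\end{aligned}
\tag{1.13}
$$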


1 These signs pertain to a right-handed coordinate system (x-axis out of the page, y-axis to the
right, z-axis up, or any rotated version thereof). In a left-handed system (z-axis down), the
signs would be reversed: x̂ × ŷ = −ẑ, and so on. We shall use right-handed systems exclusively.
Always check that you haven’t inadvertently adopted a left-handed system.

This cumbersome expression can be written more neatly as a determinant:

$$
\mathbf{A} \times \mathbf{B} =
\begin{vmatrix}
\hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\
A_x & A_y & A_z \\
B_x & B_y & B_z
\end{vmatrix}.
\tag{1.14}
$$

Rule (iv) To calculate the cross product, form the determinant whose first
row is x̂, ŷ, ẑ, whose second row is A (in component form), and whose third
row is B.
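
Rules (iii) and (iv) translate almost directly into code. The sketch below represents a vector as a tuple of its three Cartesian components; the function names and the sample vectors are choices made here for illustration. The cross product is written as the cofactor expansion of the determinant along its first row.

```python
def dot(a, b):
    """Rule (iii): multiply like components, and add."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a, b):
    """Rule (iv): expand the determinant along its first row."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by,
            az * bx - ax * bz,
            ax * by - ay * bx)


# Unit vectors of a right-handed Cartesian system
x_hat, y_hat, z_hat = (1, 0, 0), (0, 1, 0), (0, 0, 1)

# Two arbitrary sample vectors (assumed values)
A = (2, -1, 3)
B = (0, 4, 1)

print("x_hat x y_hat =", cross(x_hat, y_hat))
print("A x B =", cross(A, B))
print("B x A =", cross(B, A))
print("A . (A x B) =", dot(A, cross(A, B)))
```

The last line should print zero, since A × B is perpendicular to A, and swapping the order of the factors flips the sign of every component.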


Example 1.2. Find the angle between the face diagonals of a cube. 


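[Figure 1.10: a cube of side 1 with one corner at the origin; the face diagonals A and B make an angle θ.]

Solution. We might as well use a cube of side 1, and place it as shown
in Fig. 1.10, with one corner at the origin. The face diagonals A and B
are

$$\mathbf{A} = 1\,\hat{\mathbf{x}} + 0\,\hat{\mathbf{y}} + 1\,\hat{\mathbf{z}}, \qquad \mathbf{B} = 0\,\hat{\mathbf{x}} + 1\,\hat{\mathbf{y}} + 1\,\hat{\mathbf{z}}.$$

So, in component form,

$$\mathbf{A} \cdot \mathbf{B} = 1 \cdot 0 + 0 \cdot 1 + 1 \cdot 1 = 1.$$

On the other hand, in “abstract” form,

$$\mathbf{A} \cdot \mathbf{B} = AB\cos\theta = \sqrt{2}\sqrt{2}\cos\theta = 2\cos\theta.$$

Therefore,

$$\cos\theta = 1/2, \quad\text{or}\quad \theta = 60^\circ.$$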


Of course, you can get the answer more easily by drawing in a diagonal 
across the top of the cube, completing the equilateral triangle. But in 
cases where the geometry is not so simple, this device of comparing 
the abstract and component forms of the dot product can be a very 
efficient means of finding angles. 
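
The device used in this example (equate the component form of the dot product to AB cos θ and solve for θ) is easy to package as a small function. This is a sketch with a hypothetical function name; the clamp on the cosine is a defensive detail added here to guard against floating-point round-off pushing the value slightly outside [−1, 1].

```python
import math


def angle_between(a, b):
    """Angle in degrees between two nonzero vectors given by components."""
    dot_ab = sum(x * y for x, y in zip(a, b))
    mag_a = math.sqrt(sum(x * x for x in a))
    mag_b = math.sqrt(sum(x * x for x in b))
    cos_theta = dot_ab / (mag_a * mag_b)
    # Clamp to the valid domain of acos to absorb round-off
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


# Face diagonals of the unit cube from Example 1.2
A = (1.0, 0.0, 1.0)
B = (0.0, 1.0, 1.0)

print("angle between face diagonals (degrees):", angle_between(A, B))
```

For the two face diagonals the function returns 60 degrees, up to round-off in the last digits.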


Problem 1.3. Find the angle between the body diagonals of a cube. 


Problem 1.4. Use the cross product to find the components of the unit vector n̂ perpendicular
to the shaded plane in Fig. 1.11. 

1.1.3 Triple Products


Since the cross product of two vectors is itself a vector, it can be dotted or 
crossed with a third vector to form a triple product. 

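(i) Scalar triple product: A · (B × C). Geometrically, |A · (B × C)| is the
volume of the parallelepiped generated by A, B, and C, since |B × C| is the
area of the base, and |A cos θ| is the altitude (Fig. 1.12). Evidently,

$$\mathbf{A} \cdot (\mathbf{B} \times \mathbf{C}) = \mathbf{B} \cdot (\mathbf{C} \times \mathbf{A}) = \mathbf{C} \cdot (\mathbf{A} \times \mathbf{B}), \tag{1.15}$$

for they all correspond to the same figure. Note that “alphabetical” order
is preserved; in view of Eq. 1.6, the “nonalphabetical” triple products

$$\mathbf{A} \cdot (\mathbf{C} \times \mathbf{B}) = \mathbf{B} \cdot (\mathbf{A} \times \mathbf{C}) = \mathbf{C} \cdot (\mathbf{B} \times \mathbf{A}),$$

have the opposite sign. In component form,

$$
\mathbf{A} \cdot (\mathbf{B} \times \mathbf{C}) =
\begin{vmatrix}
A_x & A_y & A_z \\
B_x & B_y & B_z \\
C_x & C_y & C_z
\end{vmatrix}.
\tag{1.16}
$$

Note that the dot and cross can be interchanged:

$$\mathbf{A} \cdot (\mathbf{B} \times \mathbf{C}) = (\mathbf{A} \times \mathbf{B}) \cdot \mathbf{C}$$

(this follows immediately from Eq. 1.15); however, the placement of the
parentheses is critical: (A · B) × C is a meaningless expression, since you can’t
make a cross product from a scalar and a vector.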

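(ii) Vector triple product: A × (B × C). The vector triple product can be
simplified by the so-called BAC-CAB rule:

$$\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) = \mathbf{B}(\mathbf{A} \cdot \mathbf{C}) - \mathbf{C}(\mathbf{A} \cdot \mathbf{B}). \tag{1.17}$$

Notice that

$$(\mathbf{A} \times \mathbf{B}) \times \mathbf{C} = -\mathbf{C} \times (\mathbf{A} \times \mathbf{B}) = -\mathbf{A}(\mathbf{B} \cdot \mathbf{C}) + \mathbf{B}(\mathbf{A} \cdot \mathbf{C})$$

is an entirely different vector (cross products are not associative). All
higher vector products can be similarly reduced, by repeated application
of Eq. 1.17, so it is never necessary for an expression to contain more than
one cross product in any term. For instance,

$$
\begin{aligned}
(\mathbf{A} \times \mathbf{B}) \cdot (\mathbf{C} \times \mathbf{D}) &= (\mathbf{A} \cdot \mathbf{C})(\mathbf{B} \cdot \mathbf{D}) - (\mathbf{A} \cdot \mathbf{D})(\mathbf{B} \cdot \mathbf{C}), \\
\mathbf{A} \times [\mathbf{B} \times (\mathbf{C} \times \mathbf{D})] &= \mathbf{B}[\mathbf{A} \cdot (\mathbf{C} \times \mathbf{D})] - (\mathbf{A} \cdot \mathbf{B})(\mathbf{C} \times \mathbf{D}).
\end{aligned}
\tag{1.18}
$$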
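
A numerical spot check of Eqs. 1.15 and 1.17 with one particular set of vectors can catch sign slips when these identities are used in practice. It is only a sanity check under assumed sample values, not a proof (a single example cannot establish an identity), and the helper names are invented for this sketch.

```python
def dot(a, b):
    """Sum of products of like components."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Component form of the cross product (right-handed system)."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)


def scale(s, a):
    """Multiply each component by the scalar s."""
    return tuple(s * x for x in a)


def subtract(a, b):
    """Componentwise difference a - b."""
    return tuple(x - y for x, y in zip(a, b))


# Arbitrary integer sample vectors (assumed values)
A = (1, 2, 3)
B = (-2, 0, 5)
C = (4, -1, 2)

# BAC-CAB rule, Eq. 1.17
lhs = cross(A, cross(B, C))
rhs = subtract(scale(dot(A, C), B), scale(dot(A, B), C))
print("A x (B x C)       =", lhs)
print("B(A.C) - C(A.B)   =", rhs)

# Cyclic symmetry of the scalar triple product, Eq. 1.15
triple = dot(A, cross(B, C))
print("A.(BxC), B.(CxA), C.(AxB):",
      triple, dot(B, cross(C, A)), dot(C, cross(A, B)))
print("A.(CxB) (opposite sign):", dot(A, cross(C, B)))
```

Integer components are used so that the comparisons are exact rather than subject to floating-point tolerance.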


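Problem 1.5. Prove the BAC-CAB rule by writing out both sides in component form.


Problem 1.6. Prove that

$$[\mathbf{A} \times (\mathbf{B} \times \mathbf{C})] + [\mathbf{B} \times (\mathbf{C} \times \mathbf{A})] + [\mathbf{C} \times (\mathbf{A} \times \mathbf{B})] = \mathbf{0}.$$

Under what conditions does A × (B × C) = (A × B) × C?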


1.1.4 Position, Displacement, and Separation Vectors 


The location of a point in three dimensions can be described by listing its 
Cartesian coordinates (x, y, z). The vector to that point from the origin (O)
is called the position vector (Fig. 1.13):

$$\mathbf{r} = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}. \tag{1.19}$$

I will reserve the letter r for this purpose, throughout the book. Its magnitude,

$$r = \sqrt{x^2 + y^2 + z^2}, \tag{1.20}$$

is the distance from the origin, and

$$\hat{\mathbf{r}} = \frac{\mathbf{r}}{r} = \frac{x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}}{\sqrt{x^2 + y^2 + z^2}} \tag{1.21}$$

is a unit vector pointing radially outward. The infinitesimal displacement
vector, from (x, y, z) to (x + dx, y + dy, z + dz), is

$$d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}. \tag{1.22}$$

(We could call this dr, since that’s what it is, but it’s useful to have a special
notation for infinitesimal displacements.)

In electrodynamics, one frequently encounters problems involving two 
points — typically, a source point, r’, where an electric charge is located, and 
a field point, r, at which you are calculating the electric or magnetic field 
(Fig. 1.14). It pays to adopt right from the start some shorthand notation 
for the separation vector from the source point to the field point. I shall use 
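for this purpose the cursive letter 𝓻:

$$𝓻 = \mathbf{r} - \mathbf{r}'. \tag{1.23}$$

Its magnitude is

$$𝓇 = |\mathbf{r} - \mathbf{r}'|, \tag{1.24}$$

and a unit vector in the direction from r′ to r is

$$\hat{𝓻} = \frac{𝓻}{𝓇} = \frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|}. \tag{1.25}$$

In Cartesian coordinates,

$$𝓻 = (x - x')\,\hat{\mathbf{x}} + (y - y')\,\hat{\mathbf{y}} + (z - z')\,\hat{\mathbf{z}}, \tag{1.26}$$

$$𝓇 = \sqrt{(x - x')^2 + (y - y')^2 + (z - z')^2}, \tag{1.27}$$

$$\hat{𝓻} = \frac{(x - x')\,\hat{\mathbf{x}} + (y - y')\,\hat{\mathbf{y}} + (z - z')\,\hat{\mathbf{z}}}{\sqrt{(x - x')^2 + (y - y')^2 + (z - z')^2}} \tag{1.28}$$

(from which you will appreciate the economy of the script-𝓻 notation).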
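
In numerical work the separation vector is usually computed once and reused. The following is a minimal sketch of Eqs. 1.26 to 1.28; the function name, the argument order (field point first, source point second), and the sample points are all choices made here for illustration.

```python
import math


def separation(field_point, source_point):
    """Return (vector, magnitude, unit vector) for the separation r - r'.

    field_point is r, source_point is r'; both are (x, y, z) tuples.
    """
    vec = tuple(f - s for f, s in zip(field_point, source_point))
    mag = math.sqrt(sum(c * c for c in vec))
    if mag == 0.0:
        # The unit vector is undefined when the two points coincide
        raise ValueError("source and field points coincide")
    unit = tuple(c / mag for c in vec)
    return vec, mag, unit


# Sample points (assumed values, chosen so the numbers come out simply)
source = (1.0, 1.0, 1.0)   # r'
field = (4.0, 5.0, 1.0)    # r

vec, mag, unit = separation(field, source)
print("separation vector:", vec)
print("magnitude:", mag)
print("unit vector:", unit)
```

Note the order of subtraction: the vector points from the source to the field point, so swapping the arguments reverses its direction.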


Problem 1.7. Find the separation vector 𝓻 from the source point (2, 8, 7) to the field point
(4, 6, 8). Determine its magnitude (𝓇), and construct the unit vector 𝓻̂.


1.1.5 How Vectors Transform 


The² definition of a vector as “a quantity with a magnitude and direction”
is not altogether satisfactory: What precisely does “direction” mean? This 
may seem a pedantic question, but we shall soon encounter a species of 
derivative that looks rather like a vector, and we’ll want to know for sure 
whether it is one. 

You might be inclined to say that a vector is anything that has three
components that combine properly under addition. Well, how about this:
We have a barrel of fruit that contains $N_x$ pears, $N_y$ apples, and $N_z$ bananas.
Is $\mathbf{N} = N_x\,\hat{\mathbf{x}} + N_y\,\hat{\mathbf{y}} + N_z\,\hat{\mathbf{z}}$ a vector? It has three components, and when you
add another barrel with $M_x$ pears, $M_y$ apples, and $M_z$ bananas the result is
$(N_x + M_x)$ pears, $(N_y + M_y)$ apples, $(N_z + M_z)$ bananas. So it does add like
a vector. Yet it’s obviously not a vector, in the physicist’s sense of the word,
because it doesn’t really have a direction. What exactly is wrong with it?


2 This section can be skipped without loss of continuity.

The answer is that N does not transform properly when you change
coordinates. The coordinate frame we use to describe positions in space is of
course entirely arbitrary, but there is a specific geometrical transformation
law for converting vector components from one frame to another. Suppose,
for instance, the $\bar{x}, \bar{y}, \bar{z}$ system is rotated by angle φ, relative to x, y, z, about
the common $x = \bar{x}$ axes. From Fig. 1.15,

$$ A_y = A\cos\theta, \qquad A_z = A\sin\theta, $$

while

$$ \bar{A}_y = A\cos\bar{\theta} = A\cos(\theta - \phi) = A(\cos\theta\cos\phi + \sin\theta\sin\phi) = \cos\phi\,A_y + \sin\phi\,A_z, $$

$$ \bar{A}_z = A\sin\bar{\theta} = A\sin(\theta - \phi) = A(\sin\theta\cos\phi - \cos\theta\sin\phi) = -\sin\phi\,A_y + \cos\phi\,A_z. $$


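We might express this conclusion in matrix notation:

$$ \begin{pmatrix} \bar{A}_y \\ \bar{A}_z \end{pmatrix} = \begin{pmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{pmatrix} \begin{pmatrix} A_y \\ A_z \end{pmatrix}. \tag{1.29} $$

More generally, for rotation about an arbitrary axis in three dimensions,
the transformation law takes the form

$$ \begin{pmatrix} \bar{A}_x \\ \bar{A}_y \\ \bar{A}_z \end{pmatrix} = \begin{pmatrix} R_{xx} & R_{xy} & R_{xz} \\ R_{yx} & R_{yy} & R_{yz} \\ R_{zx} & R_{zy} & R_{zz} \end{pmatrix} \begin{pmatrix} A_x \\ A_y \\ A_z \end{pmatrix}, \tag{1.30} $$

or, more compactly,

$$ \bar{A}_i = \sum_{j=1}^{3} R_{ij} A_j, \tag{1.31} $$

where the index 1 stands for x, 2 for y, and 3 for z. The elements of the
matrix R can be ascertained, for a given rotation, by the same sort of
trigonometric arguments as we used for a rotation about the x-axis.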


Now, do the components of N transform in this way? Of course not:
it doesn’t matter what coordinates you use to represent positions in space;
there are still just as many apples in the barrel. You can’t convert a pear
into a banana by choosing a different set of axes, but you can turn $A_x$ into
$\bar{A}_y$. Formally, then, a vector is a set of three components that transforms the
same way as (x, y, z) under a rotation of the coordinate axes. As always,
displacement is the archetype for the behavior of all vectors.³
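
To make the transformation law of Eq. 1.29 concrete, the following Python sketch rotates the y and z components of a vector about the x axis and confirms that the components change while the length does not. The helper name `rotate_about_x`, the angle, and the sample components are all assumptions made for illustration.

```python
import math

def rotate_about_x(a_y, a_z, phi):
    """Components of A in axes rotated by angle phi about x (Eq. 1.29)."""
    a_y_bar = math.cos(phi) * a_y + math.sin(phi) * a_z
    a_z_bar = -math.sin(phi) * a_y + math.cos(phi) * a_z
    return a_y_bar, a_z_bar

phi = math.radians(30.0)      # arbitrary rotation angle
a_y, a_z = 3.0, 4.0           # arbitrary vector components
a_y_bar, a_z_bar = rotate_about_x(a_y, a_z, phi)

# The individual components mix, but the length is the same in both frames.
length_before = math.hypot(a_y, a_z)
length_after = math.hypot(a_y_bar, a_z_bar)
print(a_y_bar, a_z_bar, length_before, length_after)
```

The fruit counts in the barrel have no analogous mixing: no choice of phi turns pears into bananas, which is exactly why N fails the test.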

By the way, a (second-rank) tensor is a quantity with nine components,
$T_{xx}, T_{xy}, T_{xz}, T_{yx}, \ldots, T_{zz}$, which transform with two factors of R:

$$ \begin{aligned} \bar{T}_{xx} = {} & R_{xx}(R_{xx}T_{xx} + R_{xy}T_{xy} + R_{xz}T_{xz}) \\ & + R_{xy}(R_{xx}T_{yx} + R_{xy}T_{yy} + R_{xz}T_{yz}) \\ & + R_{xz}(R_{xx}T_{zx} + R_{xy}T_{zy} + R_{xz}T_{zz}), \ \ldots \end{aligned} $$

or, more compactly,

$$ \bar{T}_{ij} = \sum_{k=1}^{3}\sum_{l=1}^{3} R_{ik}R_{jl}T_{kl}. \tag{1.32} $$

In general, an nth-rank tensor has n indices and $3^n$ components, and
transforms with n factors of R. In this hierarchy, a vector is a tensor of
rank 1, and a scalar is a tensor of rank zero.⁴


Problem 1.8. 


(a) Prove that the two-dimensional rotation matrix (Eq. 1.29) preserves dot products. (That is,
show that $\bar{A}_y\bar{B}_y + \bar{A}_z\bar{B}_z = A_yB_y + A_zB_z$.)

(b) What constraints must the elements ($R_{ij}$) of the three-dimensional rotation matrix
(Eq. 1.30) satisfy in order to preserve the length of A (for all vectors A)?


Problem 1.9. Find the transformation matrix R that describes a rotation by 120° about an 
axis from the origin through the point (1, 1, 1). The rotation is clockwise as you look down the 
axis toward the origin. 


Problem 1.10. Vectors (and scalars, and tensors) are defined by their behavior under a rotation 
of the coordinate axes. But it is sometimes of interest to enquire how they might transform 
(using displacements as a model) under other coordinate changes — translations, say, or 
inversions. 


3 If you’re a mathematician you might want to contemplate generalized vector spaces in which 
the “axes” have nothing to do with direction and the basis vectors are no longer x̂, ŷ, and
ẑ (indeed, there may be more than three dimensions). This is the subject of linear algebra.
But for our purposes all vectors live in ordinary 3-space (or, in Chapter 12, in 4-dimensional 
space-time). 

4 A scalar does not change when you change coordinates. In particular, the individual 
components of a vector are not scalars, but the magnitude is. 
Fig. 1.16


(a) How do the components of a vector⁵ transform under a translation of coordinates
($\bar{x} = x$, $\bar{y} = y - a$, $\bar{z} = z$, Fig. 1.16a)?

(b) How do the components of a displacement transform under an inversion of coordinates
($\bar{x} = -x$, $\bar{y} = -y$, $\bar{z} = -z$, Fig. 1.16b)?

(c) How do the components of a cross product (Eq. 1.13) transform under inversion? (The cross 
product of two vectors is called a pseudovector because of this “anomalous” behavior 
under inversions. It’s still a vector — that is determined by its behavior under rotations; 
where the distinction is at issue I'll call a vector with the “normal” behavior under inversions 
an ordinary vector.) Is the cross product of two pseudovectors an ordinary vector, or a 
pseudovector? Name two pseudovector quantities in classical mechanics. 

(d) How does the scalar triple product of three vectors transform under inversions? (Such an 
object is called a pseudoscalar.) 


1.2 Differential Calculus 


1.2.1 “Ordinary” Derivatives 


Suppose we have a function of one variable: f(x). Question: What does the 
derivative, df/dx, do for us? Answer: It tells us how rapidly the function f (x) 
varies when we change the argument x by a tiny amount, dx: 

$$ df = \left(\frac{df}{dx}\right) dx. \tag{1.33} $$

In words, if we increment x by an infinitesimal amount dx, then f changes 
by an amount df; the derivative is the proportionality factor. For example, 


5 Beware: The vector r (Eq. 1.19) goes from a specific point in space (the origin, O) to the
point P = (x, y, z). Under a translation the new origin ($\bar{O}$) is at a different location, and the
arrow from $\bar{O}$ to P is a completely different vector. The original vector r still goes from O
to P, regardless of the coordinates used to label these points.


in Fig. 1.17a, the function varies slowly with x, and the derivative is 
correspondingly small. In Fig. 1.17b, f increases rapidly with x, and the 
derivative is large, as you move away from x = 0. 

Geometrical Interpretation: The derivative df/dx is the slope of the graph 
of f versus x. 


1.2.2 Gradient 


Suppose now that we have a function of three variables, say, the temperature
T(x, y, z) in this room. (Start out in one corner, and set up a system
of axes; then for each point (x, y, z) in the room, T gives the temperature at 
that location.) We want to generalize the notion of “derivative” to functions 
like T, which depend not on one but on three variables. 

A derivative is supposed to tell us how fast the function varies if we 
move a little distance. But this time the situation is more complicated, 
because it depends on what direction we move: if we go straight up, 
then the temperature will probably increase fairly rapidly, but if we move 
horizontally, it may not change much at all. In fact, the question “How fast 
does T vary?” has an infinite number of answers, one for each direction we 
might choose to explore. 

Fortunately, the problem is not as bad as it looks. A theorem on partial 
derivatives states that 

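$$ dT = \left(\frac{\partial T}{\partial x}\right) dx + \left(\frac{\partial T}{\partial y}\right) dy + \left(\frac{\partial T}{\partial z}\right) dz. \tag{1.34} $$
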
This tells us how T changes when we alter all three variables by the 
infinitesimal amounts dx, dy, dz. Notice that we do not require an infinite 
number of derivatives — three will suffice: the partial derivatives along each 
of the three coordinate directions. 

Equation 1.34 is reminiscent of a dot product: 

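$$ dT = \left(\frac{\partial T}{\partial x}\,\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\,\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\,\hat{\mathbf{z}}\right) \cdot (dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}) = (\nabla T)\cdot(d\mathbf{l}), \tag{1.35} $$

where

$$ \nabla T \equiv \frac{\partial T}{\partial x}\,\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\,\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\,\hat{\mathbf{z}} \tag{1.36} $$

is called the gradient of T. Note that ∇T is a vector quantity, with three
components; it is the generalized derivative we have been looking for.
Equation 1.35 is the three-dimensional version of Eq. 1.33.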

Geometrical Interpretation of the Gradient: Like any vector, the gradient
has magnitude and direction. To determine its geometrical meaning, let’s
rewrite the dot product (Eq. 1.35) using Eq. 1.1:

$$ dT = \nabla T\cdot d\mathbf{l} = |\nabla T|\,|d\mathbf{l}|\cos\theta, \tag{1.37} $$

where θ is the angle between ∇T and dl. Now, if we fix the magnitude
|dl| and search around in various directions (that is, vary θ), the maximum
change in T evidently occurs when θ = 0 (for then cos θ = 1). That is,
for a fixed distance |dl|, dT is greatest when I move in the same direction as
∇T. Thus,


the gradient ∇T points in the direction of maximum increase of the
function T.


Moreover,


the magnitude |∇T| gives the slope (rate of increase) along this
maximal direction.


Imagine you are standing on a hillside. Look all around you, and find 
the direction of steepest ascent. That is the direction of the gradient. Now 
measure the slope in that direction (rise over run). That is the magnitude of
the gradient. (Here the function we're talking about is the height of the hill, 
and the coordinates it depends on are positions — latitude and longitude, say. 
This function depends on only two variables, not three, but the geometrical 
meaning of the gradient is easier to grasp in two dimensions.) Notice from 
Eq. 1.37 that the direction of maximum descent is opposite to the direction 
of maximum ascent, while at right angles (θ = 90°) the slope is zero (the
gradient is perpendicular to the contour lines). You can conceive of surfaces 
that do not have these properties, but they always have “kinks” in them, and 
correspond to nondifferentiable functions. 

What would it mean for the gradient to vanish? If ∇T = 0 at (x, y, z),
then dT = 0 for small displacements about the point (x, y, z). This is, then, 
a stationary point of the function T(x, y, z). It could be a maximum (a 
summit), a minimum (a valley), a saddle point (a pass), or a “shoulder.” 
This is analogous to the situation for functions of one variable, where a 
vanishing derivative signals a maximum, a minimum, or an inflection. In 
particular, if you want to locate the extrema of a function of three variables, 
set its gradient equal to zero. 
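Example 1.3. Find the gradient of $r = \sqrt{x^2 + y^2 + z^2}$ (the magnitude
of the position vector).


Solution.

$$ \begin{aligned} \nabla r &= \frac{\partial r}{\partial x}\,\hat{\mathbf{x}} + \frac{\partial r}{\partial y}\,\hat{\mathbf{y}} + \frac{\partial r}{\partial z}\,\hat{\mathbf{z}} \\ &= \frac{1}{2}\frac{2x}{\sqrt{x^2 + y^2 + z^2}}\,\hat{\mathbf{x}} + \frac{1}{2}\frac{2y}{\sqrt{x^2 + y^2 + z^2}}\,\hat{\mathbf{y}} + \frac{1}{2}\frac{2z}{\sqrt{x^2 + y^2 + z^2}}\,\hat{\mathbf{z}} \\ &= \frac{x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}}{\sqrt{x^2 + y^2 + z^2}} = \frac{\mathbf{r}}{r} = \hat{\mathbf{r}}. \end{aligned} $$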


Does this make sense? Well, it says that the distance from the origin 
increases most rapidly in the radial direction, and that its rate of 
increase in that direction is 1 ... just what you’d expect. 
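
The result of Example 1.3 can also be checked numerically. The sketch below estimates the gradient of r by central differences at one arbitrarily chosen point and compares it with the unit radial vector. The helper `grad`, the step size `h`, and the sample point are assumptions made for this illustration; a finite-difference estimate is only approximate.

```python
import math

def grad(f, p, h=1e-6):
    """Central-difference estimate of the gradient of f at p = (x, y, z)."""
    components = []
    for i in range(3):
        up, down = list(p), list(p)
        up[i] += h
        down[i] -= h
        components.append((f(*up) - f(*down)) / (2.0 * h))
    return tuple(components)

def r(x, y, z):
    """Magnitude of the position vector (Eq. 1.20)."""
    return math.sqrt(x * x + y * y + z * z)

p = (1.0, 2.0, 2.0)                      # arbitrary sample point, where r = 3
grad_r = grad(r, p)
r_hat = tuple(c / r(*p) for c in p)      # the answer found in Example 1.3
grad_norm = math.sqrt(sum(c * c for c in grad_r))
print(grad_r, r_hat, grad_norm)
```

Up to finite-difference error, the estimated gradient agrees with the unit radial vector and has magnitude 1, as the example argues it should.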


Problem 1.11. Find the gradients of the following functions: 


(a) f(x, y, z) = x² + y³ + z⁴.
(b) f(x, y, z) = x²y³z⁴.
(c) f(x, y, z) = eˣ sin(y) ln(z).

Problem 1.12. The height of a certain hill (in feet) is given by 

h(x, y) = 10(2xy − 3x² − 4y² − 18x + 28y + 12),
where y is the distance (in miles) north, x the distance east, of South Hadley. 


(a) Where is the top of the hill located? 

(b) How high is the hill? 

(c) How steep is the slope (in feet per mile) at a point 1 mile north and 1 mile east of South 
Hadley? In what direction is the slope steepest, at that point? 


Problem 1.13. Let 𝓻 be the separation vector from a fixed point (x′, y′, z′) to the point
(x, y, z), and let 𝓇 be its length. Show that


(a) ∇(𝓇²) = 2𝓻;
(b) ∇(1/𝓇) = −𝓻̂/𝓇².
(c) What is the general formula for ∇(𝓇ⁿ)?

Problem 1.14. Suppose f is a function of two variables (y and z) only. Show that the
gradient ∇f = (∂f/∂y)ŷ + (∂f/∂z)ẑ transforms as a vector under rotations, Eq. 1.29. [Hint:
$(\partial f/\partial\bar{y}) = (\partial f/\partial y)(\partial y/\partial\bar{y}) + (\partial f/\partial z)(\partial z/\partial\bar{y})$, and the analogous formula for $\partial f/\partial\bar{z}$.
We know that $\bar{y} = y\cos\phi + z\sin\phi$ and $\bar{z} = -y\sin\phi + z\cos\phi$; “solve” these equations
for y and z (as functions of $\bar{y}$ and $\bar{z}$), and compute the needed derivatives $\partial y/\partial\bar{y}$, $\partial z/\partial\bar{y}$, etc.]


1.2.3 The Del Operator 


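The gradient has the formal appearance of a vector, ∇, “multiplying” a
scalar T:

$$ \nabla T = \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right) T. \tag{1.38} $$

(For once, I write the unit vectors to the left, just so no one will think this
means ∂x̂/∂x, and so on, which would be zero, since x̂ is constant.) The
term in parentheses is called del:

$$ \nabla = \hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}. \tag{1.39} $$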


Of course, del is not a vector, in the usual sense. Indeed, it doesn’t 
mean much until we give it a function to act on. Furthermore, it does not 
“multiply” T; rather, it is an instruction to differentiate what follows. To be 
precise, then, we say that ∇ is a vector operator that acts upon T, not a vector
that multiplies T.

With this qualification, though, ∇ mimics the behavior of an ordinary
vector in virtually every way; almost anything that can be done with other
vectors can also be done with ∇, if we merely translate “multiply” as “act
upon.” So by all means take the vector appearance of ∇ seriously: it is a
marvelous piece of notational simplification, as you will appreciate if you
ever consult Maxwell’s original work on electromagnetism, written without
the benefit of ∇.

Now, an ordinary vector A can multiply in three ways:


(1) by a scalar a: Aa;
(2) by a vector B via the dot product: A · B;
(3) by a vector B via the cross product: A × B.


Correspondingly, there are three ways the operator ∇ can act:


(1) on a scalar function T: ∇T (the gradient);
(2) on a vector function v via the dot product: ∇ · v (the divergence);
(3) on a vector function v via the cross product: ∇ × v (the curl).


I have already discussed the gradient. In the following sections we examine 
the other two vector derivatives: divergence and curl. 
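1.2.4 The Divergence


From the definition of ∇ we construct the divergence:

$$ \begin{aligned} \nabla\cdot\mathbf{v} &= \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right)\cdot(v_x\,\hat{\mathbf{x}} + v_y\,\hat{\mathbf{y}} + v_z\,\hat{\mathbf{z}}) \\ &= \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z}. \end{aligned} \tag{1.40} $$

Observe that the divergence of a vector function⁶ v is itself a scalar, ∇ · v.

Geometrical Interpretation: The name divergence is well chosen, for ∇ · v
is a measure of how much the vector v spreads out (diverges) from the point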
in question. For example, the vector function in Fig. 1.18a has a large (posi- 
tive) divergence (if the arrows pointed in, it would be a negative divergence), 
the function in Fig. 1.18b has zero divergence, and the function in Fig. 1.18c 
again has a positive divergence. (Please understand that v here is a function — 
there’s a vector associated with every point in space. In the diagrams, of 
course, I can only draw the arrows at a few representative locations.) 

Imagine standing at the edge of a pond. Sprinkle some sawdust or pine 
needles on the surface. If the material spreads out, then you dropped it at 
a point of positive divergence; if it collects together, you dropped it at a 
point of negative divergence. (The vector function v in this model is the 
velocity of the water at the surface — this is a two-dimensional example, but 
it helps give one a “feel” for what the divergence means. A point of positive 
divergence is a source, or “faucet”; a point of negative divergence is a sink, 
or “drain.”) 


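6 A vector function $\mathbf{v}(x, y, z) = v_x(x, y, z)\,\hat{\mathbf{x}} + v_y(x, y, z)\,\hat{\mathbf{y}} + v_z(x, y, z)\,\hat{\mathbf{z}}$ is really three functions,
one for each component. There’s no such thing as the divergence of a scalar.


Example 1.4. Suppose the functions in Fig. 1.18 are $\mathbf{v}_a = \mathbf{r} = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}$,
$\mathbf{v}_b = \hat{\mathbf{z}}$, and $\mathbf{v}_c = z\,\hat{\mathbf{z}}$. Calculate their divergences.


Solution.

$$ \nabla\cdot\mathbf{v}_a = \frac{\partial}{\partial x}(x) + \frac{\partial}{\partial y}(y) + \frac{\partial}{\partial z}(z) = 1 + 1 + 1 = 3. $$

As anticipated, this function has a positive divergence.

$$ \nabla\cdot\mathbf{v}_b = \frac{\partial}{\partial x}(0) + \frac{\partial}{\partial y}(0) + \frac{\partial}{\partial z}(1) = 0 + 0 + 0 = 0, $$

as expected.

$$ \nabla\cdot\mathbf{v}_c = \frac{\partial}{\partial x}(0) + \frac{\partial}{\partial y}(0) + \frac{\partial}{\partial z}(z) = 0 + 0 + 1 = 1. $$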
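
The three divergences of Example 1.4 can be reproduced numerically from the Cartesian formula of Eq. 1.40. The sketch below is a minimal illustration using central differences; the helper name `divergence`, the step size, and the sample point are assumptions, and the printed values are approximations rather than exact results.

```python
def divergence(v, p, h=1e-5):
    """Central-difference estimate of div v at p; v maps (x, y, z) to a 3-tuple."""
    total = 0.0
    for i in range(3):
        up, down = list(p), list(p)
        up[i] += h
        down[i] -= h
        # d(v_i)/d(x_i): only the i-th component, differentiated along axis i
        total += (v(*up)[i] - v(*down)[i]) / (2.0 * h)
    return total

def v_a(x, y, z):
    return (x, y, z)          # the position vector r

def v_b(x, y, z):
    return (0.0, 0.0, 1.0)    # the constant unit vector z-hat

def v_c(x, y, z):
    return (0.0, 0.0, z)      # z times z-hat

p = (0.3, -1.2, 2.0)          # arbitrary sample point
div_a, div_b, div_c = (divergence(v, p) for v in (v_a, v_b, v_c))
print(div_a, div_b, div_c)
```

Because all three functions are at most linear in the coordinates, the answers do not depend on which sample point is chosen.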


Problem 1.15. Calculate the divergence of the following vector functions: 


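(a) $\mathbf{v}_a = x^2\,\hat{\mathbf{x}} + 3xz^2\,\hat{\mathbf{y}} - 2xz\,\hat{\mathbf{z}}$.
(b) $\mathbf{v}_b = xy\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + 3zx\,\hat{\mathbf{z}}$.
(c) $\mathbf{v}_c = y^2\,\hat{\mathbf{x}} + (2xy + z^2)\,\hat{\mathbf{y}} + 2yz\,\hat{\mathbf{z}}$.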


Problem 1.16. Sketch the vector function 


$$ \mathbf{v} = \frac{\hat{\mathbf{r}}}{r^2} $$


and compute its divergence. The answer may surprise you ... how do you account for it? 


Problem 1.17. In two dimensions, show that the divergence transforms as a scalar under
rotations. [Hint: Use Eq. 1.29 to determine $\bar{v}_y$ and $\bar{v}_z$, and the method of Prob. 1.14 to calculate
the derivatives. Your aim is to show that $\partial\bar{v}_y/\partial\bar{y} + \partial\bar{v}_z/\partial\bar{z} = \partial v_y/\partial y + \partial v_z/\partial z$.]


1.2.5 The Curl 


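From the definition of ∇ we construct the curl:

$$ \begin{aligned} \nabla\times\mathbf{v} &= \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ v_x & v_y & v_z \end{vmatrix} \\ &= \hat{\mathbf{x}}\left(\frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z}\right) + \hat{\mathbf{y}}\left(\frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x}\right) + \hat{\mathbf{z}}\left(\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}\right). \end{aligned} \tag{1.41} $$

Notice that the curl of a vector function⁷ v is, like any cross product, a
vector.

Geometrical Interpretation: The name curl is also well chosen, for ∇ × v
is a measure of how much the vector v swirls around the point in question.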
Thus the three functions in Fig. 1.18 all have zero curl (as you can easily 
check, using the expressions in Ex. 1.4), whereas the functions in Fig. 1.19 
have a substantial curl, pointing in the z direction, as the natural right-hand 
rule would suggest. Imagine (again) you are standing at the edge of a pond. 


7 There’s no such thing as the curl of a scalar. 
Float a small paddlewheel (a cork with toothpicks pointing out radially
would do); if it starts to rotate, then you placed it at a point of nonzero 
curl. A whirlpool would be a region of large curl. 


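Example 1.5. Suppose the function sketched in Fig. 1.19a is
$\mathbf{v}_a = -y\,\hat{\mathbf{x}} + x\,\hat{\mathbf{y}}$, and that in Fig. 1.19b is $\mathbf{v}_b = x\,\hat{\mathbf{y}}$. Calculate their
curls.


Solution.

$$ \nabla\times\mathbf{v}_a = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ -y & x & 0 \end{vmatrix} = 2\,\hat{\mathbf{z}}, $$

$$ \nabla\times\mathbf{v}_b = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ 0 & x & 0 \end{vmatrix} = \hat{\mathbf{z}}. $$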


As expected, these curls point in the +z direction. (Incidentally, they 
both have zero divergence, as you might guess from the pictures: 
nothing is “spreading out”... it just “swirls around.”) 
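
The curls in Example 1.5 can be checked numerically by evaluating the three components of Eq. 1.41 with central differences. This is only a sketch: the helper name `curl`, the step size, and the sample point are assumptions made for illustration.

```python
def curl(v, p, h=1e-5):
    """Central-difference estimate of curl v at p; v maps (x, y, z) to a 3-tuple."""
    def d(i, j):
        # Partial derivative of component i of v with respect to coordinate j.
        up, down = list(p), list(p)
        up[j] += h
        down[j] -= h
        return (v(*up)[i] - v(*down)[i]) / (2.0 * h)
    return (d(2, 1) - d(1, 2),    # x: dvz/dy - dvy/dz
            d(0, 2) - d(2, 0),    # y: dvx/dz - dvz/dx
            d(1, 0) - d(0, 1))    # z: dvy/dx - dvx/dy

def v_a(x, y, z):
    return (-y, x, 0.0)

def v_b(x, y, z):
    return (0.0, x, 0.0)

p = (0.5, 1.5, -2.0)              # arbitrary sample point
curl_a = curl(v_a, p)
curl_b = curl(v_b, p)
print(curl_a, curl_b)
```

Both estimates point along +z, with magnitudes close to 2 and 1 respectively, in agreement with the determinants worked out in the example.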




Problem 1.18. Calculate the curls of the vector functions in Prob. 1.15. 


Problem 1.19. Draw a circle in the xy plane. At a few representative points draw the unit
vector v tangent to the circle, pointing in the clockwise direction. By comparing adjacent vectors,
determine the signs of $\partial v_x/\partial y$ and $\partial v_y/\partial x$. According to Eq. 1.41 then, what is the direction
of ∇ × v?


Problem 1.20. Construct a vector function that has zero divergence and zero curl every- 
where. (A constant will do the job, of course, but make it something a little more interesting 
than that!) 
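1.2.6 Product Rules


The calculation of ordinary derivatives is facilitated by a number of rules,
such as the sum rule:

$$ \frac{d}{dx}(f + g) = \frac{df}{dx} + \frac{dg}{dx}, $$

the rule for multiplying by a constant (c):

$$ \frac{d}{dx}(cf) = c\,\frac{df}{dx}, $$

the product rule:

$$ \frac{d}{dx}(fg) = f\,\frac{dg}{dx} + g\,\frac{df}{dx}, $$

and the quotient rule:

$$ \frac{d}{dx}\left(\frac{f}{g}\right) = \frac{g\,\dfrac{df}{dx} - f\,\dfrac{dg}{dx}}{g^2}. $$

Similar relations hold for the vector derivatives. Thus,

$$ \nabla(f + g) = \nabla f + \nabla g, \qquad \nabla\cdot(\mathbf{A} + \mathbf{B}) = (\nabla\cdot\mathbf{A}) + (\nabla\cdot\mathbf{B}), $$

$$ \nabla\times(\mathbf{A} + \mathbf{B}) = (\nabla\times\mathbf{A}) + (\nabla\times\mathbf{B}), $$

and

$$ \nabla(cf) = c\,\nabla f, \qquad \nabla\cdot(c\mathbf{A}) = c\,(\nabla\cdot\mathbf{A}), \qquad \nabla\times(c\mathbf{A}) = c\,(\nabla\times\mathbf{A}), $$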

as you can check for yourself. The product rules are not quite so simple. 
There are two ways to construct a scalar as the product of two functions: 


fg (product of two scalar functions),


A · B (dot product of two vector functions),
and two ways to make a vector:


fA (scalar times vector),


A × B (cross product of two vectors).


Accordingly, there are six product rules, two for gradients:

(i) ∇(fg) = f∇g + g∇f,

(ii) ∇(A · B) = A × (∇ × B) + B × (∇ × A) + (A · ∇)B + (B · ∇)A;
two for divergences:

(iii) ∇ · (fA) = f(∇ · A) + A · (∇f),


(iv) ∇ · (A × B) = B · (∇ × A) − A · (∇ × B);
and two for curls:
(v) ∇ × (fA) = f(∇ × A) − A × (∇f),
(vi) ∇ × (A × B) = (B · ∇)A − (A · ∇)B + A(∇ · B) − B(∇ · A).


You will be using these product rules so frequently that I have put them 
inside the front cover for easy reference. The proofs come straight from the 
product rule for ordinary derivatives. For instance, 


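$$ \begin{aligned} \nabla\cdot(f\mathbf{A}) &= \frac{\partial}{\partial x}(fA_x) + \frac{\partial}{\partial y}(fA_y) + \frac{\partial}{\partial z}(fA_z) \\ &= \left(\frac{\partial f}{\partial x}A_x + f\frac{\partial A_x}{\partial x}\right) + \left(\frac{\partial f}{\partial y}A_y + f\frac{\partial A_y}{\partial y}\right) + \left(\frac{\partial f}{\partial z}A_z + f\frac{\partial A_z}{\partial z}\right) \\ &= (\nabla f)\cdot\mathbf{A} + f(\nabla\cdot\mathbf{A}). \end{aligned} $$
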


It is also possible to formulate three quotient rules: 


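$$ \nabla\left(\frac{f}{g}\right) = \frac{g\,\nabla f - f\,\nabla g}{g^2}, $$

$$ \nabla\cdot\left(\frac{\mathbf{A}}{g}\right) = \frac{g\,(\nabla\cdot\mathbf{A}) - \mathbf{A}\cdot(\nabla g)}{g^2}, $$

$$ \nabla\times\left(\frac{\mathbf{A}}{g}\right) = \frac{g\,(\nabla\times\mathbf{A}) + \mathbf{A}\times(\nabla g)}{g^2}. $$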


However, since these can be obtained quickly from the corresponding 
product rules, there is no point in listing them separately. 
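
A product rule can be spot-checked numerically before attempting a proof. The sketch below tests rule (iii), ∇ · (fA) = f(∇ · A) + A · (∇f), at a single point using central differences. The particular functions `f` and `A`, the helper names, and the sample point are arbitrary assumptions chosen for illustration; agreement at one point is a sanity check, not a proof.

```python
def partial(g, p, j, h=1e-5):
    """Central-difference estimate of the partial derivative of scalar g along axis j."""
    up, down = list(p), list(p)
    up[j] += h
    down[j] -= h
    return (g(*up) - g(*down)) / (2.0 * h)

def component(v, i):
    """Scalar function returning the i-th component of the vector function v."""
    return lambda x, y, z: v(x, y, z)[i]

def divergence(v, p):
    return sum(partial(component(v, i), p, i) for i in range(3))

def f(x, y, z):
    return x * y + z * z                 # arbitrary scalar function

def A(x, y, z):
    return (x * x, y * z, x * z)         # arbitrary vector function

def fA(x, y, z):
    return tuple(f(x, y, z) * c for c in A(x, y, z))

p = (0.7, -0.4, 1.3)                     # arbitrary sample point
lhs = divergence(fA, p)                  # div(fA)
grad_f = tuple(partial(f, p, j) for j in range(3))
rhs = f(*p) * divergence(A, p) + sum(a * g for a, g in zip(A(*p), grad_f))
print(lhs, rhs)
```

For these particular choices both sides work out to 6.6 at the chosen point, up to finite-difference error.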


Problem 1.21. Prove product rules (i), (iv), and (v). 


Problem 1.22. 


(a) If A and B are two vector functions, what does the expression (A · ∇)B mean? (That is,
what are its x, y, and z components, in terms of the Cartesian components of A, B, and
∇?)

(b) Compute (r̂ · ∇)r̂, where r̂ is the unit vector defined in Eq. 1.21.

(c) For the functions in Prob. 1.15, evaluate $(\mathbf{v}_a\cdot\nabla)\mathbf{v}_b$.


Problem 1.23. (For masochists only.) Prove product rules (ii) and (vi). Refer to Prob. 1.22 for
the definition of (A · ∇)B.


Problem 1.24. Derive the three quotient rules. 
Problem 1.25.
(a) Check product rule (iv) (by calculating each term separately) for the functions

$$ \mathbf{A} = x\,\hat{\mathbf{x}} + 2y\,\hat{\mathbf{y}} + 3z\,\hat{\mathbf{z}}; \qquad \mathbf{B} = 3y\,\hat{\mathbf{x}} - 2x\,\hat{\mathbf{y}}. $$

(b) Do the same for product rule (ii).
(c) Do the same for rule (vi).


1.2.7 Second Derivatives 


Gradient, divergence, and curl are the only first derivatives we can make
with ∇; by applying ∇ twice, we can construct five species of second
derivatives. The gradient ∇T is a vector, so we can take the divergence and
curl of it:


(a) Divergence of gradient: ∇ · (∇T).
(b) Curl of gradient: ∇ × (∇T).

The divergence ∇ · v is a scalar; all we can do is take its gradient:
(c) Gradient of divergence: ∇(∇ · v).

The curl ∇ × v is a vector, so we can take its divergence and curl:
(d) Divergence of curl: ∇ · (∇ × v).
(e) Curl of curl: ∇ × (∇ × v).


This exhausts the possibilities, and in fact not all of them give us 
anything new. Let’s consider them one at a time: 


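(1)

$$ \begin{aligned} \nabla\cdot(\nabla T) &= \left(\hat{\mathbf{x}}\frac{\partial}{\partial x} + \hat{\mathbf{y}}\frac{\partial}{\partial y} + \hat{\mathbf{z}}\frac{\partial}{\partial z}\right)\cdot\left(\frac{\partial T}{\partial x}\,\hat{\mathbf{x}} + \frac{\partial T}{\partial y}\,\hat{\mathbf{y}} + \frac{\partial T}{\partial z}\,\hat{\mathbf{z}}\right) \\ &= \frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2} + \frac{\partial^2 T}{\partial z^2}. \end{aligned} \tag{1.42} $$

This object, which we write as ∇²T for short, is called the Laplacian
of T; we shall be studying it in great detail later on. Notice that the
Laplacian of a scalar T is a scalar. Occasionally, we shall speak of the
Laplacian of a vector, ∇²v. By this we mean a vector quantity whose
x-component is the Laplacian of $v_x$, and so on:⁸

$$ \nabla^2\mathbf{v} \equiv (\nabla^2 v_x)\,\hat{\mathbf{x}} + (\nabla^2 v_y)\,\hat{\mathbf{y}} + (\nabla^2 v_z)\,\hat{\mathbf{z}}. \tag{1.43} $$

This is nothing more than a convenient extension of the meaning of ∇².

(2) The curl of a gradient is always zero:

$$ \nabla\times(\nabla T) = \mathbf{0}. \tag{1.44} $$

This is an important fact, which we shall use repeatedly; you can easily
prove it from the definition of ∇, Eq. 1.39. Beware: You might think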


8 In curvilinear coordinates, where the unit vectors themselves depend on position, they too 
must be differentiated (see Section 1.4.1). 
Eq. 1.44 is “obviously” true: isn’t it just (∇ × ∇)T, and isn’t the cross
product of any vector (in this case, ∇) with itself always zero? This
reasoning is suggestive, but not quite conclusive, since ∇ is an operator
and does not “multiply” in the usual way. The proof of Eq. 1.44, in fact,
hinges on the equality of cross derivatives:

$$ \frac{\partial}{\partial x}\left(\frac{\partial T}{\partial y}\right) = \frac{\partial}{\partial y}\left(\frac{\partial T}{\partial x}\right). \tag{1.45} $$

If you think I’m being fussy, test your intuition on this one:

$$ (\nabla T)\times(\nabla S). $$

Is that always zero? (It would be, of course, if you replaced the ∇’s by an
ordinary vector.)

(3) ∇(∇ · v) seldom occurs in physical applications, and it has not been given
any special name of its own; it’s just the gradient of the divergence.
Notice that ∇(∇ · v) is not the same as the Laplacian of a vector: ∇²v =
(∇ · ∇)v ≠ ∇(∇ · v).

(4) The divergence of a curl, like the curl of a gradient, is always zero:

$$ \nabla\cdot(\nabla\times\mathbf{v}) = 0. \tag{1.46} $$

You can prove this for yourself. (Again, there is a fraudulent shortcut
proof, using the vector identity A · (B × C) = (A × B) · C.)
(5) As you can check from the definition of ∇,

$$ \nabla\times(\nabla\times\mathbf{v}) = \nabla(\nabla\cdot\mathbf{v}) - \nabla^2\mathbf{v}. \tag{1.47} $$


So curl-of-curl gives nothing new; the first term is just number (3), and the 
second is the Laplacian (of a vector). (In fact, Eq. 1.47 is often used to define 
the Laplacian of a vector, in preference to Eq. 1.43, which makes explicit 
reference to Cartesian coordinates.) 
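
As a worked illustration of how the equality of cross derivatives (Eq. 1.45) does the work in Eq. 1.44, here is the z component of the curl of a gradient, obtained by putting v = ∇T (so that $v_x = \partial T/\partial x$ and $v_y = \partial T/\partial y$) into the z component of Eq. 1.41. This assumes T is smooth enough for its mixed second derivatives to be equal:

$$ \left[\nabla\times(\nabla T)\right]_z = \frac{\partial}{\partial x}\left(\frac{\partial T}{\partial y}\right) - \frac{\partial}{\partial y}\left(\frac{\partial T}{\partial x}\right) = 0. $$

The x and y components vanish in the same way, with the pairs (y, z) and (z, x) playing the role of (x, y).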

Really, then, there are just two kinds of second derivatives: the Lapla- 
cian (which is of fundamental importance) and the gradient-of-divergence 
(which we seldom encounter). I could go through a similar exercise to work 
out third derivatives, but fortunately second derivatives suffice for most 
physical applications. 

A final word on vector differential calculus: It all flows from the operator
∇, and from taking seriously its vectorial character. Even if you remembered
only the definition of ∇, you could easily reconstruct all the rest.


Problem 1.26. Calculate the Laplacian of the following functions: 


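(a) $T_a = x^2 + 2xy + 3z + 4$.
(b) $T_b = \sin x \sin y \sin z$.
(c) $T_c = e^{-5x}\sin 4y\cos 3z$.
(d) $\mathbf{v} = x^2\,\hat{\mathbf{x}} + 3xz^2\,\hat{\mathbf{y}} - 2xz\,\hat{\mathbf{z}}$.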
Problem 1.27. Prove that the divergence of a curl is always zero. Check it for function $\mathbf{v}_a$ in
Prob. 1.15.


Problem 1.28. Prove that the curl of a gradient is always zero. Check it for function (b) in 
Prob. 1.11. 


1.3 Integral Calculus 


1.3.1 Line, Surface, and Volume Integrals 


In electrodynamics, we encounter several different kinds of integrals, 
among which the most important are line (or path) integrals, surface 
integrals (or flux), and volume integrals. 

(a) Line Integrals. A line integral is an expression of the form 


$$ \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l}, \tag{1.48} $$

where v is a vector function, dl is the infinitesimal displacement vector
(Eq. 1.22), and the integral is to be carried out along a prescribed path from
point a to point b (Fig. 1.20). If the path in question forms a closed loop
(that is, if b = a), I shall put a circle on the integral sign:

$$ \oint \mathbf{v}\cdot d\mathbf{l}. \tag{1.49} $$

At each point on the path, we take the dot product of v (evaluated at
that point) with the displacement dl to the next point along the path. To
a physicist, the most familiar example of a line integral is the work done by
a force F: $W = \int \mathbf{F}\cdot d\mathbf{l}$.

Ordinarily, the value of a line integral depends critically on the path 
taken from a to b, but there is an important special class of vector functions 
for which the line integral is independent of path and is determined entirely 
by the end points. It will be our business in due course to characterize 
this special class of vectors. (A force that has this property is said to be 
conservative.) 


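Example 1.6. Calculate the line integral of the function $\mathbf{v} = y^2\,\hat{\mathbf{x}} + 2x(y + 1)\,\hat{\mathbf{y}}$
from the point a = (1, 1, 0) to the point b = (2, 2, 0), along
the paths (1) and (2) in Fig. 1.21. What is $\oint \mathbf{v}\cdot d\mathbf{l}$ for the loop that goes
from a to b along (1) and returns to a along (2)?


Solution. As always, $d\mathbf{l} = dx\,\hat{\mathbf{x}} + dy\,\hat{\mathbf{y}} + dz\,\hat{\mathbf{z}}$. Path (1) consists of two
parts. Along the “horizontal” segment, dy = dz = 0, so

(i) $d\mathbf{l} = dx\,\hat{\mathbf{x}}$, y = 1, $\mathbf{v}\cdot d\mathbf{l} = y^2\,dx = dx$, so $\int \mathbf{v}\cdot d\mathbf{l} = \int_1^2 dx = 1$.

On the “vertical” stretch, dx = dz = 0, so

(ii) $d\mathbf{l} = dy\,\hat{\mathbf{y}}$, x = 2, $\mathbf{v}\cdot d\mathbf{l} = 2x(y + 1)\,dy = 4(y + 1)\,dy$, so

$$ \int \mathbf{v}\cdot d\mathbf{l} = 4\int_1^2 (y + 1)\,dy = 10. $$

By path (1), then,

$$ \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l} = 1 + 10 = 11. $$

Meanwhile, on path (2) x = y, dx = dy, and dz = 0, so $d\mathbf{l} = dx\,\hat{\mathbf{x}} + dx\,\hat{\mathbf{y}}$,
$\mathbf{v}\cdot d\mathbf{l} = x^2\,dx + 2x(x + 1)\,dx = (3x^2 + 2x)\,dx$, and

$$ \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{v}\cdot d\mathbf{l} = \int_1^2 (3x^2 + 2x)\,dx = \left.(x^3 + x^2)\right|_1^2 = 10. $$

(The strategy here is to get everything in terms of one variable; I could
just as well have eliminated x in favor of y.)
For the loop that goes out (1) and back (2), then,

$$ \oint \mathbf{v}\cdot d\mathbf{l} = 11 - 10 = 1. $$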
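
The path dependence found in Example 1.6 can be confirmed numerically. The sketch below approximates each line integral by chopping the path into many short straight steps and summing v · dl with v evaluated at the midpoint of each step. The parametrizations `path1` and `path2`, the helper `line_integral`, and the number of steps are assumptions made for this illustration.

```python
def v(x, y):
    """The vector function of Example 1.6 (x and y components; z plays no role here)."""
    return (y * y, 2.0 * x * (y + 1.0))

def line_integral(path, n=20000):
    """Midpoint-rule estimate of the line integral of v along path(t), 0 <= t <= 1."""
    total = 0.0
    for k in range(n):
        x0, y0 = path(k / n)
        x1, y1 = path((k + 1) / n)
        vx, vy = v(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += vx * (x1 - x0) + vy * (y1 - y0)     # v . dl for this step
    return total

def path1(t):
    # (1,1) -> (2,1) -> (2,2): the horizontal leg, then the vertical leg.
    if t < 0.5:
        return (1.0 + 2.0 * t, 1.0)
    return (2.0, 1.0 + 2.0 * (t - 0.5))

def path2(t):
    # The straight diagonal x = y from (1,1) to (2,2).
    return (1.0 + t, 1.0 + t)

along_1 = line_integral(path1)
along_2 = line_integral(path2)
loop = along_1 - along_2       # out along (1), back along (2)
print(along_1, along_2, loop)
```

The estimates reproduce the values 11, 10, and 1 found above, to within the accuracy of the midpoint rule. An even number of steps is used so that the corner of path (1) falls exactly on a step boundary.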


(b) Surface Integrals. A surface integral is an expression of the form

$$ \int_{\mathcal{S}} \mathbf{v}\cdot d\mathbf{a}, \tag{1.50} $$

where v is again some vector function, and the integral is over a specified
surface S. Here da is an infinitesimal patch of area, with direction
perpendicular to the surface (Fig. 1.22). There are, of course, two directions
perpendicular to any surface, so the sign of a surface integral is intrinsically
ambiguous. If the surface is closed (forming a “balloon”), in which case I
shall again put a circle on the integral sign

$$ \oint \mathbf{v}\cdot d\mathbf{a}, $$

then tradition dictates that “outward” is positive, but for open surfaces it’s
arbitrary. If v describes the flow of a fluid (mass per unit area per unit time),
then $\int \mathbf{v}\cdot d\mathbf{a}$ represents the total mass per unit time passing through the
surface (hence the alternative name, “flux”).

Ordinarily, the value of a surface integral depends on the particular 
surface chosen, but there is a special class of vector functions for which 
it is independent of the surface and is determined entirely by the boundary 
line. An important task for us will be to characterize this special class of 
functions. 


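Example 1.7. Calculate the surface integral of $\mathbf{v} = 2xz\,\hat{\mathbf{x}} + (x + 2)\,\hat{\mathbf{y}} + y(z^2 - 3)\,\hat{\mathbf{z}}$
over five sides (excluding the bottom) of the cubical box
(side length 2) in Fig. 1.23. Let “upward and outward” be the positive
direction, as indicated by the arrows.


Solution. Take the sides one at a time:

(i) x = 2, $d\mathbf{a} = dy\,dz\,\hat{\mathbf{x}}$, $\mathbf{v}\cdot d\mathbf{a} = 2xz\,dy\,dz = 4z\,dy\,dz$, so

$$ \int \mathbf{v}\cdot d\mathbf{a} = 4\int_0^2 dy \int_0^2 z\,dz = 16. $$

(ii) x = 0, $d\mathbf{a} = -dy\,dz\,\hat{\mathbf{x}}$, $\mathbf{v}\cdot d\mathbf{a} = -2xz\,dy\,dz = 0$, so

$$ \int \mathbf{v}\cdot d\mathbf{a} = 0. $$

(iii) y = 2, $d\mathbf{a} = dx\,dz\,\hat{\mathbf{y}}$, $\mathbf{v}\cdot d\mathbf{a} = (x + 2)\,dx\,dz$, so

$$ \int \mathbf{v}\cdot d\mathbf{a} = \int_0^2 (x + 2)\,dx \int_0^2 dz = 12. $$

(iv) y = 0, $d\mathbf{a} = -dx\,dz\,\hat{\mathbf{y}}$, $\mathbf{v}\cdot d\mathbf{a} = -(x + 2)\,dx\,dz$, so

$$ \int \mathbf{v}\cdot d\mathbf{a} = -\int_0^2 (x + 2)\,dx \int_0^2 dz = -12. $$

(v) z = 2, $d\mathbf{a} = dx\,dy\,\hat{\mathbf{z}}$, $\mathbf{v}\cdot d\mathbf{a} = y(z^2 - 3)\,dx\,dy = y\,dx\,dy$, so

$$ \int \mathbf{v}\cdot d\mathbf{a} = \int_0^2 dx \int_0^2 y\,dy = 4. $$

The total flux is

$$ \int_{\text{surface}} \mathbf{v}\cdot d\mathbf{a} = 16 + 0 + 12 - 12 + 4 = 20. $$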


(c) Volume Integrals. A volume integral is an expression of the form 


$$ \int_{\mathcal{V}} T\,d\tau, \tag{1.51} $$

where T is a scalar function and dτ is an infinitesimal volume element. In
Cartesian coordinates,

$$ d\tau = dx\,dy\,dz. \tag{1.52} $$

For example, if T is the density of a substance (which might vary from point
to point), then the volume integral would give the total mass. Occasionally,
we shall encounter volume integrals of vector functions:

$$ \int \mathbf{v}\,d\tau = \int (v_x\,\hat{\mathbf{x}} + v_y\,\hat{\mathbf{y}} + v_z\,\hat{\mathbf{z}})\,d\tau = \hat{\mathbf{x}}\int v_x\,d\tau + \hat{\mathbf{y}}\int v_y\,d\tau + \hat{\mathbf{z}}\int v_z\,d\tau; \tag{1.53} $$

because the unit vectors (x̂, ŷ, and ẑ) are constants, they come outside the
integral.


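Example 1.8. Calculate the volume integral of T = xyz² over the
prism in Fig. 1.24.


Solution. You can do the three integrals in any order. Let’s do x first:
it runs from 0 to (1 − y), then y (it goes from 0 to 1), and finally
z (0 to 3):

$$ \begin{aligned} \int T\,d\tau &= \int_0^3 z^2 \left\{ \int_0^1 y \left[ \int_0^{1-y} x\,dx \right] dy \right\} dz \\ &= \frac{1}{2}\int_0^3 z^2\,dz \int_0^1 (1 - y)^2\,y\,dy = \frac{1}{2}\,(9)\left(\frac{1}{12}\right) = \frac{3}{8}. \end{aligned} $$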
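
The value 3/8 can be cross-checked with a brute-force sum over small boxes, using the same limits as the solution (x from 0 to 1 − y, y from 0 to 1, z from 0 to 3). The function name `prism_integral` and the grid sizes below are assumptions chosen for illustration, and the result is an approximation whose accuracy depends on the grid.

```python
def prism_integral(nx=20, ny=100, nz=100):
    """Midpoint-rule estimate of the integral of T = x*y*z**2 over the prism."""
    total = 0.0
    dy = 1.0 / ny
    dz = 3.0 / nz
    for k in range(nz):
        z = (k + 0.5) * dz
        for j in range(ny):
            y = (j + 0.5) * dy
            dx = (1.0 - y) / nx            # x runs from 0 to 1 - y at this y
            for i in range(nx):
                x = (i + 0.5) * dx
                total += x * y * z * z * dx * dy * dz
    return total

approx = prism_integral()
print(approx, 3.0 / 8.0)
```

Note that the width of the x steps depends on y, which is how the slanted face of the prism enters the sum.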
Problem 1.29. Calculate the line integral of the function $\mathbf{v} = x^2\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + y^2\,\hat{\mathbf{z}}$ from the
origin to the point (1,1,1) by three different routes:


(a) (0,0,0) → (1,0,0) → (1,1,0) → (1,1,1).

(b) (0,0,0) → (0,0,1) → (0,1,1) → (1,1,1).

(c) The direct straight line.

(d) What is the line integral around the closed loop that goes out along path (a) and back along
path (b)?



Problem 1.30. Calculate the surface integral of the function in Ex. 1.7, over the bottom of the 
box. For consistency, let “upward” be the positive direction. Does the surface integral depend 
only on the boundary line for this function? What is the total flux over the closed surface of the 
box (including the bottom)? [Note: For the closed surface, the positive direction is “outward,” and 
hence “down,” for the bottom face.] 


Problem 1.31. Calculate the volume integral of the function T = z² over the tetrahedron
with corners at (0, 0,0), (1, 0, 0), (0, 1, 0), and (0, 0, 1). 


1.3.2 The Fundamental Theorem of Calculus 


Suppose f(x) is a function of one variable. The fundamental theorem of 
calculus says that 


$$\int_a^b \left( \frac{df}{dx} \right) dx = f(b) - f(a). \tag{1.54}$$


In case this doesn’t look familiar, I’ll write it another way:


$$\int_a^b F(x) \, dx = f(b) - f(a),$$


where df/dx = F(x). The fundamental theorem tells you how to integrate 
F(x): you think up a function f(x) whose derivative is equal to F. 
Geometrical Interpretation: According to Eq. 1.33, df = (df/dx) dx is the 
infinitesimal change in f when you go from (x) to (x+dx). The fundamental 
theorem (Eq. 1.54) says that if you chop the interval from a to b (Fig. 1.25) 
into many tiny pieces, dx, and add up the increments df from each little 
piece, the result is (not surprisingly) equal to the total change in f: f(b) −
f(a). In other words, there are two ways to determine the total change in
the function: either subtract the values at the ends or go step by step, adding 
up all the tiny increments as you go. You'll get the same answer either way. 
Notice the structure of the fundamental theorem: the integral of a 
derivative over some region is given by the value of the function at the end 
points (boundaries). In vector calculus there are three species of derivative 
(gradient, divergence, and curl), and each has its own “fundamental theorem,”
with essentially the same format. I don’t plan to prove these theorems
here; rather, I will explain what they mean, and try to make them plausible.
Proofs are given in Appendix A.
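
The "two ways to determine the total change" can be tried out numerically. Here is a minimal sketch in Python, assuming (purely for illustration) the function f(x) = sin x on the interval from 0 to 2; the helper name `sum_of_increments` is made up for this example.

```python
import math

def sum_of_increments(dfdx, a, b, n=100_000):
    """Add up df = (df/dx) dx over n tiny pieces of the interval from a to b."""
    dx = (b - a) / n
    return sum(dfdx(a + (i + 0.5) * dx) * dx for i in range(n))

# Illustrative choice (not from the text): f(x) = sin x, so df/dx = cos x.
step_by_step = sum_of_increments(math.cos, 0.0, 2.0)   # go step by step
end_points = math.sin(2.0) - math.sin(0.0)             # subtract the end values
print(step_by_step, end_points)
```

The two printed numbers should agree to many decimal places, which is all the fundamental theorem claims.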


1.3.3 The Fundamental Theorem for Gradients 


Suppose we have a scalar function of three variables T(x, y, z). Starting at
point a, we move a small distance dl₁ (Fig. 1.26). According to Eq. 1.37,
the function T will change by an amount


$$dT = (\nabla T) \cdot d\mathbf{l}_1.$$


Now we move a little farther, by an additional small displacement dl₂; the
incremental change in T will be (∇T) · dl₂. In this manner, proceeding by
infinitesimal steps, we make the journey to point b. At each step we compute
the gradient of T (at that point) and dot it into the displacement dl ... this
gives us the change in T. Evidently, the total change in T in going from a
to b (along the path selected) is


$$\int_{\mathbf{a}}^{\mathbf{b}} (\nabla T) \cdot d\mathbf{l} = T(\mathbf{b}) - T(\mathbf{a}). \tag{1.55}$$


This is the fundamental theorem for gradients; like the “ordinary” funda- 
mental theorem, it says that the integral (here a line integral) of a derivative 
(here the gradient) is given by the value of the function at the boundaries 
(a and b). 

Geometrical Interpretation: Suppose you wanted to determine the height 
of the Eiffel Tower. You could climb the stairs, using a ruler to measure 
the rise at each step, and adding them all up (that’s the left-hand side of 
Eq. 1.55), or you could place altimeters at the top and the bottom, and 
subtract the two readings (that’s the right-hand side); you should get the 
same answer either way (that’s the fundamental theorem). 

Incidentally, as we found in Ex. 1.6, line integrals ordinarily depend on 
the path taken from a to b. But the right-hand side of Eq. 1.55 makes no 
reference to the path — only to the end points! Evidently, gradients have the 
special property that their line integrals are path independent: 
Corollary 1. The integral $\int_{\mathbf{a}}^{\mathbf{b}} (\nabla T) \cdot d\mathbf{l}$ is independent of the path
taken from a to b.


Corollary 2. The integral $\oint (\nabla T) \cdot d\mathbf{l} = 0$, since the beginning and
end points are identical, and hence T(b) − T(a) = 0.


Example 1.9. Let T = xy², and take point a to be the origin (0, 0, 0)
and b the point (2, 1, 0). Check the fundamental theorem for gradients.


Solution. Although the integral is independent of path, we must pick
a specific path in order to evaluate it. Let’s go out along the x-axis (step
i) and then up (step ii) (Fig. 1.27). As always, dl = dx x̂ + dy ŷ + dz ẑ;
∇T = y² x̂ + 2xy ŷ.


(i) y = 0; dl = dx x̂, ∇T · dl = y² dx = 0, so

$$\int_{\rm i} \nabla T \cdot d\mathbf{l} = 0.$$


(ii) x = 2; dl = dy ŷ, ∇T · dl = 2xy dy = 4y dy, so

$$\int_{\rm ii} \nabla T \cdot d\mathbf{l} = \int_0^1 4y \, dy = 2y^2 \Big|_0^1 = 2.$$


The total line integral is 2. Is this consistent with the fundamental
theorem? Yes: T(b) − T(a) = 2 − 0 = 2.

Now, just to convince you that the answer is independent of path,
let me calculate the same integral along path (iii) (the straight line from
a to b):


(iii) y = ½x, dy = ½ dx, ∇T · dl = y² dx + 2xy dy = ¾x² dx, so

$$\int_{\rm iii} \nabla T \cdot d\mathbf{l} = \int_0^2 \tfrac{3}{4} x^2 \, dx = \tfrac{1}{4} x^3 \Big|_0^2 = 2. \quad \checkmark$$
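
The path independence in Example 1.9 is easy to probe numerically. The sketch below, in Python, integrates ∇T · dl for T = xy² along the two routes used above; the parametrizations `two_legs` and `straight`, and the midpoint stepping, are my own illustrative choices rather than anything prescribed by the text.

```python
def grad_T(x, y):
    """Gradient of T = x*y**2 (x and y components; T does not depend on z)."""
    return (y**2, 2 * x * y)

def line_integral(path, n=20_000):
    """Integrate grad(T) . dl along path(t), 0 <= t <= 1, using midpoint steps."""
    total = 0.0
    for i in range(n):
        x0, y0 = path(i / n)
        x1, y1 = path((i + 1) / n)
        gx, gy = grad_T(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += gx * (x1 - x0) + gy * (y1 - y0)
    return total

def two_legs(t):
    """Steps (i) and (ii): along the x-axis to (2, 0), then up to (2, 1)."""
    return (4 * t, 0.0) if t <= 0.5 else (2.0, 2 * t - 1)

def straight(t):
    """Step (iii): the straight line y = x/2 from (0, 0) to (2, 1)."""
    return (2 * t, t)

print(line_integral(two_legs), line_integral(straight))   # both should be close to 2
```

Any other path from (0, 0) to (2, 1) could be swapped in for `straight`; the result should not change.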


Problem 1.32. Check the fundamental theorem for gradients, using T = x² + 4xy + 2yz³,
the points a = (0, 0, 0), b = (1, 1, 1), and the three paths in Fig. 1.28:


(a) (0,0,0) → (1,0,0) → (1,1,0) → (1,1,1);
(b) (0,0,0) → (0,0,1) → (0,1,1) → (1,1,1);
(c) the parabolic path z = x², y = x.
1.3.4 The Fundamental Theorem for Divergences


The fundamental theorem for divergences states that


$$\int_V (\nabla \cdot \mathbf{v}) \, d\tau = \oint_S \mathbf{v} \cdot d\mathbf{a}. \tag{1.56}$$


In honor, I suppose, of its great importance, this theorem has at least three 
different names: Gauss’s theorem, Green’s theorem, or simply the divergence 
theorem. Like the other “fundamental theorems,” it says that the integral of 
a derivative (in this case the divergence) over a region (in this case a volume, V) 
is equal to the value of the function at the boundary (in this case the surface S 
that bounds the volume). Notice that the boundary term is itself an integral 
(specifically, a surface integral). This is reasonable: the “boundary” of a line 
is just two end points, but the boundary of a volume is a (closed) surface. 

Geometrical Interpretation: If v represents the flow of an incompressible 
fluid, then the flux of v (the right-hand side of Eq. 1.56) is the total amount 
of stuff passing out through the surface, per unit time. Now, the divergence 
measures the “spreading out” of the vectors from a point — a place of 
high divergence is like a “faucet,” pouring out liquid. If we have a bunch 
of faucets in a region completely filled with incompressible fluid, an equal 
amount of liquid will be forced out through the boundaries of the region. 
In fact, there are two ways we could determine how much is being produced: 
(a) we could count up all the faucets, recording how much each puts out, or 
(b) we could go around the boundary, measuring the outflow at each point, 
and add it all up. You get the same answer either way: 


$$\int (\text{faucets within the volume}) = \oint (\text{flow out through the surface}).$$


This, in essence, is what the divergence theorem says. 


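Example 1.10. Check the divergence theorem using the function


$$\mathbf{v} = y^2 \, \hat{\mathbf{x}} + (2xy + z^2) \, \hat{\mathbf{y}} + (2yz) \, \hat{\mathbf{z}}$$


and a unit cube at the origin (Fig. 1.29).


Solution. In this case,


$$\nabla \cdot \mathbf{v} = 2(x + y),$$

and

$$\int_V 2(x + y) \, d\tau = 2 \int_0^1 \int_0^1 \int_0^1 (x + y) \, dx \, dy \, dz,$$

$$\int_0^1 (x + y) \, dx = \tfrac{1}{2} + y, \qquad \int_0^1 \left( \tfrac{1}{2} + y \right) dy = 1, \qquad \int_0^1 1 \, dz = 1.$$


Thus,

$$\int_V \nabla \cdot \mathbf{v} \, d\tau = 2.$$


So much for the left-hand side of the divergence theorem. To
evaluate the surface integral we must consider separately the six faces
of the cube:


(i) $\int \mathbf{v} \cdot d\mathbf{a} = \int_0^1 \int_0^1 y^2 \, dy \, dz = \tfrac{1}{3}.$

(ii) $\int \mathbf{v} \cdot d\mathbf{a} = -\int_0^1 \int_0^1 y^2 \, dy \, dz = -\tfrac{1}{3}.$

(iii) $\int \mathbf{v} \cdot d\mathbf{a} = \int_0^1 \int_0^1 (2x + z^2) \, dx \, dz = \tfrac{4}{3}.$

(iv) $\int \mathbf{v} \cdot d\mathbf{a} = -\int_0^1 \int_0^1 z^2 \, dx \, dz = -\tfrac{1}{3}.$

(v) $\int \mathbf{v} \cdot d\mathbf{a} = \int_0^1 \int_0^1 2y \, dx \, dy = 1.$

(vi) $\int \mathbf{v} \cdot d\mathbf{a} = -\int_0^1 \int_0^1 0 \, dx \, dy = 0.$


So the total flux is


$$\oint_S \mathbf{v} \cdot d\mathbf{a} = \tfrac{1}{3} - \tfrac{1}{3} + \tfrac{4}{3} - \tfrac{1}{3} + 1 + 0 = 2,$$


as expected.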
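
Both sides of the divergence theorem in Example 1.10 can also be estimated by brute force. This is a rough Python sketch under my own assumptions (midpoint sums on a uniform grid; the names `volume_side` and `surface_side` are invented for the illustration), not a general-purpose integrator.

```python
def v(x, y, z):
    """The field of Example 1.10."""
    return (y**2, 2 * x * y + z**2, 2 * y * z)

def div_v(x, y, z):
    """Its divergence, as worked out in the text."""
    return 2 * (x + y)

def volume_side(n=40):
    """Integral of div v over the unit cube (midpoint rule)."""
    h = 1.0 / n
    pts = [(i + 0.5) * h for i in range(n)]
    return sum(div_v(x, y, z) for x in pts for y in pts for z in pts) * h**3

def surface_side(n=200):
    """Outward flux of v through the six faces of the unit cube."""
    h = 1.0 / n
    pts = [(i + 0.5) * h for i in range(n)]
    flux = 0.0
    for a in pts:
        for b in pts:
            flux += v(1, a, b)[0] - v(0, a, b)[0]   # faces x = 1 and x = 0
            flux += v(a, 1, b)[1] - v(a, 0, b)[1]   # faces y = 1 and y = 0
            flux += v(a, b, 1)[2] - v(a, b, 0)[2]   # faces z = 1 and z = 0
    return flux * h**2

print(volume_side(), surface_side())   # both should be close to 2
```

The minus signs on the x = 0, y = 0, and z = 0 faces encode the outward normal, exactly as in steps (ii), (iv), and (vi).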
Problem 1.33. Test the divergence theorem for the function v = (xy) x̂ + (2yz) ŷ + (3zx) ẑ.
Take as your volume the cube shown in Fig. 1.30, with sides of length 2.


Fig. 1.30


1.3.5 The Fundamental Theorem for Curls 


The fundamental theorem for curls, which goes by the special name of 
Stokes’ theorem, states that 


$$\int_S (\nabla \times \mathbf{v}) \cdot d\mathbf{a} = \oint_P \mathbf{v} \cdot d\mathbf{l}. \tag{1.57}$$


As always, the integral of a derivative (here, the curl) over a region (here, a 
patch of surface, S) is equal to the value of the function at the boundary 
(here, the perimeter of the patch, P). As in the case of the divergence 
theorem, the boundary term is itself an integral — specifically, a closed line 
integral. 

Geometrical Interpretation: Recall that the curl measures the “twist” of 
the vectors v; a region of high curl is a whirlpool — if you put a tiny paddle 
wheel there, it will rotate. Now, the integral of the curl over some surface 
(or, more precisely, the flux of the curl through that surface) represents the 
“total amount of swirl,” and we can determine that just as well by going 
around the edge and finding how much the flow is following the boundary 
(Fig. 1.31). Indeed, ∮ v · dl is sometimes called the circulation of v.

You may have noticed an apparent ambiguity in Stokes’ theorem: con- 
cerning the boundary line integral, which way are we supposed to go around 
(clockwise or counterclockwise)? If we go the “wrong” way, we’ll pick up 
an overall sign error. The answer is that it doesn’t matter which way you go 
as long as you are consistent, for there is a compensating sign ambiguity in 
the surface integral: Which way does da point? For a closed surface (as in 
the divergence theorem), da points in the direction of the outward normal; 
but for an open surface, which way is “out”? Consistency in Stokes’ theorem 
(as in all such matters) is given by the right-hand rule: if your fingers point 
in the direction of the line integral, then your thumb fixes the direction of 
da (Fig. 1.32). 

Now, there are plenty of surfaces (infinitely many) that share any given 
boundary line. Twist a paper clip into a loop, and dip it in soapy water. The 
soap film constitutes a surface, with the wire loop as its boundary. If you 
blow on it, the soap film will expand, making a larger surface, with the same 
boundary. Ordinarily, a flux integral depends critically on what surface you 
integrate over, but evidently this is not the case with curls. For Stokes’ theorem
says that ∫ (∇ × v) · da is equal to the line integral of v around the boundary,
and the latter makes no reference to the specific surface you choose.


Corollary 1. The integral ∫ (∇ × v) · da depends only on the
boundary line, not on the particular surface used.


Corollary 2. The integral ∮ (∇ × v) · da = 0 for any closed surface,
since the boundary line, like the mouth of a balloon, shrinks down
to a point, and hence the right-hand side of Eq. 1.57 vanishes.


These corollaries are analogous to the ones for the gradient theorem. We 
will develop the parallel further in due course. 


Example 1.11. Suppose v = (2xz + 3y²) ŷ + (4yz²) ẑ. Check Stokes’
theorem for the square surface shown in Fig. 1.33.


Solution. Here,

$$\nabla \times \mathbf{v} = (4z^2 - 2x) \, \hat{\mathbf{x}} + 2z \, \hat{\mathbf{z}} \quad \text{and} \quad d\mathbf{a} = dy \, dz \, \hat{\mathbf{x}}.$$


(In saying that da points in the x direction, we are committing ourselves
to a counterclockwise line integral. We could as well write da =
−dy dz x̂, but then we would be obliged to go clockwise.) Since x = 0
for this surface,


$$\int (\nabla \times \mathbf{v}) \cdot d\mathbf{a} = \int_0^1 \int_0^1 4z^2 \, dy \, dz = \frac{4}{3}.$$


Now, what about the line integral? We must break this up into four
segments:


(i) x = 0, z = 0, v · dl = 3y² dy, $\int \mathbf{v} \cdot d\mathbf{l} = \int_0^1 3y^2 \, dy = 1$;

(ii) x = 0, y = 1, v · dl = 4z² dz, $\int \mathbf{v} \cdot d\mathbf{l} = \int_0^1 4z^2 \, dz = \tfrac{4}{3}$;

(iii) x = 0, z = 1, v · dl = 3y² dy, $\int \mathbf{v} \cdot d\mathbf{l} = \int_1^0 3y^2 \, dy = -1$;

(iv) x = 0, y = 0, v · dl = 0, $\int \mathbf{v} \cdot d\mathbf{l} = \int_1^0 0 \, dz = 0$.

So


$$\oint \mathbf{v} \cdot d\mathbf{l} = 1 + \frac{4}{3} - 1 + 0 = \frac{4}{3}.$$

It checks.


A point of strategy: notice how I handled step (iii). There is a
temptation to write dl = −dy ŷ here, since the path goes to the left.
You can get away with this, if you absolutely insist, by running the
integral from 0 → 1. But it is much safer to say dl = dx x̂ + dy ŷ + dz ẑ
always (never any minus signs) and let the limits of the integral take
care of the direction.
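
The same check can be run numerically. In the Python sketch below (an illustration under my own assumptions: midpoint sums, a square with corners listed counterclockwise as seen from the +x side), the direction of travel is carried entirely by the order of the corners, in the spirit of the point of strategy above. The function names are invented for the example.

```python
def v(x, y, z):
    """The field of Example 1.11: v = (2xz + 3y^2) y-hat + (4yz^2) z-hat."""
    return (0.0, 2 * x * z + 3 * y**2, 4 * y * z**2)

def curl_flux(n=400):
    """Flux of curl v through the square x = 0, 0 <= y, z <= 1, with da along +x.
    Uses the x-component quoted in the text, 4z^2 - 2x, evaluated at x = 0."""
    h = 1.0 / n
    mids = [(k + 0.5) * h for k in range(n)]
    return sum(4 * z**2 - 2 * 0.0 for y in mids for z in mids) * h * h

def circulation(n=400):
    """Line integral of v around the square, counterclockwise as seen from +x."""
    corners = [(0, 0, 0), (0, 1, 0), (0, 1, 1), (0, 0, 1), (0, 0, 0)]
    total = 0.0
    for p, q in zip(corners, corners[1:]):
        step = [(q[i] - p[i]) / n for i in range(3)]      # signed displacement dl
        for k in range(n):
            mid = [p[i] + (k + 0.5) * step[i] for i in range(3)]
            total += sum(c * d for c, d in zip(v(*mid), step))
    return total

print(curl_flux(), circulation())   # both should be close to 4/3
```

Reversing the list of corners flips the sign of `circulation()`, which is the sign ambiguity the right-hand rule resolves.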


Problem 1.34. Test Stokes’ theorem for the function v = (xy) x̂ + (2yz) ŷ + (3zx) ẑ, using
the triangular shaded area of Fig. 1.34.


Problem 1.35. Check Corollary 1 by using the same function and boundary line as in Ex. 1.11, 
but integrating over the five faces of the cube in Fig. 1.35. The back of the cube is open. 



Fig. 1.34 Fig. 1.35 


1.3.6 Integration by Parts 


The technique known (awkwardly) as integration by parts exploits the 
product rule for derivatives: 


$$\frac{d}{dx}(fg) = f \left( \frac{dg}{dx} \right) + g \left( \frac{df}{dx} \right).$$

Integrating both sides, and invoking the fundamental theorem,


$$\int_a^b \frac{d}{dx}(fg) \, dx = fg \Big|_a^b = \int_a^b f \left( \frac{dg}{dx} \right) dx + \int_a^b g \left( \frac{df}{dx} \right) dx,$$

or

$$\int_a^b f \left( \frac{dg}{dx} \right) dx = -\int_a^b g \left( \frac{df}{dx} \right) dx + fg \Big|_a^b. \tag{1.58}$$


That’s integration by parts. It applies to the situation in which you are called
upon to integrate the product of one function (f) and the derivative of
another (g); it says you can transfer the derivative from g to f, at the cost
of a minus sign and a boundary term.


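Example 1.12. Evaluate the integral


$$\int_0^\infty x e^{-x} \, dx.$$


Solution. The exponential can be expressed as a derivative:


$$e^{-x} = \frac{d}{dx} \left( -e^{-x} \right).$$


In this case, then, f(x) = x, g(x) = −e⁻ˣ, and df/dx = 1, so

$$\int_0^\infty x e^{-x} \, dx = \int_0^\infty e^{-x} \, dx - x e^{-x} \Big|_0^\infty = -e^{-x} \Big|_0^\infty = 1.$$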
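
To see Eq. 1.58 at work with actual numbers, here is a small Python sketch of Example 1.12. It is only a sanity check under stated assumptions: the upper limit ∞ is replaced by a large cutoff L = 40 (where e⁻ˣ is already negligible), and the helper `midpoint` is a hypothetical name for a plain midpoint-rule integrator.

```python
import math

def midpoint(func, a, b, n=200_000):
    """Plain midpoint-rule integral of func from a to b."""
    h = (b - a) / n
    return sum(func(a + (i + 0.5) * h) for i in range(n)) * h

L = 40.0   # stand-in for infinity; exp(-40) is about 4e-18

def f(x):  return x
def g(x):  return -math.exp(-x)
def dfdx(x):  return 1.0
def dgdx(x):  return math.exp(-x)

# Left side of Eq. 1.58: integral of f (dg/dx)
lhs = midpoint(lambda x: f(x) * dgdx(x), 0.0, L)
# Right side: minus the integral of g (df/dx), plus the boundary term f g
rhs = -midpoint(lambda x: g(x) * dfdx(x), 0.0, L) + (f(L) * g(L) - f(0.0) * g(0.0))
print(lhs, rhs)   # both should be close to 1
```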


We can exploit the product rules of vector calculus, together with the 
appropriate fundamental theorems, in exactly the same way. For example, 
integrating 


$$\nabla \cdot (f \mathbf{A}) = f (\nabla \cdot \mathbf{A}) + \mathbf{A} \cdot (\nabla f)$$


over a volume, and invoking the divergence theorem, yields

$$\int_V \nabla \cdot (f \mathbf{A}) \, d\tau = \int_V f (\nabla \cdot \mathbf{A}) \, d\tau + \int_V \mathbf{A} \cdot (\nabla f) \, d\tau = \oint_S f \mathbf{A} \cdot d\mathbf{a},$$

or

$$\int_V f (\nabla \cdot \mathbf{A}) \, d\tau = -\int_V \mathbf{A} \cdot (\nabla f) \, d\tau + \oint_S f \mathbf{A} \cdot d\mathbf{a}. \tag{1.59}$$


Here again the integrand is the product of one function (f) and the
derivative (in this case the divergence) of another (A), and integration by
parts allows us to transfer the derivative from A to f (where it becomes a
gradient), at the cost of a minus sign and a boundary term (in this case a
surface integral).

You might wonder how often one is likely to encounter an integral
involving the product of one function and the derivative of another.
The answer is... surprisingly often! And integration by parts turns out to
be one of the most powerful tools in vector calculus.
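Problem 1.36.


(a) Show that

$$\int_S f (\nabla \times \mathbf{A}) \cdot d\mathbf{a} = \int_S [\mathbf{A} \times (\nabla f)] \cdot d\mathbf{a} + \oint_P f \mathbf{A} \cdot d\mathbf{l}. \tag{1.60}$$

(b) Show that

$$\int_V \mathbf{B} \cdot (\nabla \times \mathbf{A}) \, d\tau = \int_V \mathbf{A} \cdot (\nabla \times \mathbf{B}) \, d\tau + \oint_S (\mathbf{A} \times \mathbf{B}) \cdot d\mathbf{a}. \tag{1.61}$$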


1.4 Curvilinear Coordinates 


1.4.1 Spherical Coordinates 


You can label a point P by its Cartesian coordinates (x, y, z), but sometimes
it is more convenient to use spherical coordinates (r, θ, φ); r is the distance
from the origin (the magnitude of the position vector r), θ (the angle down
from the z-axis) is called the polar angle, and φ (the angle around from the
x-axis) is the azimuthal angle. Their relation to Cartesian coordinates can
be read from Fig. 1.36:


$$x = r \sin\theta \cos\phi, \quad y = r \sin\theta \sin\phi, \quad z = r \cos\theta. \tag{1.62}$$


Figure 1.36 also shows three unit vectors, r̂, θ̂, φ̂, pointing in the
direction of increase of the corresponding coordinates. They constitute an
orthogonal (mutually perpendicular) basis set (just like x̂, ŷ, ẑ), and any
vector A can be expressed in terms of them, in the usual way:


$$\mathbf{A} = A_r \, \hat{\mathbf{r}} + A_\theta \, \hat{\boldsymbol{\theta}} + A_\phi \, \hat{\boldsymbol{\phi}}; \tag{1.63}$$


where A_r, A_θ, and A_φ are the radial, polar, and azimuthal components of
A. In terms of the Cartesian unit vectors,


$$\begin{aligned}
\hat{\mathbf{r}} &= \sin\theta \cos\phi \, \hat{\mathbf{x}} + \sin\theta \sin\phi \, \hat{\mathbf{y}} + \cos\theta \, \hat{\mathbf{z}}, \\
\hat{\boldsymbol{\theta}} &= \cos\theta \cos\phi \, \hat{\mathbf{x}} + \cos\theta \sin\phi \, \hat{\mathbf{y}} - \sin\theta \, \hat{\mathbf{z}}, \\
\hat{\boldsymbol{\phi}} &= -\sin\phi \, \hat{\mathbf{x}} + \cos\phi \, \hat{\mathbf{y}},
\end{aligned} \tag{1.64}$$

as you can check for yourself (Prob. 1.38). I have put these formulas inside
the back cover, for easy reference.

But there is a poisonous snake lurking here that I’d better warn you
about: r̂, θ̂, and φ̂ are associated with a particular point P, and they change
direction as P moves around. For example, r̂ always points radially outward,
but “radially outward” can be the x direction, the y direction, or any other
direction, depending on where you are. In Fig. 1.37, A = ŷ and B = −ŷ, and
yet both of them would be written as r̂ in spherical coordinates. One could
take account of this by explicitly indicating the point of reference: r̂(θ, φ),
θ̂(θ, φ), φ̂(θ, φ), but this would be cumbersome, and as long as you are
alert to the problem, I don’t think it will cause difficulties.⁹ In particular,
do not naively combine the spherical components of vectors associated
with different points (in Fig. 1.37, A + B = 0, not 2r̂, and A · B = −1,
not +1). Beware of differentiating a vector that is expressed in spherical
coordinates, since the unit vectors themselves are functions of position
(∂r̂/∂θ = θ̂, for example). And do not take r̂, θ̂, and φ̂ outside an integral,
as I did with x̂, ŷ, and ẑ in Eq. 1.53. In general, if you’re uncertain about
the validity of an operation, express everything in Cartesian coordinates;
they are snakeproof.
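
The warning above is easy to make concrete. The Python sketch below simply evaluates Eq. 1.64 at two different points; I am assuming (the figure is not reproduced here) that A sits on the positive y-axis and B on the negative y-axis, and the function names are invented for the illustration.

```python
import math

def spherical_unit_vectors(theta, phi):
    """Cartesian components of r-hat, theta-hat, phi-hat at angles (theta, phi), per Eq. 1.64."""
    st, ct = math.sin(theta), math.cos(theta)
    sp, cp = math.sin(phi), math.cos(phi)
    r_hat = (st * cp, st * sp, ct)
    theta_hat = (ct * cp, ct * sp, -st)
    phi_hat = (-sp, cp, 0.0)
    return r_hat, theta_hat, phi_hat

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

# r-hat evaluated on the +y axis and on the -y axis (theta = pi/2 in both cases)
A = spherical_unit_vectors(math.pi / 2, math.pi / 2)[0]
B = spherical_unit_vectors(math.pi / 2, 3 * math.pi / 2)[0]

print(A, B)        # roughly (0, 1, 0) and (0, -1, 0), up to rounding
print(dot(A, B))   # close to -1, even though both are "r-hat"
```

Working with Cartesian components, as this sketch does, is the "snakeproof" route the text recommends.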

An infinitesimal displacement in the r̂ direction is simply dr (Fig. 1.38a),
just as an infinitesimal element of length in the x direction is dx:


$$dl_r = dr. \tag{1.65}$$


On the other hand, an infinitesimal element of length in the θ̂ direction
(Fig. 1.38b) is not just dθ (that’s an angle; it doesn’t even have the right
units for a length); rather,


$$dl_\theta = r \, d\theta. \tag{1.66}$$

Similarly, an infinitesimal element of length in the φ̂ direction (Fig. 1.38c) is

$$dl_\phi = r \sin\theta \, d\phi. \tag{1.67}$$

9 I claimed back at the beginning that vectors have no location, and I’ll stand by that. The
vectors themselves live “out there,” completely independent of our choice of coordinates.
But the notation we use to represent them does depend on the point in question, in curvilinear
coordinates; r̂, θ̂, and φ̂ stand for different vectors at different locations.

Thus the general infinitesimal displacement dl is

$$d\mathbf{l} = dr \, \hat{\mathbf{r}} + r \, d\theta \, \hat{\boldsymbol{\theta}} + r \sin\theta \, d\phi \, \hat{\boldsymbol{\phi}}. \tag{1.68}$$


This plays the role (in line integrals, for example) that dl = dx x̂ + dy ŷ + dz ẑ
played in Cartesian coordinates.

The infinitesimal volume element dτ, in spherical coordinates, is the
product of the three infinitesimal displacements:


$$d\tau = dl_r \, dl_\theta \, dl_\phi = r^2 \sin\theta \, dr \, d\theta \, d\phi. \tag{1.69}$$


I cannot give you a general expression for surface elements da, since these 
depend on the orientation of the surface. You simply have to analyze 
the geometry for any given case (this goes for Cartesian and curvilinear 
coordinates alike). If you are integrating over the surface of a sphere, for
instance, then r is constant, whereas θ and φ change (Fig. 1.39), so


$$d\mathbf{a}_1 = dl_\theta \, dl_\phi \, \hat{\mathbf{r}} = r^2 \sin\theta \, d\theta \, d\phi \, \hat{\mathbf{r}}.$$


On the other hand, if the surface lies in the xy plane, say, so that θ is
constant (to wit, π/2) while r and φ vary, then


$$d\mathbf{a}_2 = dl_r \, dl_\phi \, \hat{\boldsymbol{\theta}} = r \, dr \, d\phi \, \hat{\boldsymbol{\theta}}.$$


Notice, finally, that r ranges from 0 to ∞, φ from 0 to 2π, and θ from 0
to π (not 2π; that would count every point twice).¹⁰


10 Alternatively, you could run φ from 0 to π (the “eastern hemisphere”) and cover the
“western hemisphere” by extending θ from π up to 2π. But this is very bad notation, since,
among other things, sin θ will then run negative, and you'll have to put absolute value
signs around that term in volume and surface elements (area and volume being intrinsically
positive quantities).
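Example 1.13. Find the volume of a sphere of radius R.


Solution.

$$V = \int d\tau = \int_{r=0}^{R} \int_{\theta=0}^{\pi} \int_{\phi=0}^{2\pi} r^2 \sin\theta \, dr \, d\theta \, d\phi = \left( \int_0^R r^2 \, dr \right) \left( \int_0^\pi \sin\theta \, d\theta \right) \left( \int_0^{2\pi} d\phi \right) = \left( \frac{R^3}{3} \right) (2)(2\pi) = \frac{4}{3} \pi R^3$$


(not a big surprise).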
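
If you would like to watch the volume element of Eq. 1.69 do its job, here is a short Python sketch of Example 1.13 (a midpoint-rule estimate; the function name `sphere_volume` and the grid size are my own assumptions). The three integrals factor, so each is summed separately.

```python
import math

def sphere_volume(R, n=400):
    """Midpoint-rule integral of r^2 sin(theta) dr dtheta dphi over a ball of radius R."""
    hr, ht = R / n, math.pi / n
    radial = sum(((i + 0.5) * hr) ** 2 for i in range(n)) * hr     # integral of r^2 dr
    polar = sum(math.sin((j + 0.5) * ht) for j in range(n)) * ht   # integral of sin(theta) dtheta
    azimuthal = 2 * math.pi                                        # integrand has no phi dependence
    return radial * polar * azimuthal

print(sphere_volume(2.0), 4 / 3 * math.pi * 2.0**3)   # the two numbers should nearly agree
```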


So far we have talked only about the geometry of spherical coordinates.
Now I would like to “translate” the vector derivatives (gradient, divergence,
curl, and Laplacian) into r, θ, φ notation. In principle, this is entirely
straightforward: in the case of the gradient,


$$\nabla T = \frac{\partial T}{\partial x} \hat{\mathbf{x}} + \frac{\partial T}{\partial y} \hat{\mathbf{y}} + \frac{\partial T}{\partial z} \hat{\mathbf{z}},$$

we would first use the chain rule to expand the partials:

$$\frac{\partial T}{\partial x} = \frac{\partial T}{\partial r} \left( \frac{\partial r}{\partial x} \right) + \frac{\partial T}{\partial \theta} \left( \frac{\partial \theta}{\partial x} \right) + \frac{\partial T}{\partial \phi} \left( \frac{\partial \phi}{\partial x} \right).$$

The terms in parentheses could be worked out from Eq. 1.62, or rather,
the inverse of those equations (Prob. 1.37). Then we’d do the same for
∂T/∂y and ∂T/∂z. Finally, we’d substitute in the formulas for x̂, ŷ, and
ẑ in terms of r̂, θ̂, and φ̂ (Prob. 1.38). It would take an hour to figure
out the gradient in spherical coordinates by this brute-force method. I
suppose this is how it was first done, but there is a much more efficient
indirect approach, explained in Appendix A, which has the extra advantage
of treating all coordinate systems at once. I described the “straightforward”
method only to convince you that there is nothing subtle or mysterious
about transforming to spherical coordinates: you’re expressing the
same quantity (gradient, divergence, or whatever) in different notation,
that’s all.

Here, then, are the vector derivatives in spherical coordinates: 

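Gradient:

$$\nabla T = \frac{\partial T}{\partial r} \hat{\mathbf{r}} + \frac{1}{r} \frac{\partial T}{\partial \theta} \hat{\boldsymbol{\theta}} + \frac{1}{r \sin\theta} \frac{\partial T}{\partial \phi} \hat{\boldsymbol{\phi}}. \tag{1.70}$$

Divergence:

$$\nabla \cdot \mathbf{v} = \frac{1}{r^2} \frac{\partial}{\partial r} (r^2 v_r) + \frac{1}{r \sin\theta} \frac{\partial}{\partial \theta} (\sin\theta \, v_\theta) + \frac{1}{r \sin\theta} \frac{\partial v_\phi}{\partial \phi}. \tag{1.71}$$

Curl:

$$\nabla \times \mathbf{v} = \frac{1}{r \sin\theta} \left[ \frac{\partial}{\partial \theta} (\sin\theta \, v_\phi) - \frac{\partial v_\theta}{\partial \phi} \right] \hat{\mathbf{r}} + \frac{1}{r} \left[ \frac{1}{\sin\theta} \frac{\partial v_r}{\partial \phi} - \frac{\partial}{\partial r} (r v_\phi) \right] \hat{\boldsymbol{\theta}} + \frac{1}{r} \left[ \frac{\partial}{\partial r} (r v_\theta) - \frac{\partial v_r}{\partial \theta} \right] \hat{\boldsymbol{\phi}}. \tag{1.72}$$

Laplacian:

$$\nabla^2 T = \frac{1}{r^2} \frac{\partial}{\partial r} \left( r^2 \frac{\partial T}{\partial r} \right) + \frac{1}{r^2 \sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial T}{\partial \theta} \right) + \frac{1}{r^2 \sin^2\theta} \frac{\partial^2 T}{\partial \phi^2}. \tag{1.73}$$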


For reference, these formulas are listed inside the front cover. 


Problem 1.37. Find formulas for r, θ, φ in terms of x, y, z (the inverse, in other words, of
Eq. 1.62).


• Problem 1.38. Express the unit vectors r̂, θ̂, φ̂ in terms of x̂, ŷ, ẑ (that is, derive Eq. 1.64).
Check your answers several ways (r̂ · r̂ = 1? θ̂ · φ̂ = 0? r̂ × θ̂ = φ̂? ...). Also work out
the inverse formulas, giving x̂, ŷ, ẑ in terms of r̂, θ̂, φ̂ (and θ, φ).


• Problem 1.39.


(a) Check the divergence theorem for the function v₁ = r² r̂, using as your volume the sphere
of radius R, centered at the origin.
(b) Do the same for v₂ = (1/r²) r̂. (If the answer surprises you, look back at Prob. 1.16.)


Problem 1.40. Compute the divergence of the function

$$\mathbf{v} = (r \cos\theta) \, \hat{\mathbf{r}} + (r \sin\theta) \, \hat{\boldsymbol{\theta}} + (r \sin\theta \cos\phi) \, \hat{\boldsymbol{\phi}}.$$


Check the divergence theorem for this function, using as your volume the inverted hemispherical
bowl of radius R, resting on the xy plane and centered at the origin (Fig. 1.40).




Fig. 1.40 Fig. 1.41 


Problem 1.41. Compute the gradient and Laplacian of the scalar function T = r(cos θ +
sin θ cos φ). Check the Laplacian by converting T to Cartesian coordinates and using
Eq. 1.42. Test the gradient theorem for this function, using the path shown in Fig. 1.41, from
(0, 0, 0) to (0, 0, 2).
1.4.2 Cylindrical Coordinates


The cylindrical coordinates (s, φ, z) of a point P are defined in Fig. 1.42.
Notice that φ has the same meaning as in spherical coordinates, and z
is the same as Cartesian; s is the distance to P from the z-axis, whereas
the spherical coordinate r is the distance from the origin. The relation to
Cartesian coordinates is


$$x = s \cos\phi, \quad y = s \sin\phi, \quad z = z. \tag{1.74}$$

The unit vectors (Prob. 1.42) are

$$\begin{aligned}
\hat{\mathbf{s}} &= \cos\phi \, \hat{\mathbf{x}} + \sin\phi \, \hat{\mathbf{y}}, \\
\hat{\boldsymbol{\phi}} &= -\sin\phi \, \hat{\mathbf{x}} + \cos\phi \, \hat{\mathbf{y}}, \\
\hat{\mathbf{z}} &= \hat{\mathbf{z}}.
\end{aligned} \tag{1.75}$$


The infinitesimal displacements are

$$dl_s = ds, \quad dl_\phi = s \, d\phi, \quad dl_z = dz, \tag{1.76}$$

so

$$d\mathbf{l} = ds \, \hat{\mathbf{s}} + s \, d\phi \, \hat{\boldsymbol{\phi}} + dz \, \hat{\mathbf{z}}, \tag{1.77}$$

and the volume element is

$$d\tau = s \, ds \, d\phi \, dz. \tag{1.78}$$


The range of s is 0 → ∞, φ goes from 0 → 2π, and z from −∞ to ∞.
The vector derivatives in cylindrical coordinates are 


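Gradient:

$$\nabla T = \frac{\partial T}{\partial s} \hat{\mathbf{s}} + \frac{1}{s} \frac{\partial T}{\partial \phi} \hat{\boldsymbol{\phi}} + \frac{\partial T}{\partial z} \hat{\mathbf{z}}. \tag{1.79}$$

Divergence:

$$\nabla \cdot \mathbf{v} = \frac{1}{s} \frac{\partial}{\partial s} (s v_s) + \frac{1}{s} \frac{\partial v_\phi}{\partial \phi} + \frac{\partial v_z}{\partial z}. \tag{1.80}$$

Curl:

$$\nabla \times \mathbf{v} = \left( \frac{1}{s} \frac{\partial v_z}{\partial \phi} - \frac{\partial v_\phi}{\partial z} \right) \hat{\mathbf{s}} + \left( \frac{\partial v_s}{\partial z} - \frac{\partial v_z}{\partial s} \right) \hat{\boldsymbol{\phi}} + \frac{1}{s} \left[ \frac{\partial}{\partial s} (s v_\phi) - \frac{\partial v_s}{\partial \phi} \right] \hat{\mathbf{z}}. \tag{1.81}$$

Laplacian:

$$\nabla^2 T = \frac{1}{s} \frac{\partial}{\partial s} \left( s \frac{\partial T}{\partial s} \right) + \frac{1}{s^2} \frac{\partial^2 T}{\partial \phi^2} + \frac{\partial^2 T}{\partial z^2}. \tag{1.82}$$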


These formulas are also listed inside the front cover. 
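
One way to gain confidence in a formula like Eq. 1.82 is to test it on a function whose Laplacian you can also get in Cartesian coordinates. The Python sketch below does this with finite differences. The test function T = s³ sin φ + z² is my own pick (in Cartesian form it is (x² + y²)y + z², whose Laplacian is 8y + 2), and the helper name and step size h are assumptions of the sketch.

```python
import math

def T(s, phi, z):
    """Illustrative test function: s^3 sin(phi) + z^2, i.e. (x^2 + y^2) y + z^2."""
    return s**3 * math.sin(phi) + z**2

def laplacian_cyl(T, s, phi, z, h=1e-3):
    """Finite-difference version of Eq. 1.82 (central differences, step h)."""
    def s_dTds(s_):
        # s * dT/ds evaluated at s_
        return s_ * (T(s_ + h, phi, z) - T(s_ - h, phi, z)) / (2 * h)
    term_s = (s_dTds(s + h) - s_dTds(s - h)) / (2 * h) / s
    term_phi = (T(s, phi + h, z) - 2 * T(s, phi, z) + T(s, phi - h, z)) / h**2 / s**2
    term_z = (T(s, phi, z + h) - 2 * T(s, phi, z) + T(s, phi, z - h)) / h**2
    return term_s + term_phi + term_z

s, phi, z = 1.5, 0.7, 0.3
y = s * math.sin(phi)
print(laplacian_cyl(T, s, phi, z), 8 * y + 2)   # cylindrical formula vs Cartesian result
```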
Problem 1.42. Express the cylindrical unit vectors ŝ, φ̂, ẑ in terms of x̂, ŷ, ẑ (that is, derive
Eq. 1.75). “Invert” your formulas to get x̂, ŷ, ẑ in terms of ŝ, φ̂, ẑ (and φ).


Fig. 1.43


Problem 1.43.
(a) Find the divergence of the function

$$\mathbf{v} = s(2 + \sin^2\phi) \, \hat{\mathbf{s}} + s \sin\phi \cos\phi \, \hat{\boldsymbol{\phi}} + 3z \, \hat{\mathbf{z}}.$$


(b) Test the divergence theorem for this function, using the quarter-cylinder (radius 2, height
5) shown in Fig. 1.43.
(c) Find the curl of v.


1.5 The Dirac Delta Function 


1.5.1 The Divergence of r̂/r²


Consider the vector function

$$\mathbf{v} = \frac{1}{r^2} \hat{\mathbf{r}}. \tag{1.83}$$


At every location, v is directed radially outward (Fig. 1.44); if ever there
was a function that ought to have a large positive divergence, this is it. And
yet, when you actually calculate the divergence (using Eq. 1.71), you get
precisely zero:

$$\nabla \cdot \mathbf{v} = \frac{1}{r^2} \frac{\partial}{\partial r} \left( r^2 \frac{1}{r^2} \right) = \frac{1}{r^2} \frac{\partial}{\partial r} (1) = 0. \tag{1.84}$$
Or you can do it in Cartesian coordinates (as in Prob. 1.16). The plot
thickens when we apply the divergence theorem to this function. Suppose
we integrate over a sphere of radius R, centered at the origin (Prob. 1.39b);
the surface integral is


$$\oint \mathbf{v} \cdot d\mathbf{a} = \int \left( \frac{1}{R^2} \hat{\mathbf{r}} \right) \cdot (R^2 \sin\theta \, d\theta \, d\phi \, \hat{\mathbf{r}}) = \left( \int_0^\pi \sin\theta \, d\theta \right) \left( \int_0^{2\pi} d\phi \right) = 4\pi. \tag{1.85}$$


But the volume integral, ∫ ∇ · v dτ, is zero, if we are really to believe Eq. 1.84.
Does this mean that the divergence theorem is false? What’s going on here?

The source of the problem is the point r = 0, where v blows up (and
where, in Eq. 1.84, we have unwittingly divided by zero). It is quite true
that ∇ · v = 0 everywhere except the origin, but right at the origin the
situation is more complicated. Notice that the surface integral (Eq. 1.85)
is independent of R; if the divergence theorem is right (and it is), we should
get ∫ (∇ · v) dτ = 4π for any sphere centered at the origin, no matter how
small. Evidently, the entire contribution must be coming from the point
r = 0! Thus, ∇ · v has the bizarre property that it vanishes everywhere except
at one point, and yet its integral (over any volume containing that point) is
4π. No ordinary function behaves like that. (On the other hand, a physical
example does come to mind: the density (mass per unit volume) of a point
particle. It’s zero except at the exact location of the particle, and yet its
integral is finite: namely, the mass of the particle.) What we have stumbled
on is a mathematical object known to physicists as the Dirac delta function.
It arises in many branches of theoretical physics. Moreover, the specific
problem at hand (the divergence of the function r̂/r²) is not just some arcane
curiosity; it is, in fact, central to the whole theory of electrodynamics. So
it is worthwhile pausing here and studying the Dirac delta function with
some care.
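
Both halves of the puzzle show up plainly in a numerical experiment. The Python sketch below is an illustration under my own assumptions (midpoint sums for the flux, central differences for the divergence, arbitrary sample radii and sample point): the flux of r̂/r² through a sphere comes out 4π whatever the radius, while the divergence at a point away from the origin comes out essentially zero.

```python
import math

def flux_through_sphere(R, n=400):
    """Surface integral of v = r-hat / r^2 over a sphere of radius R (as in Eq. 1.85)."""
    ht = math.pi / n
    # v . da = (1/R^2) * (R^2 sin(theta) dtheta dphi); the phi integral gives 2*pi
    polar = sum((1.0 / R**2) * R**2 * math.sin((j + 0.5) * ht) for j in range(n)) * ht
    return 2 * math.pi * polar

def divergence_cartesian(x, y, z, h=1e-4):
    """Central-difference divergence of v = r / r^3 (the same field in Cartesian form)."""
    def v(x, y, z):
        r3 = (x * x + y * y + z * z) ** 1.5
        return (x / r3, y / r3, z / r3)
    return ((v(x + h, y, z)[0] - v(x - h, y, z)[0])
            + (v(x, y + h, z)[1] - v(x, y - h, z)[1])
            + (v(x, y, z + h)[2] - v(x, y, z - h)[2])) / (2 * h)

for R in (0.01, 1.0, 100.0):
    print(R, flux_through_sphere(R), 4 * math.pi)
print(divergence_cartesian(0.3, -0.4, 0.5))   # essentially zero away from the origin
```

No finite-difference stencil can be evaluated at the origin itself, which is a numerical echo of the division by zero hidden in Eq. 1.84.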


1.5.2 The One-Dimensional Dirac Delta Function 


The one-dimensional Dirac delta function, δ(x), can be pictured as an
infinitely high, infinitesimally narrow “spike,” with area 1 (Fig. 1.45). That
is to say,

$$\delta(x) = \begin{cases} 0, & \text{if } x \neq 0, \\ \infty, & \text{if } x = 0, \end{cases} \tag{1.86}$$

and¹¹

$$\int_{-\infty}^{\infty} \delta(x) \, dx = 1. \tag{1.87}$$


Technically, δ(x) is not a function at all, since its value is not finite at x =
0; in the mathematical literature it is known as a generalized function, or
distribution. It is, if you like, the limit of a sequence of functions, such as
rectangles Rₙ(x), of height n and width 1/n, or isosceles triangles Tₙ(x), of
height n and base 2/n (Fig. 1.46).

If f(x) is some “ordinary” function (that is, not another delta function;
in fact, just to be on the safe side, let’s say that f(x) is continuous), then the
product f(x)δ(x) is zero everywhere except at x = 0. It follows that


$$f(x) \delta(x) = f(0) \delta(x). \tag{1.88}$$


(This is the most important fact about the delta function, so make sure
you understand why it is true: since the product is zero anyway except at
x = 0, we may as well replace f(x) by the value it assumes at the origin.) In
particular,


$$\int_{-\infty}^{\infty} f(x) \delta(x) \, dx = f(0) \int_{-\infty}^{\infty} \delta(x) \, dx = f(0). \tag{1.89}$$


11 Notice that the dimensions of δ(x) are 1 over the dimensions of its argument; if x is a
length, δ(x) carries the units m⁻¹.

Under an integral, then, the delta function “picks out” the value of f(x)
at x = 0. (Here and below, the integral need not run from −∞ to +∞; it
is sufficient that it straddle the delta function, and −ε to +ε would do
as well.)

Of course, we can shift the spike from x = 0 to some other point, x = a
(Fig. 1.47):


$$\delta(x - a) = \begin{cases} 0, & \text{if } x \neq a, \\ \infty, & \text{if } x = a, \end{cases} \quad \text{with} \quad \int_{-\infty}^{\infty} \delta(x - a) \, dx = 1. \tag{1.90}$$

Equation 1.88 becomes

$$f(x) \delta(x - a) = f(a) \delta(x - a), \tag{1.91}$$


and Eq. 1.89 generalizes to


$$\int_{-\infty}^{\infty} f(x) \delta(x - a) \, dx = f(a). \tag{1.92}$$


Example 1.14. Evaluate the integral


$$\int_0^3 x^3 \delta(x - 2) \, dx.$$


Solution. The delta function picks out the value of x³ at the point
x = 2, so the integral is 2³ = 8. Notice, however, that if the upper limit
had been 1 (instead of 3), the answer would be 0, because the spike
would then be outside the domain of integration.
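
Since δ(x) can be regarded as the limit of the rectangles Rₙ(x), Example 1.14 can be imitated numerically by putting a tall, narrow rectangle in place of the delta function. The Python sketch below does exactly that; the helper names, the choice of n values, and the crude midpoint integrator are my own assumptions, and the result is only approximately 8 for any finite n.

```python
def rectangle(x, n):
    """R_n(x): height n, width 1/n, centered on x = 0 (so its area is 1)."""
    return float(n) if abs(x) <= 0.5 / n else 0.0

def integrate(func, a, b, steps=200_000):
    """Plain midpoint-rule integral of func from a to b."""
    h = (b - a) / steps
    return sum(func(a + (i + 0.5) * h) for i in range(steps)) * h

def smeared(n, upper):
    """Integral of x^3 R_n(x - 2) from 0 to `upper`."""
    return integrate(lambda x: x**3 * rectangle(x - 2.0, n), 0.0, upper)

print(smeared(10, 3.0), smeared(100, 3.0))   # approaches 8 as the spike narrows
print(smeared(100, 1.0))                     # 0.0: the spike lies outside [0, 1]
```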


Although δ itself is not a legitimate function, integrals over δ are perfectly
acceptable. In fact, it’s best to think of the delta function as something
that is always intended for use under an integral sign. In particular, two
expressions involving delta functions (say, D₁(x) and D₂(x)) are considered
equal if¹²


$$\int_{-\infty}^{\infty} f(x) D_1(x) \, dx = \int_{-\infty}^{\infty} f(x) D_2(x) \, dx \tag{1.93}$$


for all (“ordinary”) functions f(x).


Example 1.15. Show that


$$\delta(kx) = \frac{1}{|k|} \delta(x), \tag{1.94}$$


where k is any (nonzero) constant. (In particular, δ(−x) = δ(x): it’s an
even function.)


Solution. For an arbitrary test function f(x), consider the integral


$$\int_{-\infty}^{\infty} f(x) \delta(kx) \, dx.$$


Changing variables, we let y = kx, so that x = y/k, and dx = (1/k) dy.
If k is positive, the integration still runs from −∞ to +∞, but if k is
negative, then x = ∞ implies y = −∞, and vice versa, so the order of
the limits is reversed. Restoring the “proper” order costs a minus sign.
Thus


$$\int_{-\infty}^{\infty} f(x) \delta(kx) \, dx = \pm \int_{-\infty}^{\infty} f(y/k) \delta(y) \frac{dy}{k} = \pm \frac{1}{k} f(0) = \frac{1}{|k|} f(0).$$


(The lower signs apply when k is negative, and we account for this
neatly by putting absolute value bars around the final k, as indicated.)
Under the integral sign, then, δ(kx) serves the same purpose as
(1/|k|)δ(x):


$$\int_{-\infty}^{\infty} f(x) \delta(kx) \, dx = \int_{-\infty}^{\infty} f(x) \left[ \frac{1}{|k|} \delta(x) \right] dx.$$


According to the criterion Eq. 1.93, therefore, δ(kx) and (1/|k|)δ(x)
are equal.
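
The scaling rule of Eq. 1.94 can be checked in the same spirit, with a narrow rectangle Rₙ standing in for δ. In this Python sketch the test function cos x, the values of k, and the helper names are all illustrative assumptions; for finite n the agreement with f(0)/|k| is only approximate.

```python
import math

def rectangle(x, n):
    """R_n(x): height n, width 1/n, centered on x = 0."""
    return float(n) if abs(x) <= 0.5 / n else 0.0

def integrate(func, a, b, steps=200_000):
    """Plain midpoint-rule integral of func from a to b."""
    h = (b - a) / steps
    return sum(func(a + (i + 0.5) * h) for i in range(steps)) * h

def scaled_spike(f, k, n=50):
    """Integral of f(x) R_n(k x) over [-1, 1], with R_n standing in for delta."""
    return integrate(lambda x: f(x) * rectangle(k * x, n), -1.0, 1.0)

for k in (-3.0, 0.5):
    print(k, scaled_spike(math.cos, k), math.cos(0.0) / abs(k))
```

A negative k squeezes the rectangle just as a positive one does, which is why only |k| appears in the result.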


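Problem 1.44. Evaluate the following integrals:
(a) $\int_2^6 (3x^2 - 2x - 1) \, \delta(x - 3) \, dx$.
(b) $\int_0^5 \cos x \, \delta(x - \pi) \, dx$.
(c) $\int_0^3 x^3 \, \delta(x + 1) \, dx$.
(d) $\int_{-\infty}^{\infty} \ln(x + 3) \, \delta(x + 2) \, dx$.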
12 I emphasize that the integrals must be equal for any f(x). Suppose D₁(x) and D₂(x) actually
differed, say, in the neighborhood of the point x = 17. Then we could pick a function f(x)
that was sharply peaked about x = 17, and the integrals would not be equal.
Problem 1.45. Evaluate the following integrals:
(a) $\int_{-2}^{2} (2x + 3) \, \delta(3x) \, dx$.
(b) $\int_0^2 (x^3 + 3x + 2) \, \delta(1 - x) \, dx$.
(c) $\int_{-1}^{1} 9x^2 \, \delta(3x + 1) \, dx$.
(d) $\int_{-\infty}^{a} \delta(x - b) \, dx$.
Problem 1.46.
(a) Show that

$$x \frac{d}{dx} (\delta(x)) = -\delta(x).$$


[Hint: Use integration by parts.]
(b) Let θ(x) be the (Heaviside) step function:


$$\theta(x) = \begin{cases} 1, & \text{if } x > 0, \\ 0, & \text{if } x \le 0. \end{cases} \tag{1.95}$$


Show that dθ/dx = δ(x).


1.5.3 The Three-Dimensional Delta Function 


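It is easy to generalize the delta function to three dimensions:

$$\delta^3(\mathbf{r}) = \delta(x) \, \delta(y) \, \delta(z). \tag{1.96}$$

(As always, r = x x̂ + y ŷ + z ẑ is the position vector, extending from the
origin to the point (x, y, z).) This three-dimensional delta function is zero
everywhere except at (0, 0, 0), where it blows up. Its volume integral is 1:


$$\int_{\text{all space}} \delta^3(\mathbf{r}) \, d\tau = \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \delta(x) \, \delta(y) \, \delta(z) \, dx \, dy \, dz = 1. \tag{1.97}$$


And, generalizing Eq. 1.92,


$$\int_{\text{all space}} f(\mathbf{r}) \, \delta^3(\mathbf{r} - \mathbf{a}) \, d\tau = f(\mathbf{a}). \tag{1.98}$$


As in the one-dimensional case, integration with δ picks out the value of
the function f at the location of the spike.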

We are now in a position to resolve the paradox introduced in
Section 1.5.1. As you will recall, we found that the divergence of r̂/r² is
zero everywhere except at the origin, and yet its integral over any volume
containing the origin is a constant (to wit, 4π). These are precisely the
defining conditions for the Dirac delta function; evidently,


$$\nabla \cdot \left( \frac{\hat{\mathbf{r}}}{r^2} \right) = 4\pi \delta^3(\mathbf{r}). \tag{1.99}$$
More generally,


$$\nabla \cdot \left( \frac{\hat{𝓻}}{𝓇^2} \right) = 4\pi \delta^3(𝓻), \tag{1.100}$$


where, as always, 𝓻 is the separation vector: 𝓻 = r − r′. Note that differentiation
here is with respect to r, while r′ is held constant. Incidentally, since


$$\nabla \left( \frac{1}{𝓇} \right) = -\frac{\hat{𝓻}}{𝓇^2} \tag{1.101}$$

(Prob. 1.13b), it follows that

$$\nabla^2 \frac{1}{𝓇} = -4\pi \delta^3(𝓻). \tag{1.102}$$


Example 1.16. Evaluate the integral 


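$$J = \int_{\mathcal{V}} (r^2 + 2)\,\nabla \cdot \left(\frac{\hat{\mathbf{r}}}{r^2}\right) d\tau,$$

where $\mathcal{V}$ is a sphere¹³ of radius R centered at the origin.


Solution 1. Use Eq. 1.99 to rewrite the divergence, and Eq. 1.98 to do
the integral:

$$J = \int_{\mathcal{V}} (r^2 + 2)\,4\pi\,\delta^3(\mathbf{r})\,d\tau = 4\pi(0 + 2) = 8\pi.$$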


This one-line solution demonstrates something of the power and 
beauty of the delta function, but I would like to show you a second 
method, which is much more cumbersome but serves to illustrate the 
method of integration by parts (Section 1.3.6). 


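Solution 2. Using Eq. 1.59, we transfer the derivative from $\hat{\mathbf{r}}/r^2$ to
$(r^2 + 2)$:

$$J = -\int_{\mathcal{V}} \frac{\hat{\mathbf{r}}}{r^2} \cdot \left[\nabla(r^2 + 2)\right] d\tau + \oint_{\mathcal{S}} (r^2 + 2)\,\frac{\hat{\mathbf{r}}}{r^2} \cdot d\mathbf{a}.$$

The gradient is

$$\nabla(r^2 + 2) = 2r\,\hat{\mathbf{r}},$$

so the volume integral becomes

$$\int \frac{2}{r}\,d\tau = \int \frac{2}{r}\,r^2 \sin\theta\,dr\,d\theta\,d\phi = 8\pi \int_0^R r\,dr = 4\pi R^2.$$

Meanwhile, on the boundary of the sphere (where r = R),

$$d\mathbf{a} = R^2 \sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}},$$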


13 In proper mathematical jargon, “sphere” denotes the surface, and “ball” the volume it
encloses. But physicists are (as usual) sloppy about this sort of thing, and I use the word 
“sphere” for both the surface and the volume. Where the meaning is not clear from the 
context, I will write “spherical surface” or “spherical volume.” The language police tell me 
that the former is redundant and the latter an oxymoron, but a poll of my physics colleagues 
reveals that this is (for us) the standard usage. 
so the surface integral is

$$\int (R^2 + 2)\,\sin\theta\,d\theta\,d\phi = 4\pi(R^2 + 2).$$

Putting it all together,

$$J = -4\pi R^2 + 4\pi(R^2 + 2) = 8\pi,$$


as before. 
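
As a quick numerical sanity check on Solution 2 (a sketch only, not part of the original solution), the two pieces can be evaluated for several radii to confirm that R drops out. The function name and the midpoint-rule step count are illustrative choices; the angular integrals are done analytically, as in the text, so only the radial integral is computed numerically.

```python
import math

def example_1_16_by_parts(R, n=2000):
    """Evaluate J of Example 1.16 from the two pieces of Solution 2."""
    # Volume piece: (r_hat/r^2) . grad(r^2 + 2) = 2/r, and the angular
    # integration of d(tau) gives 4*pi*r^2 dr, so we integrate 8*pi*r
    # from 0 to R with the midpoint rule.
    dr = R / n
    volume = sum(8 * math.pi * (i + 0.5) * dr * dr for i in range(n))
    # Surface piece: (R^2 + 2) * (1/R^2) * (4*pi*R^2) = 4*pi*(R^2 + 2).
    surface = 4 * math.pi * (R**2 + 2)
    return -volume + surface

for R in (0.5, 1.0, 3.0):
    # Each line should show the same value of J, independent of R.
    print(R, example_1_16_by_parts(R), 8 * math.pi)
```

The radius cancels between the volume and surface terms, which is what the delta-function argument of Solution 1 predicts.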


Problem 1.47. 


(a) Write an expression for the volume charge density ρ(r) of a point charge q at r′. Make
sure that the volume integral of ρ equals q.

(b) What is the volume charge density of an electric dipole, consisting of a point charge —q at 
the origin and a point charge +q at a? 

(c) What is the volume charge density (in spherical coordinates) of a uniform, infinitesimally 
thin spherical shell of radius R and total charge Q, centered at the origin? [Beware: The 
integral over all space must equal Q.] 


Problem 1.48. Evaluate the following integrals: 


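(a) $\int (r^2 + \mathbf{r} \cdot \mathbf{a} + a^2)\,\delta^3(\mathbf{r} - \mathbf{a})\,d\tau$, where a is a fixed vector, a is its magnitude, and the
integral is over all space.

(b) $\int_{\mathcal{V}} |\mathbf{r} - \mathbf{b}|^2\,\delta^3(5\mathbf{r})\,d\tau$, where $\mathcal{V}$ is a cube of side length 2, centered on the origin, and
$\mathbf{b} = 4\,\hat{\mathbf{y}} + 3\,\hat{\mathbf{z}}$.

(c) $\int_{\mathcal{V}} \left[r^4 + r^2(\mathbf{r} \cdot \mathbf{c}) + c^4\right] \delta^3(\mathbf{r} - \mathbf{c})\,d\tau$, where $\mathcal{V}$ is a sphere of radius 6 about the origin,
$\mathbf{c} = 5\,\hat{\mathbf{x}} + 3\,\hat{\mathbf{y}} + 2\,\hat{\mathbf{z}}$, and c is its magnitude.

(d) $\int_{\mathcal{V}} \mathbf{r} \cdot (\mathbf{d} - \mathbf{r})\,\delta^3(\mathbf{e} - \mathbf{r})\,d\tau$, where d = (1, 2, 3), e = (3, 2, 1), and $\mathcal{V}$ is a sphere
of radius 1.5 centered at (2, 2, 2).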


Problem 1.49. Evaluate the integral 


$$J = \int_{\mathcal{V}} e^{-r} \left(\nabla \cdot \frac{\hat{\mathbf{r}}}{r^2}\right) d\tau$$

(where V is a sphere of radius R, centered at the origin) by two different methods, as 
in Ex. 1.16. 


1.6 The Theory of Vector Fields 


This section anticipates issues that will arise in later chapters. I include 
it here for completeness, and to help you distinguish purely mathematical 
considerations from the physical context in which they occur. Read it now 
to get a sense of the “lay of the land,” and refer back to it as needed. 
1.6.1 The Helmholtz Theorem


Ever since Faraday, the laws of electricity and magnetism have been 
expressed in terms of electric and magnetic fields, E and B. Like many 
physical laws, these are most compactly expressed as differential equations. 
Since E and B are vectors, the differential equations naturally involve vector
derivatives: divergence and curl. Indeed, Maxwell reduced the entire theory 
to four equations, specifying respectively the divergence and the curl of E 
and B. 

Maxwell’s formulation raises an important mathematical question: To 
what extent is a vector function determined by its divergence and curl? In 
other words, if I tell you that the divergence of F (which stands for E or B, 
as the case may be) is a specified (scalar) function D, 


$$\nabla \cdot \mathbf{F} = D,$$

and the curl of F is a specified (vector) function C,

$$\nabla \times \mathbf{F} = \mathbf{C}$$

(for consistency, C must be divergenceless,

$$\nabla \cdot \mathbf{C} = 0,$$


because the divergence of a curl is always zero), can you then determine the 
function F? 

Well ... not quite. For example, as you may have discovered in Prob. 1.20,
there are many functions whose divergence and curl are both zero everywhere:
the trivial case F = 0, of course, but also $\mathbf{F} = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$,
$\mathbf{F} = \sin x \cosh y\,\hat{\mathbf{x}} - \cos x \sinh y\,\hat{\mathbf{y}}$, etc. To solve a differential equation you must
also be supplied with appropriate boundary conditions. In electrodynamics
we typically require that the fields go to zero “at infinity” (far away from all
charges).¹⁴ With that extra information, the Helmholtz theorem guarantees
that the field is uniquely determined by its divergence and curl. (The 
Helmholtz theorem is discussed in Appendix B.) 
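
To see concretely that the two nontrivial fields quoted above really are both divergenceless and curl-less, here is a rough finite-difference check. It is only a sketch: the helper `div_and_curl`, the step size h, and the sample point are arbitrary choices made for illustration, and a numerical check at a few points is no substitute for doing the derivatives by hand.

```python
import math

def div_and_curl(F, p, h=1e-5):
    """Central-difference divergence and curl of F at the point p = (x, y, z)."""
    def d(i, j):
        # Partial derivative of component i of F with respect to coordinate j.
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        return (F(*plus)[i] - F(*minus)[i]) / (2 * h)
    div = d(0, 0) + d(1, 1) + d(2, 2)
    curl = (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))
    return div, curl

def F_a(x, y, z):
    # F = yz x_hat + zx y_hat + xy z_hat
    return (y * z, z * x, x * y)

def F_b(x, y, z):
    # F = sin x cosh y x_hat - cos x sinh y y_hat
    return (math.sin(x) * math.cosh(y), -math.cos(x) * math.sinh(y), 0.0)

point = (0.3, -1.2, 0.7)  # arbitrary sample point
for name, F in (("F_a", F_a), ("F_b", F_b)):
    print(name, div_and_curl(F, point))
```

Both fields give zero (to within rounding) for the divergence and every component of the curl, yet neither field vanishes, and neither goes to zero at infinity. That is exactly why a boundary condition is needed before the divergence and curl can pin down F.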


1.6.2 Potentials 


If the curl of a vector field (F) vanishes (everywhere), then F can be written 
as the gradient of a scalar potential (V): 


$$\nabla \times \mathbf{F} = 0 \iff \mathbf{F} = -\nabla V. \qquad (1.103)$$


(The minus sign is purely conventional.) That’s the essential burden of the 
following theorem: 


14 In some textbook problems, the charge itself extends to infinity (we speak, for instance, of
the electric field of an infinite plane, or the magnetic field of an infinite wire). In such cases 
the normal boundary conditions do not apply, and one must invoke symmetry arguments 
to determine the fields uniquely. 
Theorem 1. Curl-less (or “irrotational”) fields. The following conditions
are equivalent (that is, F satisfies one if and only if it satisfies all
the others):


(a) $\nabla \times \mathbf{F} = 0$ everywhere.
(b) $\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{F} \cdot d\mathbf{l}$ is independent of path, for any given end points.
(c) $\oint \mathbf{F} \cdot d\mathbf{l} = 0$ for any closed loop.
(d) F is the gradient of some scalar function: $\mathbf{F} = -\nabla V$.


The potential is not unique — any constant can be added to V with 
impunity, since this will not affect its gradient. 
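
Conditions (b) and (d) of Theorem 1 can be illustrated numerically. The sketch below takes the curl-less field $\mathbf{F} = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$ from Section 1.6.1, for which $V = -xyz$ serves as a potential (this choice of V, like the two paths and the step count, is an assumption made for the illustration), and integrates F · dl from (0, 0, 0) to (1, 1, 1) along two different routes.

```python
def line_integral(F, path, n=5000):
    """Midpoint-rule estimate of the line integral of F along path(t), 0 <= t <= 1."""
    total = 0.0
    for i in range(n):
        t0, t1 = i / n, (i + 1) / n
        a, b = path(t0), path(t1)
        Fm = F(*path(0.5 * (t0 + t1)))  # field at the middle of the step
        total += sum(Fm[k] * (b[k] - a[k]) for k in range(3))
    return total

def F(x, y, z):
    # F = yz x_hat + zx y_hat + xy z_hat = -grad(V), with V = -xyz
    return (y * z, z * x, x * y)

def straight(t):
    return (t, t, t)

def twisted(t):
    return (t, t**2, t**3)

def V(x, y, z):
    return -x * y * z

print(line_integral(F, straight))      # straight-line route
print(line_integral(F, twisted))       # curved route, same end points
print(-(V(1, 1, 1) - V(0, 0, 0)))      # V(a) - V(b), what the gradient theorem predicts
```

Both routes give the same number, and it matches the difference in potential between the end points, as the gradient theorem requires for $\mathbf{F} = -\nabla V$.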

If the divergence of a vector field (F) vanishes (everywhere), then F can 
be expressed as the curl of a vector potential (A): 


$$\nabla \cdot \mathbf{F} = 0 \iff \mathbf{F} = \nabla \times \mathbf{A}. \qquad (1.104)$$

That’s the main conclusion of the following theorem: 


Theorem 2. Divergenceless (or “solenoidal”) fields. The following conditions
are equivalent:

(a) $\nabla \cdot \mathbf{F} = 0$ everywhere.
(b) $\int \mathbf{F} \cdot d\mathbf{a}$ is independent of surface, for any given boundary line.
(c) $\oint \mathbf{F} \cdot d\mathbf{a} = 0$ for any closed surface.
(d) F is the curl of some vector function: $\mathbf{F} = \nabla \times \mathbf{A}$.


The vector potential is not unique — the gradient of any scalar function 
can be added to A without affecting the curl, since the curl of a gradient 
is zero.

You should by now be able to prove all the connections in these theorems, 
save for the ones that say (a), (b), or (c) implies (d). Those are more 
subtle, and will come later. Incidentally, in all cases (whatever its curl and 
divergence may be) a vector field F can be written as the gradient of a scalar 
plus the curl of a vector:¹⁵

$$\mathbf{F} = -\nabla V + \nabla \times \mathbf{A} \quad \text{(always)}. \qquad (1.105)$$


Problem 1.50. 


(a) Let $\mathbf{F}_1 = x^2\,\hat{\mathbf{z}}$ and $\mathbf{F}_2 = x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}$. Calculate the divergence and curl of $\mathbf{F}_1$ and
$\mathbf{F}_2$. Which one can be written as the gradient of a scalar? Find a scalar potential that does
the job. Which one can be written as the curl of a vector? Find a suitable vector potential.

(b) Show that $\mathbf{F}_3 = yz\,\hat{\mathbf{x}} + zx\,\hat{\mathbf{y}} + xy\,\hat{\mathbf{z}}$ can be written both as the gradient of a scalar and
as the curl of a vector. Find scalar and vector potentials for this function.


15 In physics, the word field denotes generically any function of position (x, y, z) and time (t).
But in electrodynamics, two particular fields (E and B) are of such paramount importance 
as to preempt the term. Thus, technically, the potentials are also “fields,” but we never call 
them that. 
Problem 1.51. For Theorem 1, show that (d) ⇒ (a), (a) ⇒ (c), (c) ⇒ (b), (b) ⇒ (c),
and (c) ⇒ (a).


Problem 1.52. For Theorem 2, show that (d) ⇒ (a), (a) ⇒ (c), (c) ⇒ (b), (b) ⇒ (c),
and (c) ⇒ (a).


Problem 1.53. 


(a) Which of the vectors in Prob. 1.15 can be expressed as the gradient of a scalar? Find a scalar 
function that does the job. 
(b) Which can be expressed as the curl of a vector? Find such a vector. 


More Problems on Chapter 1 


Problem 1.54. Check the divergence theorem for the function

$$\mathbf{v} = r^2 \cos\theta\,\hat{\mathbf{r}} + r^2 \cos\phi\,\hat{\boldsymbol{\theta}} - r^2 \cos\theta \sin\phi\,\hat{\boldsymbol{\phi}},$$

using as your volume one octant of the sphere of radius R (Fig. 1.48). Make sure you include the
entire surface. [Answer: $\pi R^4/4$.]


Problem 1.55. Check Stokes’ theorem using the function $\mathbf{v} = ay\,\hat{\mathbf{x}} + bx\,\hat{\mathbf{y}}$ (a and b are
constants) and the circular path of radius R, centered at the origin in the xy plane. [Answer:
$\pi R^2 (b - a)$.]

Problem 1.56. Compute the line integral of

$$\mathbf{v} = 6\,\hat{\mathbf{x}} + yz^2\,\hat{\mathbf{y}} + (3y + z)\,\hat{\mathbf{z}}$$

along the triangular path shown in Fig. 1.49. Check your answer using Stokes’ theorem.
[Answer: 8/3.]


Problem 1.57. Compute the line integral of

$$\mathbf{v} = (r \cos^2\theta)\,\hat{\mathbf{r}} - (r \cos\theta \sin\theta)\,\hat{\boldsymbol{\theta}} + 3r\,\hat{\boldsymbol{\phi}}$$

around the path shown in Fig. 1.50 (the points are labeled by their Cartesian coordinates). Do
it either in cylindrical or in spherical coordinates. Check your answer, using Stokes’ theorem.
[Answer: $3\pi/2$.]


Problem 1.58. Check Stokes’ theorem for the function $\mathbf{v} = y\,\hat{\mathbf{z}}$, using the triangular surface
shown in Fig. 1.51. [Answer: $a^2$.]


Problem 1.59. Check the divergence theorem for the function 


$$\mathbf{v} = r^2 \sin\theta\,\hat{\mathbf{r}} + 4r^2 \cos\theta\,\hat{\boldsymbol{\theta}} + r^2 \tan\theta\,\hat{\boldsymbol{\phi}},$$

using the volume of the “ice-cream cone” shown in Fig. 1.52 (the top surface is spherical, with
radius R and centered at the origin). [Answer: $(\pi R^4/12)(2\pi + 3\sqrt{3})$.]

[Figures 1.48, 1.50, and 1.52]


Problem 1.60. Here are two cute checks of the fundamental theorems: 


(a) Combine Corollary 2 to the gradient theorem with Stokes’ theorem ($\mathbf{v} = \nabla T$ in this case).
Show that the result is consistent with what you already knew about second derivatives. 

(b) Combine Corollary 2 to Stokes’ theorem with the divergence theorem. Show that the result 
is consistent with what you already knew. 


Problem 1.61. Although the gradient, divergence, and curl theorems are the fundamental 
integral theorems of vector calculus, it is possible to derive a number of corollaries from them. 
Show the following: 


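(a) $\int_{\mathcal{V}} (\nabla T)\,d\tau = \oint_{\mathcal{S}} T\,d\mathbf{a}$. [Hint: Let $\mathbf{v} = \mathbf{c}T$, where c is a constant vector, in the divergence
theorem; use the product rules.]

(b) $\int_{\mathcal{V}} (\nabla \times \mathbf{v})\,d\tau = -\oint_{\mathcal{S}} \mathbf{v} \times d\mathbf{a}$. [Hint: Replace v by (v × c) in the divergence theorem.]

(c) $\int_{\mathcal{V}} \left[T \nabla^2 U + (\nabla T) \cdot (\nabla U)\right] d\tau = \oint_{\mathcal{S}} (T \nabla U) \cdot d\mathbf{a}$. [Hint: Let $\mathbf{v} = T \nabla U$ in the
divergence theorem.]

(d) $\int_{\mathcal{V}} (T \nabla^2 U - U \nabla^2 T)\,d\tau = \oint_{\mathcal{S}} (T \nabla U - U \nabla T) \cdot d\mathbf{a}$. [Comment: This is sometimes
called Green’s second identity; it follows from (c), which is known as Green’s identity.]

(e) $\int_{\mathcal{S}} \nabla T \times d\mathbf{a} = -\oint_{\mathcal{P}} T\,d\mathbf{l}$. [Hint: Let $\mathbf{v} = \mathbf{c}T$ in Stokes’ theorem.]

(f) $\int_{\mathcal{S}} \left[(d\mathbf{a} \times \nabla) \times \mathbf{v}\right] = -\oint_{\mathcal{P}} \mathbf{v} \times d\mathbf{l}$. [Hint: Replace v by (v × c) in Stokes’ theorem.]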
Problem 1.62. The integral

$$\mathbf{a} \equiv \int_{\mathcal{S}} d\mathbf{a} \qquad (1.106)$$

is sometimes called the vector area of the surface $\mathcal{S}$. If $\mathcal{S}$ happens to be flat, then $|\mathbf{a}|$ is the
ordinary (scalar) area, obviously.


(a) Find the vector area of a hemispherical bowl of radius R. 

(b) Show that a = 0 for any closed surface. [Hint: Use Prob. 1.61a.] 

(c) Show that a is the same for all surfaces sharing the same boundary. 
(d) Show that

$$\mathbf{a} = \frac{1}{2} \oint \mathbf{r} \times d\mathbf{l}, \qquad (1.107)$$

where the integral is around the boundary line. [Hint: One way to do it is to draw the
cone subtended by the loop at the origin. Divide the conical surface up into infinitesimal
triangular wedges, each with vertex at the origin and opposite side dl, and exploit the
geometrical interpretation of the cross product (Fig. 1.8).]

(e) Show that

$$\oint (\mathbf{c} \cdot \mathbf{r})\,d\mathbf{l} = \mathbf{a} \times \mathbf{c} \qquad (1.108)$$

for any constant vector c. [Hint: Let $T = \mathbf{c} \cdot \mathbf{r}$ in Prob. 1.61e.]
Problem 1.63. 


(a) Find the divergence of the function

$$\mathbf{v} = \frac{\hat{\mathbf{r}}}{r}.$$

First compute it directly, as in Eq. 1.84. Test your result using the divergence theorem,
as in Eq. 1.85. Is there a delta function at the origin, as there was for $\hat{\mathbf{r}}/r^2$? What is the
general formula for the divergence of $r^n\,\hat{\mathbf{r}}$? [Answer: $\nabla \cdot (r^n\,\hat{\mathbf{r}}) = (n + 2)\,r^{n-1}$, unless
n = −2, in which case it is $4\pi\,\delta^3(\mathbf{r})$; for n < −2, the divergence is ill defined at the
origin.]

(b) Find the curl of $r^n\,\hat{\mathbf{r}}$. Test your conclusion using Prob. 1.61b. [Answer: $\nabla \times (r^n\,\hat{\mathbf{r}}) = 0$.]


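Problem 1.64. In case you're not persuaded that $\nabla^2(1/r) = -4\pi\,\delta^3(\mathbf{r})$ (Eq. 1.102, with
r′ = 0 for simplicity), try replacing r by $\sqrt{r^2 + \epsilon^2}$, and watching what happens as ε → 0.¹⁶
Specifically, let

$$D(r, \epsilon) \equiv -\frac{1}{4\pi}\,\nabla^2 \frac{1}{\sqrt{r^2 + \epsilon^2}}.$$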

16 This problem was suggested by Frederick Strauch. 
To demonstrate that this goes to $\delta^3(\mathbf{r})$ as ε → 0:

(a) show that $D(r, \epsilon) = (3\epsilon^2/4\pi)(r^2 + \epsilon^2)^{-5/2}$;
(b) check that D(0, ε) → ∞ as ε → 0;
(c) check that D(r, ε) → 0, as ε → 0, for all r ≠ 0;
(d) check that the integral of D(r, ε) over all space is 1.


Problem 1.65. (a) Check Stokes’ theorem for the vector function

$$\mathbf{A} = \frac{-y\,\hat{\mathbf{x}} + x\,\hat{\mathbf{y}}}{x^2 + y^2},$$

using a circle of radius R in the xy plane. Diagnose the problem, and fix it by correcting
the curl of A. Use Cartesian coordinates.
(b) Convert A to cylindrical coordinates, and repeat part (a), this time doing everything in
terms of s, φ, and z.


Electrostatics 


2.1 The Electric Field 


2.1.1 Introduction 


The fundamental problem electrodynamics hopes to solve is this (Fig. 2.1): 
We have some electric charges, q1,q2,q3,... (call them source charges); 
what force do they exert on another charge, Q (call it the test charge)? 
The positions of the source charges are given (as functions of time); the 
trajectory of the test particle is to be calculated. In general, both the source 
charges and the test charge are in motion. 

The solution to this problem is facilitated by the principle of superposi- 
tion, which states that the interaction between any two charges is completely 
unaffected by the presence of others. This means that to determine the force 
on Q, we can first compute the force $\mathbf{F}_1$, due to $q_1$ alone (ignoring all the
others); then we compute the force $\mathbf{F}_2$, due to $q_2$ alone; and so on. Finally,
we take the vector sum of all these individual forces: $\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 + \cdots$.
Thus, if we can find the force on Q due to a single source charge q, we are, in 
principle, done (the rest is just a question of repeating the same operation 
over and over, and adding it all up).! 

Well, at first sight this looks very easy: Why don’t I just write down the 
formula for the force on Q due to q, and be done with it? I could, and in 
Chapter 10 I shall, but you would be shocked to see it at this stage, for 
not only does the force on Q depend on the separation distance $\mathscr{r}$ between
the charges (Fig. 2.2), it also depends on both their velocities and on the 
acceleration of q. Moreover, it is not the position, velocity, and acceleration 
of q right now that matter: electromagnetic “news” travels at the speed of 
light, so what concerns Q is the position, velocity, and acceleration q had at 
some earlier time, when the message left. 

Therefore, in spite of the fact that the basic question (“What is the force 
on Q due to q?”) is easy to state, it does not pay to confront it head on; 
rather, we shall go at it by stages. In the meantime, the theory we develop 


1 The principle of superposition may seem “obvious” to you, but it did not have to be so
simple: if the electromagnetic force were proportional to the square of the total source
charge, for instance, the principle of superposition would not hold, since $(q_1 + q_2)^2 \neq q_1^2 + q_2^2$
(there would be “cross terms” to consider). Superposition is not a logical necessity, but an
experimental fact.
[Figs. 2.1 and 2.2: “source” charges q and the “test” charge Q]


will allow for the solution of more subtle electromagnetic problems that do 
not present themselves in quite this simple format. To begin with, we shall 
consider the special case of electrostatics in which all the source charges are 
stationary (though the test charge may be moving). 


2.1.2 Coulomb's Law 
What is the force on a test charge Q due to a single point charge q, that is
at rest a distance $\mathscr{r}$ away? The answer (based on experiments) is given by
Coulomb’s law:

$$\mathbf{F} = \frac{1}{4\pi\epsilon_0}\,\frac{qQ}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}. \qquad (2.1)$$

The constant $\epsilon_0$ is called (ludicrously) the permittivity of free space. In SI
units, where force is in newtons (N), distance in meters (m), and charge in
coulombs (C),

$$\epsilon_0 = 8.85 \times 10^{-12}\ \frac{\mathrm{C}^2}{\mathrm{N \cdot m}^2}.$$

In words, the force is proportional to the product of the charges and
inversely proportional to the square of the separation distance. As always
(Section 1.1.4), $\boldsymbol{\mathscr{r}}$ is the separation vector from r′ (the location of q) to r (the
location of Q):

$$\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}'; \qquad (2.2)$$

$\mathscr{r}$ is its magnitude, and $\hat{\boldsymbol{\mathscr{r}}}$ is its direction. The force points along the line from
q to Q; it is repulsive if q and Q have the same sign, and attractive if their
signs are opposite.

Coulomb’s law and the principle of superposition constitute the physical 
input for electrostatics — the rest, except for some special properties of 
matter, is mathematical elaboration of these fundamental rules. 


Problem 2.1. 


(a) Twelve equal charges, q, are situated at the corners of a regular 12-sided polygon (for 
instance, one on each numeral of a clock face). What is the net force on a test charge Q 
at the center? 
(b) Suppose one of the 12 qs is removed (the one at “6 o'clock”). What is the force on Q? Explain
your reasoning carefully.

(c) Now 13 equal charges, q, are placed at the corners of a regular 13-sided polygon. What is the
force on a test charge Q at the center?

(d) If one of the 13 qs is removed, what is the force on Q? Explain your reasoning. 


2.1.3 The Electric Field 


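If we have several point charges $q_1, q_2, \ldots, q_n$, at distances $\mathscr{r}_1, \mathscr{r}_2, \ldots, \mathscr{r}_n$
from Q, the total force on Q is evidently

$$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \cdots = \frac{1}{4\pi\epsilon_0} \left(\frac{q_1 Q}{\mathscr{r}_1^2}\,\hat{\boldsymbol{\mathscr{r}}}_1 + \frac{q_2 Q}{\mathscr{r}_2^2}\,\hat{\boldsymbol{\mathscr{r}}}_2 + \cdots\right) = \frac{Q}{4\pi\epsilon_0} \left(\frac{q_1}{\mathscr{r}_1^2}\,\hat{\boldsymbol{\mathscr{r}}}_1 + \frac{q_2}{\mathscr{r}_2^2}\,\hat{\boldsymbol{\mathscr{r}}}_2 + \frac{q_3}{\mathscr{r}_3^2}\,\hat{\boldsymbol{\mathscr{r}}}_3 + \cdots\right),$$

or

$$\mathbf{F} = Q\,\mathbf{E}, \qquad (2.3)$$

where

$$\mathbf{E}(\mathbf{r}) \equiv \frac{1}{4\pi\epsilon_0} \sum_{i=1}^{n} \frac{q_i}{\mathscr{r}_i^2}\,\hat{\boldsymbol{\mathscr{r}}}_i. \qquad (2.4)$$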


The quantity E is called the electric field of the source charges. Notice that 
it is a function of position (r), because the separation vectors 2; depend on 
the location of the field point P (Fig. 2.3). But it makes no reference to the 
test charge Q. The electric field is a vector quantity that varies from point to 
point and is determined by the configuration of source charges; physically, 
E(r) is the force per unit charge that would be exerted on a test charge, if 
you were to place one at P. 

What exactly is an electric field? I have deliberately begun with what 
you might call the “minimal” interpretation of E, as an intermediate 
step in the calculation of electric forces. But I encourage you to think 
of the field as a “real” physical entity, filling the space around electric
charges. Maxwell himself came to believe that electric and magnetic fields
are stresses and strains in an invisible primordial jellylike “ether.” Special 
relativity has forced us to abandon the notion of ether, and with it Maxwell’s 
mechanical interpretation of electromagnetic fields. (It is even possible, 
though cumbersome, to formulate classical electrodynamics as an “action- 
at-a-distance” theory, and dispense with the field concept altogether.) I can’t 
tell you, then, what a field is — only how to calculate it and what it can do 
for you once you’ve got it. 


Example 2.1. Find the electric field a distance z above the midpoint 
between two equal charges (q), a distance d apart (Fig. 2.4a). 


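Solution. Let $\mathbf{E}_1$ be the field of the left charge alone, and $\mathbf{E}_2$ that
of the right charge alone (Fig. 2.4b). Adding them (vectorially), the
horizontal components cancel and the vertical components conspire:

$$E_z = 2\,\frac{1}{4\pi\epsilon_0}\,\frac{q}{\mathscr{r}^2}\,\cos\theta.$$

Here $\mathscr{r} = \sqrt{z^2 + (d/2)^2}$ and $\cos\theta = z/\mathscr{r}$, so

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{2qz}{\left[z^2 + (d/2)^2\right]^{3/2}}\,\hat{\mathbf{z}}.$$

Check: When z ≫ d you're so far away that it just looks like a single
charge 2q, so the field should reduce to $\mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{2q}{z^2}\,\hat{\mathbf{z}}$. And it does
(just set d → 0 in the formula).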
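
The superposition sum of Eq. 2.4 translates almost line for line into code. The following is a minimal sketch (the function names, the tuple representation of vectors, and the sample numbers are all choices made for this illustration) that adds up the fields of a list of point charges and compares the result with the closed-form answer of Example 2.1.

```python
import math

EPS0 = 8.85e-12                  # permittivity of free space, C^2/(N m^2)
K = 1.0 / (4.0 * math.pi * EPS0)

def electric_field(charges, r):
    """Eq. 2.4: charges is a list of (q, source_point); r is the field point."""
    E = [0.0, 0.0, 0.0]
    for q, r_src in charges:
        sep = [r[k] - r_src[k] for k in range(3)]      # separation vector
        dist = math.sqrt(sum(s * s for s in sep))      # its magnitude
        for k in range(3):
            # (q / dist^2) times the unit vector sep / dist
            E[k] += K * q * sep[k] / dist**3
    return tuple(E)

def example_2_1_field(q, d, z):
    """Closed-form z component from Example 2.1."""
    return K * 2.0 * q * z / (z**2 + (d / 2.0)**2) ** 1.5

q, d, z = 1e-9, 0.2, 0.5         # arbitrary sample values, SI units
pair = [(q, (-d / 2, 0.0, 0.0)), (q, (d / 2, 0.0, 0.0))]
print(electric_field(pair, (0.0, 0.0, z)))
print(example_2_1_field(q, d, z))
```

The x components cancel and the z component agrees with the formula above. Note that the test charge Q never appears: the field depends only on the source charges and the field point.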


Problem 2.2. Find the electric field (magnitude and direction) a distance z above the midpoint
between equal and opposite charges (±q), a distance d apart (same as Ex. 2.1, except
that the charge at x = +d/2 is −q).
2.1.4 Continuous Charge Distributions


Our definition of the electric field (Eq. 2.4) assumes that the source of the
field is a collection of discrete point charges $q_i$. If, instead, the charge is
distributed continuously over some region, the sum becomes an integral
(Fig. 2.5a):

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{1}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,dq. \qquad (2.5)$$

If the charge is spread out along a line (Fig. 2.5b), with charge-per-unit-length
λ, then dq = λ dl′ (where dl′ is an element of length along the line);
if the charge is smeared out over a surface (Fig. 2.5c), with charge-per-unit-area
σ, then dq = σ da′ (where da′ is an element of area on the surface); and
if the charge fills a volume (Fig. 2.5d), with charge-per-unit-volume ρ, then
dq = ρ dτ′ (where dτ′ is an element of volume):

$$dq \to \lambda\,dl' \sim \sigma\,da' \sim \rho\,d\tau'.$$

Thus the electric field of a line charge is

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\lambda(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,dl'; \qquad (2.6)$$

for a surface charge,

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\sigma(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,da'; \qquad (2.7)$$

and for a volume charge,

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\rho(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,d\tau'. \qquad (2.8)$$

Equation 2.8 itself is often referred to as “Coulomb’s law,” because it is
such a short step from the original (Eq. 2.1), and because a volume charge
is in some sense the most general and realistic case. Please note carefully
the meaning of $\boldsymbol{\mathscr{r}}$ in these formulas. Originally, in Eq. 2.4, $\boldsymbol{\mathscr{r}}_i$ stood for the
vector from the source charge $q_i$ to the field point r. Correspondingly, in
Eqs. 2.5-2.8, $\boldsymbol{\mathscr{r}}$ is the vector from dq (therefore from dl′, da′, or dτ′) to the
field point r.²

[Fig. 2.5: (a) Continuous distribution; (b) Line charge, λ; (c) Surface charge, σ; (d) Volume charge, ρ]


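Example 2.2. Find the electric field a distance z above the midpoint of
a straight line segment of length 2L that carries a uniform line charge
λ (Fig. 2.6).


Solution. The simplest method is to chop the line into symmetrically
placed pairs (at ±x), quote the result of Ex. 2.1 (with d/2 → x, q →
λ dx), and integrate (x: 0 → L). But here’s a more general approach:

$$\mathbf{r} = z\,\hat{\mathbf{z}}, \quad \mathbf{r}' = x\,\hat{\mathbf{x}}, \quad dl' = dx;$$

$$\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}' = z\,\hat{\mathbf{z}} - x\,\hat{\mathbf{x}}, \quad \mathscr{r} = \sqrt{z^2 + x^2}, \quad \hat{\boldsymbol{\mathscr{r}}} = \frac{\boldsymbol{\mathscr{r}}}{\mathscr{r}} = \frac{z\,\hat{\mathbf{z}} - x\,\hat{\mathbf{x}}}{\sqrt{z^2 + x^2}}.$$

$$\begin{aligned}
\mathbf{E} &= \frac{1}{4\pi\epsilon_0} \int_{-L}^{L} \frac{\lambda}{z^2 + x^2}\,\frac{z\,\hat{\mathbf{z}} - x\,\hat{\mathbf{x}}}{\sqrt{z^2 + x^2}}\,dx \\
&= \frac{\lambda}{4\pi\epsilon_0} \left[ z\,\hat{\mathbf{z}} \int_{-L}^{L} \frac{1}{(z^2 + x^2)^{3/2}}\,dx - \hat{\mathbf{x}} \int_{-L}^{L} \frac{x}{(z^2 + x^2)^{3/2}}\,dx \right] \\
&= \frac{\lambda}{4\pi\epsilon_0} \left[ z\,\hat{\mathbf{z}} \left( \frac{x}{z^2 \sqrt{z^2 + x^2}} \right) \Bigg|_{-L}^{L} - \hat{\mathbf{x}} \left( -\frac{1}{\sqrt{z^2 + x^2}} \right) \Bigg|_{-L}^{L} \right] \\
&= \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda L}{z \sqrt{z^2 + L^2}}\,\hat{\mathbf{z}}.
\end{aligned}$$

For points far from the line (z ≫ L),

$$E \cong \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda L}{z^2}.$$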

2 Warning: The unit vector $\hat{\boldsymbol{\mathscr{r}}}$ is not constant; its direction depends on the source point r′, and
hence it cannot be taken outside the integrals (Eqs. 2.5-2.8). In practice, you must work with
Cartesian components ($\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, $\hat{\mathbf{z}}$ are constant, and do come out), even if you use curvilinear
coordinates to perform the integration.

3 Ordinarily I’ll put a prime on the source coordinates, but where no confusion is likely to
arise I’ll remove the prime to simplify the notation.
This makes sense: From far away the line looks like a point charge
q = 2λL. In the limit L → ∞, on the other hand, we obtain the field
of an infinite straight wire:

$$E = \frac{1}{4\pi\epsilon_0}\,\frac{2\lambda}{z}. \qquad (2.9)$$
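
When an integral like the one in Example 2.2 cannot be done in closed form, Eq. 2.6 can still be evaluated numerically. As a rough illustration (a sketch only: the midpoint rule, the number of slices n, and the choice of units with 1/4πε₀ = 1 are assumptions made here for simplicity), the code below chops the segment into short pieces, adds up their contributions, and compares the result with the exact answer.

```python
import math

def line_segment_field(lam, L, z, n=20000):
    """Midpoint-rule version of Eq. 2.6 for a segment from x = -L to x = +L.

    Returns (Ex, Ez) at the point (0, 0, z), in units where 1/(4 pi eps0) = 1.
    """
    dx = 2.0 * L / n
    Ex = Ez = 0.0
    for i in range(n):
        x = -L + (i + 0.5) * dx            # source point on the line
        s3 = (z * z + x * x) ** 1.5        # (separation distance)^3
        Ex += lam * (-x) / s3 * dx         # x component of the separation vector is -x
        Ez += lam * z / s3 * dx            # z component is z
    return Ex, Ez

def closed_form_Ez(lam, L, z):
    """The result of Example 2.2, in the same units."""
    return 2.0 * lam * L / (z * math.sqrt(z * z + L * L))

lam, L, z = 1.0, 1.0, 0.5                  # arbitrary sample values
print(line_segment_field(lam, L, z))
print(closed_form_Ez(lam, L, z))
# For a very long line the result approaches 2*lam/z, as in Eq. 2.9:
print(closed_form_Ez(lam, 1e6, z), 2.0 * lam / z)
```

The x component sums to zero by symmetry, just as the second integral in the solution vanishes, and the z component matches the closed form to the accuracy of the quadrature.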


Problem 2.3. Find the electric field a distance z above one end of a straight line segment of
length L (Fig. 2.7) that carries a uniform line charge λ. Check that your formula is consistent
with what you would expect for the case z ≫ L.


Problem 2.4. Find the electric field a distance z above the center of a square loop (side length
a) carrying a uniform line charge λ (Fig. 2.8). [Hint: Use the result of Ex. 2.2.]


Problem 2.5. Find the electric field a distance z above the center of a circular loop of radius r
(Fig. 2.9) that carries a uniform line charge λ.


Problem 2.6. Find the electric field a distance z above the center of a flat circular disk of radius
R (Fig. 2.10) that carries a uniform surface charge σ. What does your formula give in the limit
R → ∞? Also check the case z ≫ R.


[Figs. 2.7, 2.8, 2.9, and 2.10]


! Problem 2.7. Find the electric field a distance z from the center of a spherical surface of radius
R (Fig. 2.11) that carries a uniform charge density σ. Treat the case z < R (inside) as well as
z > R (outside). Express your answers in terms of the total charge q on the sphere. [Hint:
Use the law of cosines to write $\mathscr{r}$ in terms of R and θ. Be sure to take the positive square root:
$\sqrt{R^2 + z^2 - 2Rz} = (R - z)$ if R > z, but it’s (z − R) if R < z.]
Problem 2.8. Use your result in Prob. 2.7 to find the field inside and outside a solid sphere of
radius R that carries a uniform volume charge density ρ. Express your answers in terms of the
total charge of the sphere, q. Draw a graph of |E| as a function of the distance from the center.


2.2 Divergence and Curl of Electrostatic Fields 


2.2.1 Field Lines, Flux, and Gauss’s Law 


In principle, we are done with the subject of electrostatics. Equation 2.8 tells 
us how to compute the field of a charge distribution, and Eq. 2.3 tells us 
what the force on a charge Q placed in this field will be. Unfortunately, 
as you may have discovered in working Prob. 2.7, the integrals involved 
in computing E can be formidable, even for reasonably simple charge 
distributions. Much of the rest of electrostatics is devoted to assembling 
a bag of tools and tricks for avoiding these integrals. It all begins with the 
divergence and curl of E. I shall calculate the divergence of E directly from 
Eq. 2.8, in Section 2.2.2, but first I want to show you a more qualitative, 
and perhaps more illuminating, intuitive approach. 

Let’s begin with the simplest possible case: a single point charge q,
situated at the origin:

$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}. \qquad (2.10)$$

To get a “feel” for this field, I might sketch a few representative vectors, as
in Fig. 2.12a. Because the field falls off like $1/r^2$, the vectors get shorter as
you go farther away from the origin; they always point radially outward.
But there is a nicer way to represent this field, and that’s to connect up 
the arrows, to form field lines (Fig. 2.12b). You might think that I have 
thereby thrown away information about the strength of the field, which 
was contained in the lengths of the arrows. But actually I have not. The 
magnitude of the field is indicated by the density of the field lines: it’s strong
near the center where the field lines are close together, and weak farther out,
where they are relatively far apart.

[Fig. 2.12: (a) representative field vectors; (b) field lines]

In truth, the field-line diagram is deceptive, when I draw it on a two-dimensional
surface, for the density of lines passing through a circle of
radius r is the total number divided by the circumference ($n/2\pi r$), which
goes like (1/r), not ($1/r^2$).⁴ But if you imagine the model in three dimensions
(a pincushion with needles sticking out in all directions), then the density of
lines is the total number divided by the area of the sphere ($n/4\pi r^2$), which
does go like ($1/r^2$).

Such diagrams are also convenient for representing more complicated 
fields. Of course, the number of lines you draw depends on how lazy you 
are (and how sharp your pencil is), though you ought to include enough to 
get an accurate sense of the field, and you must be consistent: if q gets 8 
lines, then 2g deserves 16. And you must space them fairly — they emanate 
from a point charge symmetrically in all directions. Field lines begin on 
positive charges and end on negative ones; they cannot simply terminate 
in midair, though they may extend out to infinity. Moreover, field lines 
can never cross — at the intersection, the field would have two different 
directions at once! With all this in mind, it is easy to sketch the field of 
any simple configuration of point charges: begin by drawing the lines in the 
neighborhood of each charge, and then connect them up or extend them to 
infinity (Figs. 2.13 and 2.14). 

In this model, the flux of E through a surface S, 


$$\Phi_E \equiv \int_{\mathcal{S}} \mathbf{E} \cdot d\mathbf{a}, \qquad (2.11)$$


is a measure of the “number of field lines” passing through S. I put this 
in quotes because of course we can only draw a representative sample of 
the field lines — the total number would be infinite. But for a given sampling 


[Fig. 2.13: Opposite charges]


4 For a critique of electric field-line diagrams, see A. Wolf et al., Am. J. Phys. 64, 714 (1996). 

5 If they did, the divergence of E would not be zero, and (as we shall soon see) that cannot
happen in empty space. Of course, in a region with charge density ρ ≠ 0 some field lines will
terminate.
[Fig. 2.14: Equal charges]


rate the flux is proportional to the number of lines drawn, because the field 
strength, remember, is proportional to the density of field lines (the number 
per unit area), and hence E - da is proportional to the number of lines 
passing through the infinitesimal area da. (The dot product picks out the 
component of da along the direction of E, as indicated in Fig. 2.15. It is 
the area in the plane perpendicular to E that we have in mind when we say 
that the density of field lines is the number per unit area.) 

This suggests that the flux through any closed surface is a measure of 
the total charge inside, for the field lines that originate on a positive charge 
must either pass out through the surface or else terminate on a negative 
charge inside (Fig. 2.16a). On the other hand, a charge outside the surface 
will contribute nothing to the total flux, since its field lines pass in one side 
and out the other (Fig. 2.16b). This is the essence of Gauss’s law. Now let’s 
make it quantitative. 


[Figs. 2.15 and 2.16]
In the case of a point charge q at the origin, the flux of E through a
spherical surface of radius r is

$$\oint \mathbf{E} \cdot d\mathbf{a} = \int \frac{1}{4\pi\epsilon_0} \left(\frac{q}{r^2}\,\hat{\mathbf{r}}\right) \cdot \left(r^2 \sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}}\right) = \frac{1}{\epsilon_0}\,q. \qquad (2.12)$$

Notice that the radius of the sphere cancels out, for while the surface area
goes up as $r^2$, the field goes down as $1/r^2$, so the product is constant. In terms
of the field-line picture, this makes good sense, since the same number of
field lines pass through any sphere centered at the origin, regardless of its
size. In fact, it didn’t have to be a sphere; any closed surface, whatever its
shape, would be pierced by the same number of field lines. Evidently, the
flux through any surface enclosing the charge is $q/\epsilon_0$.

Now suppose that instead of a single charge at the origin, we have a 
bunch of charges scattered about. According to the principle of superposi- 
tion, the total field is the (vector) sum of all the individual fields: 


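$$\mathbf{E} = \sum_{i=1}^{n} \mathbf{E}_i.$$

The flux through a surface that encloses them all is

$$\oint \mathbf{E} \cdot d\mathbf{a} = \sum_{i=1}^{n} \left(\oint \mathbf{E}_i \cdot d\mathbf{a}\right) = \sum_{i=1}^{n} \left(\frac{1}{\epsilon_0}\,q_i\right).$$

For any closed surface, then,

$$\oint \mathbf{E} \cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}, \qquad (2.13)$$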


where $Q_{\text{enc}}$ is the total charge enclosed within the surface. This is the
quantitative statement of Gauss’s law. Although it contains no information
that was not already present in Coulomb’s law plus the principle of
superposition, it is of almost magical power, as you will see in Section 2.2.3.
Notice that it all hinges on the $1/r^2$ character of Coulomb’s law; without
that the crucial cancelation of the r’s in Eq. 2.12 would not occur, and the
total flux of E would depend on the surface chosen, not merely on the total
charge enclosed. Other $1/r^2$ forces (I am thinking particularly of Newton’s
law of universal gravitation) will obey “Gauss’s laws” of their own, and the 
applications we develop here carry over directly. 
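
The claim that the flux depends only on the enclosed charge, and not on where that charge sits inside the surface, is easy to test numerically. The sketch below is an illustration under stated assumptions (a spherical surface centered at the origin, a simple midpoint grid in θ and φ, and function and parameter names invented for the purpose); it computes the flux of a point charge's field, divided by q/ε₀, for a charge placed off center inside the sphere and for one placed outside.

```python
import math

def flux_fraction(charge_pos, R, n_theta=200, n_phi=200):
    """Flux of a point charge's field through the sphere r = R, in units of q/eps0.

    The charge q sits at charge_pos; the result should be 1 if the charge is
    inside the sphere and 0 if it is outside.
    """
    cx, cy, cz = charge_pos
    dth = math.pi / n_theta
    dph = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        th = (i + 0.5) * dth
        st, ct = math.sin(th), math.cos(th)
        for j in range(n_phi):
            ph = (j + 0.5) * dph
            # Outward unit normal at this patch of the sphere.
            nx, ny, nz = st * math.cos(ph), st * math.sin(ph), ct
            # Separation vector from the charge to the patch.
            sx, sy, sz = R * nx - cx, R * ny - cy, R * nz - cz
            s3 = (sx * sx + sy * sy + sz * sz) ** 1.5
            # (separation unit vector / s^2) . n, times the patch area.
            total += (sx * nx + sy * ny + sz * nz) / s3 * R * R * st * dth * dph
    return total / (4.0 * math.pi)

print(flux_fraction((0.0, 0.0, 0.0), 1.0))   # charge at the center
print(flux_fraction((0.3, 0.0, 0.2), 1.0))   # charge inside, off center
print(flux_fraction((0.0, 0.0, 2.0), 1.0))   # charge outside the sphere
```

For the off-center charge the field is far from uniform over the surface, yet the total flux comes out the same as for the centered charge; the outside charge contributes nothing, since whatever flux enters one side leaves through the other.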

As it stands, Gauss’s law is an integral equation, but we can easily turn 
it into a differential one, by applying the divergence theorem: 


$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} = \int_{\mathcal{V}} (\nabla\cdot\mathbf{E})\,d\tau.$$


Rewriting $Q_{\text{enc}}$ in terms of the charge density $\rho$, we have


$$Q_{\text{enc}} = \int_{\mathcal{V}} \rho\,d\tau.$$


6 If there’s some more charge, outside the surface, it will contribute to the field, of course, but
not to the flux.
So Gauss’s law becomes


$$\int_{\mathcal{V}} (\nabla\cdot\mathbf{E})\,d\tau = \int_{\mathcal{V}} \left(\frac{\rho}{\epsilon_0}\right) d\tau.$$


And since this holds for any volume, the integrands must themselves be
equal:⁷


$$\nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\,\rho. \tag{2.14}$$


Equation 2.14 carries the same message as Eq. 2.13; it is Gauss’s law in 
differential form. The differential version is tidier, but the integral form has 
the advantage in that it accommodates point, line, and surface charges more 
naturally. 


Problem 2.9. Suppose the electric field in some region is found to be E = Arf, in spherical 
coordinates (/ is some constant). 


(a) Find the charge density p. 
(b) Find the total charge contained in a sphere of radius R, centered at the origin. (Do it two 
different ways.) 


Problem 2.10. A charge q sits at the back corner of a cube, as shown in Fig. 2.17. What is the 
flux of E through the shaded side? 


Problem 2.11. Draw a field-line diagram for a uniformly charged solid sphere. 


2.2.2 The Divergence of E 
Let’s go back, now, and calculate the divergence of E directly from Eq. 2.8: 


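$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int_{\text{all space}} \frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,\rho(\mathbf{r}')\,d\tau'. \tag{2.15}$$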


7 Think about it: Suppose the integrands were different, in the neighborhood of some point
P. Then I could choose $\mathcal{V}$ to be a tiny volume centered at P, and the integrals would not be
equal.
(Originally the integration was over the volume occupied by the charge, but
I may as well extend it to all space, since $\rho = 0$ in the exterior region anyway.)
Noting that the r-dependence is contained in $\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}'$, we have


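$$\nabla\cdot\mathbf{E} = \frac{1}{4\pi\epsilon_0}\int \nabla\cdot\left(\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right)\rho(\mathbf{r}')\,d\tau'.$$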


This is precisely the divergence we calculated in Eq. 1.100: 


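$$\nabla\cdot\left(\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) = 4\pi\,\delta^3(\boldsymbol{\mathscr{r}}).$$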


Thus 


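$$\nabla\cdot\mathbf{E} = \frac{1}{4\pi\epsilon_0}\int 4\pi\,\delta^3(\mathbf{r}-\mathbf{r}')\,\rho(\mathbf{r}')\,d\tau' = \frac{1}{\epsilon_0}\,\rho(\mathbf{r}), \tag{2.16}$$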
which is Gauss’s law in differential form (Eq. 2.14). To recover the integral 
form (Eq. 2.13), we run the previous argument in reverse — integrate over a 


volume and apply the divergence theorem: 


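$$\int_{\mathcal{V}} \nabla\cdot\mathbf{E}\,d\tau = \oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} = \frac{1}{\epsilon_0}\int_{\mathcal{V}} \rho\,d\tau = \frac{1}{\epsilon_0}\,Q_{\text{enc}}.$$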


2.2.3 Applications of Gauss’s Law 


I must interrupt the theoretical development at this point to show you 
the extraordinary power of Gauss’s law, in integral form. When symmetry 
permits, it affords by far the quickest and easiest way of computing electric 
fields. I’ll illustrate the method with a series of examples.


Example 2.3. Find the field outside a uniformly charged solid sphere 
of radius R and total charge q. 


Solution. Imagine a spherical surface at radius r > R (Fig. 2.18); this 
is called a Gaussian surface in the trade. Gauss’s law says that 


$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}},$$


and in this case Qenc = q. At first glance this doesn’t seem to get us 
very far, because the quantity we want (E) is buried inside the surface 
integral. Luckily, symmetry allows us to extract E from under the 
integral sign: E certainly points radially outward, as does da, so we 
can drop the dot product, 


8 If you doubt that E is radial, consider the alternative. Suppose, say, that it points due east, at
the “equator.” But the orientation of the equator is perfectly arbitrary (nothing is spinning
here, so there is no natural “north-south” axis); any argument purporting to show that E
points east could just as well be used to show it points west, or north, or any other direction.
The only unique direction on the surface of a sphere is radial.


$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} = \oint_{\mathcal{S}} |\mathbf{E}|\,da,$$


and the magnitude of E is constant over the Gaussian surface, so it 
comes outside the integral: 


$$\oint_{\mathcal{S}} |\mathbf{E}|\,da = |\mathbf{E}|\oint_{\mathcal{S}} da = |\mathbf{E}|\,4\pi r^2.$$


Thus

$$|\mathbf{E}|\,4\pi r^2 = \frac{1}{\epsilon_0}\,q,$$

or

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}.$$


Notice a remarkable feature of this result: the field outside the sphere 
is exactly the same as it would have been if all the charge had been 
concentrated at the center! 
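
If you would like to see this "remarkable feature" emerge from brute force rather than from symmetry, here is a rough sketch. It approximates the uniformly charged sphere by equal point charges on a cubic grid (a crude discretization chosen purely for illustration; the function names and grid size are assumptions) and adds up their Coulomb fields at a point on the z axis.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def sphere_field_on_axis(q, R, r, n=16):
    """z-component of E at (0, 0, r) from a uniformly charged solid sphere.

    The sphere (radius R, total charge q) is replaced by equal point charges
    at the centers of the grid cells that fall inside it.
    """
    h = 2.0 * R / n
    coords = [-R + (i + 0.5) * h for i in range(n)]
    cells = [(x, y, z) for x in coords for y in coords for z in coords
             if x * x + y * y + z * z <= R * R]
    dq = q / len(cells)  # normalize so the total charge is exactly q
    ez = 0.0
    for x, y, z in cells:
        dz = r - z
        dist = math.sqrt(x * x + y * y + dz * dz)
        ez += dq * dz / (4.0 * math.pi * EPS0 * dist ** 3)
    return ez

def point_charge_field(q, r):
    """Field magnitude of a point charge q at distance r."""
    return q / (4.0 * math.pi * EPS0 * r * r)
```

Outside the sphere the two functions should agree to within the (small) error of the grid approximation; the x and y components vanish because the grid is symmetric about the axis.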


Gauss’s law is always true, but it is not always useful. If $\rho$ had not been
uniform (or, at any rate, not spherically symmetrical), or if I had chosen
some other shape for my Gaussian surface, it would still have been true that
the flux of E is $q/\epsilon_0$, but E would not have pointed in the same direction as
da, and its magnitude would not have been constant over the surface, and 
without that I cannot pull |E| outside the integral. Symmetry is crucial to 
this application of Gauss’s law. As far as I know, there are only three kinds 
of symmetry that work: 


(1) Spherical symmetry. Make your Gaussian surface a concentric 
sphere. 

(2) Cylindrical symmetry. Make your Gaussian surface a coaxial cylinder 
(Fig. 2.19). 

(3) Plane symmetry. Use a Gaussian “pillbox” that straddles the surface 
(Fig. 2.20). 


Although (2) and (3) technically require infinitely long cylinders, and 
infinite planes, we shall often use them to get approximate answers for 
“long” cylinders or “large” planes, as long as we are not near the edges. 


Example 2.4. A long cylinder (Fig. 2.21) carries a charge density that
is proportional to the distance from the axis: $\rho = ks$, for some constant
$k$. Find the electric field inside this cylinder.


Solution. Draw a Gaussian cylinder of length $l$ and radius $s$. For this
surface, Gauss’s law states that


$$\oint_{\mathcal{S}} \mathbf{E}\cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}.$$


The enclosed charge is 


$$Q_{\text{enc}} = \int \rho\,d\tau = \int (ks')(s'\,ds'\,d\phi\,dz) = 2\pi k l\int_0^s s'^2\,ds' = \frac{2}{3}\pi k l s^3.$$


(I used the volume element appropriate to cylindrical coordinates,
Eq. 1.78, and integrated $\phi$ from 0 to $2\pi$, $z$ from 0 to $l$. I put a prime
on the integration variable $s'$, to distinguish it from the radius $s$ of the
Gaussian surface.)




Now, symmetry dictates that E must point radially outward, so for 
the curved portion of the Gaussian cylinder we have 


$$\int \mathbf{E}\cdot d\mathbf{a} = \int |\mathbf{E}|\,da = |\mathbf{E}|\int da = |\mathbf{E}|\,2\pi s l,$$


while the two ends contribute nothing (here E is perpendicular to da). 
Thus, 


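$$|\mathbf{E}|\,2\pi s l = \frac{1}{\epsilon_0}\,\frac{2}{3}\pi k l s^3,$$


or, finally,

$$\mathbf{E} = \frac{1}{3\epsilon_0}\,k s^2\,\hat{\mathbf{s}}.$$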
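
As a consistency check on Example 2.4, the field inside the cylinder works out to $\mathbf{E} = (k s^2/3\epsilon_0)\,\hat{\mathbf{s}}$, and the differential form of Gauss’s law (Eq. 2.14) says its divergence should reproduce $\rho/\epsilon_0 = ks/\epsilon_0$. The sketch below tests that with central finite differences in Cartesian coordinates; the function names and the step size are illustrative assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def cylinder_field(k, x, y):
    """(Ex, Ey) inside the cylinder of Example 2.4: E = k s^2 / (3 eps0) s-hat."""
    s = math.hypot(x, y)
    if s == 0.0:
        return (0.0, 0.0)
    mag = k * s * s / (3.0 * EPS0)
    return (mag * x / s, mag * y / s)

def divergence(k, x, y, h=1e-5):
    """Central-difference estimate of div E (the field has no z-dependence)."""
    ex_plus, _ = cylinder_field(k, x + h, y)
    ex_minus, _ = cylinder_field(k, x - h, y)
    _, ey_plus = cylinder_field(k, x, y + h)
    _, ey_minus = cylinder_field(k, x, y - h)
    return (ex_plus - ex_minus) / (2.0 * h) + (ey_plus - ey_minus) / (2.0 * h)

def charge_density(k, x, y):
    """The given charge density rho = k s."""
    return k * math.hypot(x, y)
```

At any interior point, `divergence(k, x, y)` should match `charge_density(k, x, y) / EPS0` to within the finite-difference error.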
Example 2.5. An infinite plane carries a uniform surface charge $\sigma$.
Find its electric field.


Solution. Draw a “Gaussian pillbox,” extending equal distances above 
and below the plane (Fig. 2.22). Apply Gauss’s law to this surface: 


$$\oint \mathbf{E}\cdot d\mathbf{a} = \frac{1}{\epsilon_0}\,Q_{\text{enc}}.$$


In this case, $Q_{\text{enc}} = \sigma A$, where A is the area of the lid of the pillbox.
By symmetry, E points away from the plane (upward for points above, 
downward for points below). So the top and bottom surfaces yield 


$$\int \mathbf{E}\cdot d\mathbf{a} = 2A|\mathbf{E}|,$$


whereas the sides contribute nothing. Thus 


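$$2A|\mathbf{E}| = \frac{1}{\epsilon_0}\,\sigma A,$$

or

$$\mathbf{E} = \frac{\sigma}{2\epsilon_0}\,\hat{\mathbf{n}}, \tag{2.17}$$


where $\hat{\mathbf{n}}$ is a unit vector pointing away from the surface. In Prob. 2.6,
you obtained this same result by a much more laborious method.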

It seems surprising, at first, that the field of an infinite plane is
independent of how far away you are. What about the $1/r^2$ in Coulomb’s
law? The point is that as you move farther and farther away from
the plane, more and more charge comes into your “field of view” (a
cone shape extending out from your eye), and this compensates for the
diminishing influence of any particular piece. The electric field of a
sphere falls off like $1/r^2$; the electric field of an infinite line falls off like
$1/s$; and the electric field of an infinite plane does not fall off at all (you
cannot escape from an infinite plane).
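
To watch this compensation happen, you can replace the infinite plane by a very large uniformly charged disk and add up the Coulomb contributions of its rings directly. The sketch below does that on the axis of the disk; the function name, the disk radius, and the number of rings are assumptions made for this illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def disk_field_on_axis(sigma, a, z, n=200000):
    """Field at height z on the axis of a uniformly charged disk of radius a.

    The disk is sliced into n thin rings; by symmetry only the component
    along the axis survives, so each ring contributes dq * z / (4 pi eps0 d^3).
    """
    ds = a / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * ds
        dq = sigma * 2.0 * math.pi * s * ds  # charge on the ring
        total += dq * z / (4.0 * math.pi * EPS0 * (s * s + z * z) ** 1.5)
    return total

def plane_field(sigma):
    """Field magnitude of an infinite plane, Eq. 2.17."""
    return sigma / (2.0 * EPS0)
```

With, say, `a = 100.0`, the heights `z = 0.1` and `z = 1.0` should give nearly the same field, both close to `plane_field(sigma)`, as long as `z` stays much smaller than `a`.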


Although the direct use of Gauss’s law to compute electric fields is 
limited to cases of spherical, cylindrical, and planar symmetry, we can 
put together combinations of objects possessing such symmetry, even if 
the arrangement as a whole is not symmetrical. For example, invoking the 
principle of superposition, we could find the field in the vicinity of two uni- 
formly charged parallel cylinders, or a sphere near an infinite charged plane. 


Example 2.6. Two infinite parallel planes carry equal but opposite
uniform charge densities $\pm\sigma$ (Fig. 2.23). Find the field in each of the
following three regions: (i) to the left of both, (ii) between them, (iii) 
to the right of both. 


Solution. The left-hand plate produces a field $(1/2\epsilon_0)\sigma$, which points
away from it: to the left in region (i) and to the right in regions (ii)
and (iii) (Fig. 2.24). The right-hand plate, being negatively charged,
produces a field $(1/2\epsilon_0)\sigma$, which points toward it: to the right in
regions (i) and (ii) and to the left in region (iii). The two fields cancel in
regions (i) and (iii); they conspire in region (ii). Conclusion: The field
between the plates is $\sigma/\epsilon_0$, and points to the right; elsewhere it is zero.




Problem 2.12. Use Gauss’s law to find the electric field inside and outside a spherical shell of
radius R that carries a uniform surface charge density $\sigma$. Compare your answer to Prob. 2.7;
notice how much quicker and easier Gauss’s method is.


Problem 2.13. Use Gauss’s law to find the electric field inside a uniformly charged solid sphere
(charge density $\rho$). Compare your answer to Prob. 2.8.


Problem 2.14. Find the electric field a distance s from an infinitely long straight wire that
carries a uniform line charge $\lambda$. Compare Eq. 2.9.


Problem 2.15. Find the electric field inside a sphere that carries a charge density proportional
to the distance from the center, $\rho = kr$, for some constant $k$. [Caution: This charge density is
not uniform, and you must integrate to get the enclosed charge.]


Problem 2.16. A thick spherical shell carries charge density


$$\rho = \frac{k}{r^2} \qquad (a \le r \le b)$$

(Fig. 2.25). Find the electric field in the following three regions: (i) $r < a$, (ii) $a < r < b$, (iii)
$r > b$. Plot $|\mathbf{E}|$ as a function of $r$, for the case $b = 2a$.


Problem 2.17. A long coaxial cable (Fig. 2.26) carries a uniform volume charge density $\rho$ on
the inner cylinder (radius a), and a uniform surface charge density $\sigma$ on the outer cylindrical
shell (radius b). This surface charge is negative and is of just the right magnitude that the cable
as a whole is electrically neutral. Find the electric field in each of the following three regions:
(i) inside the inner cylinder ($s < a$), (ii) between the cylinders ($a < s < b$), (iii) outside the
cable ($s > b$). Plot $|\mathbf{E}|$ as a function of $s$.


Problem 2.18. An infinite plane slab, of thickness 2d, carries a uniform volume charge density
$\rho$ (Fig. 2.27). Find the electric field, as a function of y, where y = 0 at the center. Plot E versus
y, calling E positive when E points in the +y direction and negative when it points in the
−y direction.


• Problem 2.19. Two spheres, each of radius R and carrying uniform volume charge densities
$+\rho$ and $-\rho$, respectively, are placed so that they partially overlap (Fig. 2.28). Call the vector
from the positive center to the negative center d. Show that the field in the region of overlap is
constant, and find its value. [Hint: Use the answer to Prob. 2.13.]


2.2.4 The Curl of E 


I’ll calculate the curl of E, as I did the divergence in Section 2.2.1, by
studying first the simplest possible configuration: a point charge at the 
origin. In this case, 




$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}}.$$


Now, a glance at Fig. 2.12 should convince you that the curl of this field 
has to be zero, but I suppose we ought to come up with something a little 
more rigorous than that. What if we calculate the line integral of this field 
from some point a to some other point b (Fig. 2.29): 


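$$\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l}\,?$$


In spherical coordinates, $d\mathbf{l} = dr\,\hat{\mathbf{r}} + r\,d\theta\,\hat{\boldsymbol{\theta}} + r\sin\theta\,d\phi\,\hat{\boldsymbol{\phi}}$, so


$$\mathbf{E}\cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,dr.$$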


Therefore, 


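$$\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0}\int_{\mathbf{a}}^{\mathbf{b}} \frac{q}{r^2}\,dr = \left.\frac{-1}{4\pi\epsilon_0}\,\frac{q}{r}\right|_{r_a}^{r_b} = \frac{1}{4\pi\epsilon_0}\left(\frac{q}{r_a} - \frac{q}{r_b}\right), \tag{2.18}$$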


where $r_a$ is the distance from the origin to the point a and $r_b$ is the distance
to b. The integral around a closed path is evidently zero (for then $r_a = r_b$):


$$\oint \mathbf{E}\cdot d\mathbf{l} = 0, \tag{2.19}$$


and hence, applying Stokes’ theorem (and noting that Eq. 2.19 applies to 
any closed path), 


$$\nabla\times\mathbf{E} = 0. \tag{2.20}$$


Now, I proved Eqs. 2.19 and 2.20 only for the field of a single point 
charge at the origin, but these results make no reference to what is, after 
all, a perfectly arbitrary choice of coordinates; they hold no matter where 
the charge is located. Moreover, if we have many charges, the principle of 
superposition says that the total field is the vector sum of their individual 
fields: 


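$$\mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 + \cdots,$$

so

$$\nabla\times\mathbf{E} = \nabla\times(\mathbf{E}_1 + \mathbf{E}_2 + \cdots) = (\nabla\times\mathbf{E}_1) + (\nabla\times\mathbf{E}_2) + \cdots = 0.$$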


Thus, Eqs. 2.19 and 2.20 hold for any (static) charge distribution whatever. 
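
Path independence is easy to test numerically. The sketch below (an illustration only; the function names, the sample points, and the piecewise-straight paths are my own assumptions) integrates the field of a point charge at the origin along any chain of straight segments, so you can compare two different routes from a to b against Eq. 2.18.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def field(q, p):
    """Coulomb field at point p = (x, y, z) of a charge q at the origin."""
    r = math.sqrt(p[0] ** 2 + p[1] ** 2 + p[2] ** 2)
    k = q / (4.0 * math.pi * EPS0 * r ** 3)
    return (k * p[0], k * p[1], k * p[2])

def line_integral(q, waypoints, n=5000):
    """Midpoint-rule estimate of the integral of E . dl along straight
    segments joining consecutive waypoints."""
    total = 0.0
    for start, end in zip(waypoints[:-1], waypoints[1:]):
        step = [(e - s) / n for s, e in zip(start, end)]
        for i in range(n):
            t = (i + 0.5) / n
            p = [s + t * (e - s) for s, e in zip(start, end)]
            e_vec = field(q, p)
            total += sum(ec * dc for ec, dc in zip(e_vec, step))
    return total

def eq_2_18(q, r_a, r_b):
    """Right-hand side of Eq. 2.18."""
    return q / (4.0 * math.pi * EPS0) * (1.0 / r_a - 1.0 / r_b)
```

For example, with a = (1, 0, 0) and b = (0, 2, 2), the straight path `[a, b]` and the detour `[a, (1, 0, 4), b]` should give the same value, and the closed loop `[a, b, (1, 0, 4), a]` should give (very nearly) zero. The paths must of course avoid the origin, where the field blows up.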


Problem 2.20. Calculate $\nabla\times\mathbf{E}$ directly from Eq. 2.8, by the method of Section 2.2.2. Refer to
Prob. 1.63b if you get stuck. 


2.3 Electric Potential 


2.3.1 Introduction to Potential 


The electric field E is not just any old vector function. It is a very special 
kind of vector function: one whose curl is zero. The vector $\mathbf{E} = y\,\hat{\mathbf{x}}$, for
example, could not possibly be an electrostatic field; no set of charges, 
regardless of their sizes and positions, could ever produce such a field. We’re 
going to exploit this special property of electric fields to reduce a vector 
problem (finding E) to a much simpler scalar problem. The first theorem 
in Section 1.6.2 asserts that any vector whose curl is zero is equal to the 
gradient of some scalar. What I’m going to do now amounts to a proof of 
that claim, in the context of electrostatics. 
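
One way to convince yourself that $\mathbf{E} = y\,\hat{\mathbf{x}}$ is disqualified is to integrate it around a closed loop and see that the answer is not zero. The sketch below does this for a unit square in the xy plane, and, for contrast, for the curl-free field $x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}}$. Both the loop and the comparison field are hypothetical choices made for this illustration.

```python
def loop_integral(field, corners, n=1000):
    """Midpoint-rule estimate of the closed line integral of a 2D field
    around the polygon whose vertices are listed in order in `corners`."""
    total = 0.0
    m = len(corners)
    for idx in range(m):
        x0, y0 = corners[idx]
        x1, y1 = corners[(idx + 1) % m]  # wrap around to close the loop
        dx, dy = (x1 - x0) / n, (y1 - y0) / n
        for i in range(n):
            t = (i + 0.5) / n
            ex, ey = field(x0 + t * (x1 - x0), y0 + t * (y1 - y0))
            total += ex * dx + ey * dy
    return total

def suspect_field(x, y):
    """E = y x-hat, the field claimed to be impossible in electrostatics."""
    return (y, 0.0)

def curl_free_field(x, y):
    """E = x x-hat + y y-hat, whose curl vanishes everywhere."""
    return (x, y)

UNIT_SQUARE = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
```

Going counterclockwise around the unit square, only the top edge contributes for $y\,\hat{\mathbf{x}}$, and the loop integral comes out to −1, matching the surface integral of its curl ($-\hat{\mathbf{z}}$) over the square; the curl-free field gives zero.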

Because $\nabla\times\mathbf{E} = 0$, the line integral of E around any closed loop is zero
(that follows from Stokes’ theorem). Because $\oint \mathbf{E}\cdot d\mathbf{l} = 0$, the line integral
of E from point a to point b is the same for all paths (otherwise you could
go out along path (i) and return along path (ii), as in Fig. 2.30, and obtain
$\oint \mathbf{E}\cdot d\mathbf{l} \neq 0$). Because the line integral is independent of path, we can define
a function⁹


$$V(\mathbf{r}) \equiv -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}\cdot d\mathbf{l}. \tag{2.21}$$




9 To avoid any possible ambiguity, I should perhaps put a prime on the integration variable:

$$V(\mathbf{r}) = -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}(\mathbf{r}')\cdot d\mathbf{l}'.$$

But this makes for cumbersome notation, and I prefer whenever possible to reserve the
primes for source points. However, when (as in Ex. 2.7) we calculate such integrals explicitly,
I will restore the primes.
Here $\mathcal{O}$ is some standard reference point on which we have agreed
beforehand; V then depends only on the point r. It is called the electric 
potential. 

The potential difference between two points a and b is 


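$$\begin{aligned} V(\mathbf{b}) - V(\mathbf{a}) &= -\int_{\mathcal{O}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l} + \int_{\mathcal{O}}^{\mathbf{a}} \mathbf{E}\cdot d\mathbf{l} \\ &= -\int_{\mathcal{O}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l} - \int_{\mathbf{a}}^{\mathcal{O}} \mathbf{E}\cdot d\mathbf{l} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l}. \end{aligned} \tag{2.22}$$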
oO a a 


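Now, the fundamental theorem for gradients states that


$$V(\mathbf{b}) - V(\mathbf{a}) = \int_{\mathbf{a}}^{\mathbf{b}} (\nabla V)\cdot d\mathbf{l},$$

so

$$\int_{\mathbf{a}}^{\mathbf{b}} (\nabla V)\cdot d\mathbf{l} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l}.$$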


Since, finally, this is true for any points a and b, the integrands must be 
equal: 


$$\mathbf{E} = -\nabla V. \tag{2.23}$$


Equation 2.23 is the differential version of Eq. 2.21; it says that the electric 
field is the gradient of a scalar potential, which is what we set out to 
prove. 

Notice the subtle but crucial role played by path independence (or, 
equivalently, the fact that $\nabla\times\mathbf{E} = 0$) in this argument. If the line integral
of E depended on the path taken, then the “definition” of V, Eq. 2.21, 
would be nonsense. It would not define a function, since changing the 
path would alter the value of V(r). By the way, don’t let the minus sign 
in Eq. 2.23 distract you; it carries over from Eq. 2.21 and is largely a matter 
of convention. 


Problem 2.21. One of these is an impossible electrostatic field. Which one? 


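(a) $\mathbf{E} = k\,[xy\,\hat{\mathbf{x}} + 2yz\,\hat{\mathbf{y}} + 3xz\,\hat{\mathbf{z}}]$.
(b) $\mathbf{E} = k\,[y^2\,\hat{\mathbf{x}} + (2xy + z^2)\,\hat{\mathbf{y}} + 2yz\,\hat{\mathbf{z}}]$.


Here k is a (nonzero) constant with the appropriate units. For the possible one, find the potential,
using the origin as your reference point. Check your answer by computing $\nabla V$. [Hint: You must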
select a specific path to integrate along. It doesn’t matter what path you choose, since the 
answer is path independent, but you simply cannot integrate unless you have a definite path 
in mind.] 
2.3.2 Comments on Potential


(i) The name. The word “potential” is a hideous misnomer because it 
inevitably reminds you of potential energy. This is particularly insidious, 
because there is a connection between “potential” and “potential energy,” 
as you will see in Section 2.4. I’m sorry that it is impossible to escape this 
word. The best I can do is to insist once and for all that “potential” and 
“potential energy” are completely different terms and should, by all rights, 
have different names. Incidentally, a region (typically a surface) over which 
the potential is constant is called an equipotential. 

(ii) Advantage of the potential formulation. If you know V, you can easily 
get E: just take the gradient, $\mathbf{E} = -\nabla V$. This is quite extraordinary when
you stop to think about it, for E is a vector quantity (three components), 
but V is a scalar (one component). How can one function possibly contain 
all the information that three independent functions carry? The answer is 
that the three components of E are not really as independent as they look; 
in fact, they are explicitly interrelated by the very condition we started with, 
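$\nabla\times\mathbf{E} = 0$. In terms of the components,


$$\frac{\partial E_x}{\partial y} = \frac{\partial E_y}{\partial x}, \qquad \frac{\partial E_z}{\partial y} = \frac{\partial E_y}{\partial z}, \qquad \frac{\partial E_x}{\partial z} = \frac{\partial E_z}{\partial x}.$$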


This brings us back to my observation at the beginning of Section 2.3.1: E 
is a very special kind of vector. What the potential formulation does is to 
exploit this feature to maximum advantage, reducing a vector problem to a 
scalar one, in which there is no need to fuss with components. 

(iii) The reference point $\mathcal{O}$. There is an essential ambiguity in the definition
of potential, since the choice of reference point $\mathcal{O}$ was arbitrary.
Changing reference points amounts to adding a constant K to the potential: 


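$$V'(\mathbf{r}) = -\int_{\mathcal{O}'}^{\mathbf{r}} \mathbf{E}\cdot d\mathbf{l} = -\int_{\mathcal{O}'}^{\mathcal{O}} \mathbf{E}\cdot d\mathbf{l} - \int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}\cdot d\mathbf{l} = K + V(\mathbf{r}),$$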


where K is the line integral of E from the old reference point O to the 
new one O’. Of course, adding a constant to V will not affect the potential 
difference between two points: 


$$V'(\mathbf{b}) - V'(\mathbf{a}) = V(\mathbf{b}) - V(\mathbf{a}),$$


since the Ks cancel out. (Actually, it was already clear from Eq. 2.22 that 
the potential difference is independent of O, because it can be written as 
the line integral of E from a to b, with no mention of O.) Nor does the 
ambiguity affect the gradient of V: 


$$\nabla V' = \nabla V,$$


since the derivative of a constant is zero. That’s why all such Vs, differing 

only in their choice of reference point, correspond to the same field E. 
Potential as such carries no real physical significance, for at any given 

point we can adjust its value at will by a suitable relocation of $\mathcal{O}$. In this
sense, it is rather like altitude: If I ask you how high Denver is, you will
probably tell me its height above sea level, because that is a convenient 
and traditional reference point. But we could as well agree to measure 
altitude above Washington, DC, or Greenwich, or wherever. That would 
add (or, rather, subtract) a fixed amount from all our sea-level readings, but 
it wouldn’t change anything about the real world. The only quantity with 
intrinsic meaning is the difference in altitude between two points, and that 
is the same whatever your reference level. 
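
Here is the same point in numbers, for the field of a single point charge. Using Eq. 2.18 in the definition (Eq. 2.21) gives $V(r) = (q/4\pi\epsilon_0)(1/r - 1/r_{\text{ref}})$ when the reference point sits at distance $r_{\text{ref}}$ from the charge. The sketch below (function name and sample values are assumptions for illustration) compares two choices of reference point.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def potential(q, r, r_ref):
    """Potential of a point charge q at distance r, with the zero of
    potential placed at distance r_ref (follows from Eqs. 2.18 and 2.21)."""
    return q / (4.0 * math.pi * EPS0) * (1.0 / r - 1.0 / r_ref)

q = 1e-9                    # a 1 nC source charge
r_a, r_b = 0.5, 2.0         # two field points
ref_old, ref_new = 10.0, 3.0  # two different reference distances

# The two potentials differ by the same constant K at every point...
K_at_a = potential(q, r_a, ref_new) - potential(q, r_a, ref_old)
K_at_b = potential(q, r_b, ref_new) - potential(q, r_b, ref_old)

# ...so potential differences do not care which reference was used.
dV_old = potential(q, r_b, ref_old) - potential(q, r_a, ref_old)
dV_new = potential(q, r_b, ref_new) - potential(q, r_a, ref_new)
```

`K_at_a` and `K_at_b` should agree, and so should `dV_old` and `dV_new`, apart from floating-point rounding.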

Having said this, however, there is a “natural” spot to use for $\mathcal{O}$ in
electrostatics (analogous to sea level for altitude), and that is a point
infinitely far from the charge. Ordinarily, then, we “set the zero of potential
at infinity.” (Since $V(\mathcal{O}) = 0$, choosing a reference point is equivalent to
selecting a place where V is to be zero.) But I must warn you that there
is one special circumstance in which this convention fails: when the charge
distribution itself extends to infinity. The symptom of trouble, in such cases,
is that the potential blows up. For instance, the field of a uniformly charged
plane is $(\sigma/2\epsilon_0)\,\hat{\mathbf{n}}$, as we found in Ex. 2.5; if we naively put $\mathcal{O} = \infty$, then the
potential at height z above the plane becomes


$$V(z) = -\int_{\infty}^{z} \frac{1}{2\epsilon_0}\,\sigma\,dz = -\frac{1}{2\epsilon_0}\,\sigma\,(z - \infty).$$


The remedy is simply to choose some other reference point (in this example 
you might use a point on the plane). Notice that this difficulty occurs only 
in textbook problems; in “real life” there is no such thing as a charge 
distribution that goes on forever, and we can always use infinity as our 
reference point. 

(iv) Potential obeys the superposition principle. The original superposition 
principle pertains to the force on a test charge Q. It says that the total 
force on Q is the vector sum of the forces attributable to the source charges 
individually: 

$$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \cdots.$$


Dividing through by Q, we find that the electric field, too, obeys the 
superposition principle: 


$$\mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2 + \cdots.$$


Integrating from the common reference point to r, it follows that the 
potential also satisfies such a principle: 


$$V = V_1 + V_2 + \cdots.$$


That is, the potential at any given point is the sum of the potentials due to 
all the source charges separately. Only this time it is an ordinary sum, not a 
vector sum, which makes it a lot easier to work with. 

(v) Units of Potential. In our units, force is measured in newtons and 
charge in coulombs, so electric fields are in newtons per coulomb. Accord- 
ingly, potential is newton-meters per coulomb, or joules per coulomb. 
A joule per coulomb is a volt. 
Example 2.7. Find the potential inside and outside a spherical shell
of radius R (Fig. 2.31) that carries a uniform surface charge. Set the 
reference point at infinity. 


Solution. From Gauss’s law, the field outside is 


$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,\hat{\mathbf{r}},$$


where q is the total charge on the sphere. The field inside is zero. For 
points outside the sphere (r > R), 


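$$V(r) = -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}\cdot d\mathbf{l} = \frac{-1}{4\pi\epsilon_0}\int_{\infty}^{r} \frac{q}{r'^2}\,dr' = \left.\frac{1}{4\pi\epsilon_0}\,\frac{q}{r'}\right|_{\infty}^{r} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r}.$$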


To find the potential inside the sphere (r < R), we must break the 
integral into two pieces, using in each region the field that prevails 
there: 


$$V(r) = \frac{-1}{4\pi\epsilon_0}\int_{\infty}^{R} \frac{q}{r'^2}\,dr' - \int_{R}^{r} (0)\,dr' = \left.\frac{1}{4\pi\epsilon_0}\,\frac{q}{r'}\right|_{\infty}^{R} + 0 = \frac{1}{4\pi\epsilon_0}\,\frac{q}{R}.$$

Notice that the potential is not zero inside the shell, even though the
field vanishes! To be sure, V is a constant in this region, so that $\nabla V = 0$,
and that’s what matters. In problems of this type, you must always work
your way in from the reference point; that’s where the potential is “nailed
down.” It is tempting to suppose that you could figure out the potential
inside the sphere on the basis of the field there alone, but this is false: 
the potential inside the sphere is sensitive to what’s going on outside 
the sphere as well. If I placed a second uniformly charged shell out at 
radius R’ > R, the potential inside R would change, even though the 
field would still be zero. Gauss’s law guarantees that charge exterior to 
a given point (that is, at larger r) produces no net field at that point, 
provided it is spherically or cylindrically symmetric, but there is no 
such rule for potential, when infinity is used as the reference point. 
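
The instruction to "work your way in from the reference point" translates directly into a numerical recipe. The sketch below integrates the shell's field inward from infinity; to make the infinite range manageable it substitutes $u = 1/r'$, which is my own choice for this illustration (as are the function names), so that $V(r) = \int_r^\infty E\,dr' = \int_0^{1/r} E(1/u)\,u^{-2}\,du$.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def shell_field(q, R, r):
    """Radial field of a uniformly charged spherical shell (Gauss's law):
    zero inside, point-charge-like outside."""
    if r < R:
        return 0.0
    return q / (4.0 * math.pi * EPS0 * r * r)

def potential_from_infinity(q, R, r, n=100000):
    """V(r) with the reference point at infinity, by midpoint integration
    in the variable u = 1/r' running from 0 (infinity) up to 1/r."""
    du = (1.0 / r) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        total += shell_field(q, R, 1.0 / u) / (u * u) * du
    return total

def shell_potential_exact(q, R, r):
    """Closed-form result of Example 2.7."""
    return q / (4.0 * math.pi * EPS0 * max(r, R))
```

Outside the shell the integrand is constant in $u$ and the result is essentially exact; inside, the integral simply stops accumulating once the path crosses $r' = R$, which is why the potential stays pinned at its surface value.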


Problem 2.22. Find the potential inside and outside a uniformly charged solid sphere whose 
radius is R and whose total charge is q. Use infinity as your reference point. Compute the 
gradient of V in each region, and check that it yields the correct field. Sketch V (r). 
Problem 2.23. Find the potential a distance s from an infinitely long straight wire that carries
a uniform line charge $\lambda$. Compute the gradient of your potential, and check that it yields the
correct field. 


Problem 2.24. For the charge configuration of Prob. 2.16, find the potential at the center, using
infinity as your reference point. 


Problem 2.25. For the configuration of Prob. 2.17, find the potential difference between a 
point on the axis and a point on the outer cylinder. Note that it is not necessary to commit 
yourself to a particular reference point, if you use Eq. 2.22. 


2.3.3 Poisson’s Equation and Laplace's Equation 


We found in Section 2.3.1 that the electric field can be written as the 
gradient of a scalar potential, 


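$$\mathbf{E} = -\nabla V.$$

The following question arises: What do the divergence and curl of E,


$$\nabla\cdot\mathbf{E} = \frac{\rho}{\epsilon_0} \quad\text{and}\quad \nabla\times\mathbf{E} = 0,$$

tell us about V? Well, $\nabla\cdot\mathbf{E} = \nabla\cdot(-\nabla V) = -\nabla^2 V$, so, apart from that
persistent minus sign, the divergence of E is the Laplacian of V. Gauss’s
law, then, says


$$\nabla^2 V = -\frac{\rho}{\epsilon_0}. \tag{2.24}$$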


This is known as Poisson’s equation. In regions where there is no charge, so 
$\rho = 0$, Poisson’s equation reduces to Laplace’s equation,


$$\nabla^2 V = 0. \tag{2.25}$$


We’ll explore this equation more fully in Chapter 3. 
So much for Gauss’s law. What about the curl law? This says that 


$$\nabla\times\mathbf{E} = \nabla\times(-\nabla V) = 0.$$


But that’s no condition on V; curl of gradient is always zero. Of course,
we used the curl law to show that E could be expressed as the gradient of
a scalar, so it’s not really surprising that this works out: $\nabla\times\mathbf{E} = 0$ permits
$\mathbf{E} = -\nabla V$; in return, $\mathbf{E} = -\nabla V$ guarantees $\nabla\times\mathbf{E} = 0$. It takes only one
differential equation (Poisson’s) to determine V, because V is a scalar; for
E we needed two, the divergence and the curl.
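
Poisson’s equation can be read "backwards" as a recipe for the charge density, $\rho = -\epsilon_0\nabla^2 V$, and that is easy to try out with finite differences. In the sketch below, the function names, the step size, and the two sample potentials are all assumptions chosen for illustration: a point-charge potential (which should satisfy Laplace’s equation away from the charge) and a hypothetical quadratic potential $V = -\rho_0 r^2/6\epsilon_0$ (which should correspond to a uniform density $\rho_0$).

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def laplacian(f, x, y, z, h=1e-3):
    """Second-order central-difference estimate of the Laplacian of f."""
    return (f(x + h, y, z) + f(x - h, y, z)
            + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h)
            - 6.0 * f(x, y, z)) / (h * h)

def charge_density(f, x, y, z):
    """Charge density implied by the potential f via Poisson's equation."""
    return -EPS0 * laplacian(f, x, y, z)

def point_charge_potential(x, y, z, q=1e-9):
    """V = q / (4 pi eps0 r) for a charge q at the origin."""
    return q / (4.0 * math.pi * EPS0 * math.sqrt(x * x + y * y + z * z))

RHO0 = 1e-6  # hypothetical uniform charge density, C/m^3

def quadratic_potential(x, y, z):
    """Hypothetical potential V = -rho0 r^2 / (6 eps0)."""
    return -RHO0 * (x * x + y * y + z * z) / (6.0 * EPS0)
```

Away from the origin, `charge_density(point_charge_potential, ...)` should be zero to within numerical error, while `charge_density(quadratic_potential, ...)` should return `RHO0` at any point.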
2.3.4 The Potential of a Localized Charge Distribution


I defined V in terms of E (Eq. 2.21). Ordinarily, though, it’s E that we’re 
looking for (if we already knew E, there wouldn’t be much point in calcu- 
lating V). The idea is that it might be easier to get V first, and then calculate 
E by taking the gradient. Typically, then, we know where the charge is (that
is, we know $\rho$), and we want to find V. Now, Poisson’s equation relates V
and $\rho$, but unfortunately it’s “the wrong way around”: it would give us $\rho$,
if we knew V, whereas we want V, knowing $\rho$. What we must do, then, is
“invert” Poisson’s equation. That’s the program for this section, although I 
shall do it by roundabout means, beginning, as always, with a point charge 
at the origin. 

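The electric field is $\mathbf{E} = (1/4\pi\epsilon_0)(q/r^2)\,\hat{\mathbf{r}}$, and
$d\mathbf{l} = dr\,\hat{\mathbf{r}} + r\,d\theta\,\hat{\boldsymbol{\theta}} + r\sin\theta\,d\phi\,\hat{\boldsymbol{\phi}}$ (Eq. 1.68), so


$$\mathbf{E}\cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r^2}\,dr.$$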


Setting the reference point at infinity, the potential of a point charge q at 
the origin is 


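$$V(r) = -\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{E}\cdot d\mathbf{l} = \frac{-1}{4\pi\epsilon_0}\int_{\infty}^{r} \frac{q}{r'^2}\,dr' = \left.\frac{1}{4\pi\epsilon_0}\,\frac{q}{r'}\right|_{\infty}^{r} = \frac{1}{4\pi\epsilon_0}\,\frac{q}{r}.$$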


(You see here the advantage of using infinity for the reference point: it 
kills the lower limit on the integral.) Notice the sign of V; presumably the 
conventional minus sign in the definition (Eq. 2.21) was chosen in order 
to make the potential of a positive charge come out positive. It is useful 
to remember that regions of positive charge are potential “hills,” regions 
of negative charge are potential “valleys,” and the electric field points 
“downhill,” from plus toward minus. 
In general, the potential of a point charge q is 


$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\,\frac{q}{\mathscr{r}}, \tag{2.26}$$


where $\mathscr{r}$, as always, is the distance from q to r (Fig. 2.32). Invoking the
superposition principle, then, the potential of a collection of charges is


$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\sum_{i=1}^{n} \frac{q_i}{\mathscr{r}_i}, \tag{2.27}$$
or, for a continuous distribution,


$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{1}{\mathscr{r}}\,dq. \tag{2.28}$$

In particular, for a volume charge, it is

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\rho(\mathbf{r}')}{\mathscr{r}}\,d\tau'. \tag{2.29}$$


This is the equation we were looking for, telling us how to compute V when
we know $\rho$; it is, if you like, the “solution” to Poisson’s equation, for a
localized charge distribution.¹⁰ Compare Eq. 2.29 with the corresponding
formula for the electric field in terms of $\rho$ (Eq. 2.8):


$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\rho(\mathbf{r}')}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,d\tau'.$$

The main point to notice is that the pesky unit vector is gone, so you 
don’t have to worry about components. The potentials of line and surface 
charges are 


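$$V = \frac{1}{4\pi\epsilon_0}\int \frac{\lambda(\mathbf{r}')}{\mathscr{r}}\,dl' \quad\text{and}\quad V = \frac{1}{4\pi\epsilon_0}\int \frac{\sigma(\mathbf{r}')}{\mathscr{r}}\,da'. \tag{2.30}$$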


I should warn you that everything in this section is predicated on the 
assumption that the reference point is at infinity. This is certainly not obvi- 
ous in Eq. 2.29, but remember that we got that equation from the potential 
of a point charge at the origin, $(1/4\pi\epsilon_0)(q/r)$, which is valid only when
$\mathcal{O} = \infty$. If you try to apply these formulas to one of those artificial problems
in which the charge itself extends to infinity, the integral will diverge — that’s 
the alarm bell, telling you to go back and fix the reference point. 
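
For a handful of point charges, Eq. 2.27 is a one-line scalar sum, and the field follows from $\mathbf{E} = -\nabla V$. The sketch below is an illustration only (the function names, the finite-difference step, and the sample charges are assumptions): it compares the field obtained by differentiating the potential numerically with the direct vector sum of Coulomb fields.

```python
import math

EPS0 = 8.8541878128e-12            # vacuum permittivity in F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # Coulomb constant

def potential(charges, p):
    """Eq. 2.27: scalar sum over charges given as (q, (x, y, z)) pairs,
    with the reference point at infinity."""
    return sum(K * q / math.dist(p, pos) for q, pos in charges)

def field_direct(charges, p):
    """Vector sum of the individual Coulomb fields at point p."""
    e = [0.0, 0.0, 0.0]
    for q, pos in charges:
        d = math.dist(p, pos)
        for i in range(3):
            e[i] += K * q * (p[i] - pos[i]) / d ** 3
    return e

def field_from_potential(charges, p, h=1e-5):
    """E = -grad V, estimated with central differences."""
    e = []
    for i in range(3):
        plus, minus = list(p), list(p)
        plus[i] += h
        minus[i] -= h
        e.append(-(potential(charges, plus) - potential(charges, minus)) / (2.0 * h))
    return e

# Three sample charges (values chosen arbitrarily for the illustration).
CHARGES = [(1e-9, (0.0, 0.0, 0.0)),
           (-2e-9, (1.0, 0.0, 0.0)),
           (3e-9, (0.0, 1.0, 1.0))]
```

The two routes to the field should agree component by component; the scalar route only ever adds numbers, which is the practical payoff of the potential formulation.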


Example 2.8. Find the potential of a uniformly charged spherical shell 
of radius R (Fig. 2.33). 


10 Equation 2.29 is an example of the Helmholtz theorem (Appendix B). 
Solution. This is the same problem we solved in Ex. 2.7, but this time
let’s do it using Eq. 2.30: 


$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\sigma}{\mathscr{r}}\,da'.$$


We might as well set the point P on the z-axis and use the law of cosines
to express $\mathscr{r}$:


$$\mathscr{r}^2 = R^2 + z^2 - 2Rz\cos\theta'.$$


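An element of surface area on the sphere is $R^2\sin\theta'\,d\theta'\,d\phi'$, so


$$\begin{aligned} 4\pi\epsilon_0 V(z) &= \sigma\int \frac{R^2\sin\theta'\,d\theta'\,d\phi'}{\sqrt{R^2 + z^2 - 2Rz\cos\theta'}} \\ &= 2\pi R^2\sigma\int_0^{\pi} \frac{\sin\theta'}{\sqrt{R^2 + z^2 - 2Rz\cos\theta'}}\,d\theta' \\ &= 2\pi R^2\sigma\left.\left(\frac{1}{Rz}\sqrt{R^2 + z^2 - 2Rz\cos\theta'}\right)\right|_0^{\pi} \\ &= \frac{2\pi R\sigma}{z}\left(\sqrt{R^2 + z^2 + 2Rz} - \sqrt{R^2 + z^2 - 2Rz}\right) \\ &= \frac{2\pi R\sigma}{z}\left[\sqrt{(R+z)^2} - \sqrt{(R-z)^2}\right]. \end{aligned}$$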


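At this stage, we must be very careful to take the positive root. For
points outside the sphere, z is greater than R, and hence $\sqrt{(R-z)^2} = z - R$;
for points inside the sphere, $\sqrt{(R-z)^2} = R - z$. Thus,


$$V(z) = \begin{cases} \dfrac{R\sigma}{2\epsilon_0 z}\,[(R+z) - (z-R)] = \dfrac{R^2\sigma}{\epsilon_0 z}, & \text{outside}, \\[2ex] \dfrac{R\sigma}{2\epsilon_0 z}\,[(R+z) - (R-z)] = \dfrac{R\sigma}{\epsilon_0}, & \text{inside}. \end{cases}$$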


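In terms of r and the total charge on the shell, $q = 4\pi R^2\sigma$,


$$V(r) = \begin{cases} \dfrac{1}{4\pi\epsilon_0}\,\dfrac{q}{r} & (r \ge R), \\[2ex] \dfrac{1}{4\pi\epsilon_0}\,\dfrac{q}{R} & (r \le R). \end{cases}$$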


Of course, in this particular case, it was easier to get V by using 
Eq. 2.21 than Eq. 2.30, because Gauss’s law gave us E with so little 
effort. But if you compare Ex. 2.8 with Prob. 2.7, you will appreciate 
the power of the potential formulation. 
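
If you distrust the algebra with the square roots, you can hand the $\theta'$ integral to a computer instead. The sketch below is a minimal illustration (the function names and the number of steps are assumptions): it evaluates the surface integral of Eq. 2.30 for a point on the z axis by the midpoint rule, with the $\phi'$ integral done analytically as a factor of $2\pi$, and compares it with the closed form just derived.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def shell_potential_numeric(sigma, R, z, n=20000):
    """Eq. 2.30 for a uniformly charged shell, at distance z from the center.

    The script-r distance comes from the law of cosines; z must differ
    from R so that the integrand stays finite.
    """
    dtheta = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        script_r = math.sqrt(R * R + z * z - 2.0 * R * z * math.cos(theta))
        total += R * R * math.sin(theta) * dtheta / script_r
    return sigma * 2.0 * math.pi * total / (4.0 * math.pi * EPS0)

def shell_potential_exact(sigma, R, z):
    """Closed-form result of Example 2.8, written in terms of q = 4 pi R^2 sigma."""
    q = 4.0 * math.pi * R * R * sigma
    return q / (4.0 * math.pi * EPS0 * max(z, R))
```

Points inside (`z < R`) should all give the same constant value, and points outside should fall off like $1/z$.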


Problem 2.26. Using Eqs. 2.27 and 2.30, find the potential at a distance z above the center
of the charge distributions in Fig. 2.34. In each case, compute $\mathbf{E} = -\nabla V$, and compare
your answers with Ex. 2.1, Ex. 2.2, and Prob. 2.6, respectively. Suppose that we changed the
right-hand charge in Fig. 2.34a to $-q$: What then is the potential at P? What field does that
suggest? Compare your answer to Prob. 2.2, and explain carefully any discrepancy. 


[Fig. 2.34: (a) Two point charges; (b) Uniform line charge; (c) Uniform surface charge]


Problem 2.27. A conical surface (an empty ice-cream cone) carries a uniform surface charge
$\sigma$. The height of the cone is $h$, as is the radius of the top. Find the potential difference between
points a (the vertex) and b (the center of the top).


Problem 2.28. Find the potential on the axis of a uniformly charged solid cylinder, a distance
z from the center. The length of the cylinder is L, its radius is R, and the charge density is $\rho$.
Use your result to calculate the electric field at this point. (Assume that z > L/2.)


Problem 2.29. Use Eq. 2.29 to calculate the potential inside a uniformly charged solid sphere 
of radius R and total charge q. Compare your answer to Prob. 2.22. 


Problem 2.30. Check that Eq. 2.29 satisfies Poisson's equation, by applying the Laplacian and 
using Eq. 1.102. 


2.3.5 Boundary Conditions 


In the typical electrostatic problem you are given a source charge distribution
$\rho$, and you want to find the electric field E it produces. Unless the
symmetry of the problem allows a solution by Gauss’s law, it is generally
to your advantage to calculate the potential first, as an intermediate step.
These are the three fundamental quantities of electrostatics: $\rho$, E, and V.
We have, in the course of our discussion, derived all six formulas interrelat- 
ing them. These equations are neatly summarized in Fig. 2.35. We began 
with just two experimental observations: (1) the principle of superposi- 
tion — a broad general rule applying to all electromagnetic forces, and (2) 
Coulomb’s law — the fundamental law of electrostatics. From these, all else 
followed. 

You may have noticed, in studying Exs. 2.5 and 2.6, or working problems
such as 2.7, 2.12, and 2.17, that the electric field always undergoes a
discontinuity when you cross a surface charge σ. In fact, it is a simple matter
to find the amount by which E changes at such a boundary. Suppose we draw
a wafer-thin Gaussian pillbox, extending just barely over the edge in each
direction (Fig. 2.36). Gauss’s law says that


$$\oint_S \mathbf{E}\cdot d\mathbf{a} = \frac{1}{\epsilon_0}Q_{\text{enc}} = \frac{1}{\epsilon_0}\sigma A,$$

where A is the area of the pillbox lid. (If σ varies from point to point or
the surface is curved, we must pick A to be extremely small.) Now, the sides
of the pillbox contribute nothing to the flux, in the limit as the thickness ε
goes to zero, so we are left with


$$E^{\perp}_{\text{above}} - E^{\perp}_{\text{below}} = \frac{1}{\epsilon_0}\sigma, \tag{2.31}$$

where E⊥_above denotes the component of E that is perpendicular to the
surface immediately above, and E⊥_below is the same, only just below the
surface. For consistency, we let “upward” be the positive direction for both.
Conclusion: The normal component of E is discontinuous by an amount
σ/ε0 at any boundary. In particular, where there is no surface charge, E⊥
is continuous, as for instance at the surface of a uniformly charged solid
sphere.

The tangential component of E, by contrast, is always continuous. For
if we apply Eq. 2.19,

$$\oint \mathbf{E}\cdot d\mathbf{l} = 0,$$

to the thin rectangular loop of Fig. 2.37, the ends give nothing (as ε → 0),
and the sides give (E∥_above l - E∥_below l), so

$$\mathbf{E}^{\parallel}_{\text{above}} = \mathbf{E}^{\parallel}_{\text{below}}, \tag{2.32}$$

where E∥ stands for the components of E parallel to the surface. The
boundary conditions on E (Eqs. 2.31 and 2.32) can be combined into a
single formula:

$$\mathbf{E}_{\text{above}} - \mathbf{E}_{\text{below}} = \frac{\sigma}{\epsilon_0}\hat{\mathbf{n}}, \tag{2.33}$$

where n̂ is a unit vector perpendicular to the surface, pointing from “below”
to “above.”¹¹


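The potential, meanwhile, is continuous across any boundary (Fig. 2.38),
since

$$V_{\text{above}} - V_{\text{below}} = -\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l};$$

as the path length shrinks to zero, so too does the integral:

$$V_{\text{above}} = V_{\text{below}}. \tag{2.34}$$

However, the gradient of V inherits the discontinuity in E; since E = -∇V,
Eq. 2.33 implies that

$$\nabla V_{\text{above}} - \nabla V_{\text{below}} = -\frac{1}{\epsilon_0}\sigma\hat{\mathbf{n}}, \tag{2.35}$$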


or, more conveniently,

$$\frac{\partial V_{\text{above}}}{\partial n} - \frac{\partial V_{\text{below}}}{\partial n} = -\frac{1}{\epsilon_0}\sigma, \tag{2.36}$$

where

$$\frac{\partial V}{\partial n} \equiv \nabla V\cdot\hat{\mathbf{n}} \tag{2.37}$$

denotes the normal derivative of V (that is, the rate of change in the direction
perpendicular to the surface).


11 Notice that it doesn’t matter which side you call “above” and which “below,” since reversal
would switch the direction of n̂. Incidentally, if you’re only interested in the field due to the
(essentially flat) local patch of surface charge itself, the answer is (σ/2ε0)n̂ immediately
above the surface, and -(σ/2ε0)n̂ immediately below. This follows from Ex. 2.5, for if
you are close enough to the patch it “looks” like an infinite plane. Evidently, the entire
discontinuity in E is attributable to this local patch of surface charge.


Please note that these boundary conditions relate the fields and poten- 
tials just above and just below the surface. For example, the derivatives in 
Eq. 2.36 are the limiting values as we approach the surface from either side. 
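
As a rough numerical illustration of Eqs. 2.34 and 2.36, here is a minimal Python sketch for a uniformly charged spherical shell. It assumes the standard shell potential (constant inside, q/4πε0 r outside, reference point at infinity); the function names, the step size h, and the values of q and R are illustrative choices, not taken from the text. The one-sided finite differences mimic the "limiting values from either side."

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, in F/m


def shell_potential(r, q, R):
    """Potential of a uniformly charged shell of radius R (reference at infinity)."""
    k = 1.0 / (4.0 * math.pi * EPS0)
    return k * q / max(r, R)  # constant for r <= R, q/r outside


def normal_derivative_jump(q, R, h=1e-6):
    """Estimate dV/dn just above minus dV/dn just below the surface r = R."""
    dV_above = (shell_potential(R + h, q, R) - shell_potential(R, q, R)) / h
    dV_below = (shell_potential(R, q, R) - shell_potential(R - h, q, R)) / h
    return dV_above - dV_below


q, R = 1.0e-9, 0.1                  # illustrative: 1 nC on a shell of radius 10 cm
sigma = q / (4.0 * math.pi * R**2)  # uniform surface charge density
jump = normal_derivative_jump(q, R)
expected = -sigma / EPS0            # right-hand side of Eq. 2.36
print(jump, expected)
```

The two printed numbers should agree to within the finite-difference error (of relative order h/R), while the potential itself shows no jump across r = R.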


Problem 2.31. 


(a) Check that the results of Exs. 2.5 and 2.6, and Prob. 2.12, are consistent with Eq. 2.33. 

(b) Use Gauss’s law to find the field inside and outside a long hollow cylindrical tube, which
carries a uniform surface charge σ. Check that your result is consistent with Eq. 2.33.

(c) Check that the result of Ex. 2.8 is consistent with boundary conditions 2.34 and 2.36. 


2.4 Work and Energy in Electrostatics 


2.4.1 The Work It Takes to Move a Charge 


Suppose you have a stationary configuration of source charges, and you 
want to move a test charge Q from point a to point b (Fig. 2.39). Question: 
How much work will you have to do? At any point along the path, the 
electric force on Q is F = QE; the force you must exert, in opposition to 
this electrical force, is -QE. (If the sign bothers you, think about lifting a 
brick: gravity exerts a force mg downward, but you exert a force mg upward. 
Of course, you could apply an even greater force — then the brick would 
accelerate, and part of your effort would be “wasted” generating kinetic 
energy. What we're interested in here is the minimum force you must exert 
to do the job.) The work you do is therefore 


$$W = \int_{\mathbf{a}}^{\mathbf{b}} \mathbf{F}\cdot d\mathbf{l} = -Q\int_{\mathbf{a}}^{\mathbf{b}} \mathbf{E}\cdot d\mathbf{l} = Q[V(\mathbf{b}) - V(\mathbf{a})].$$

Notice that the answer is independent of the path you take from a to b;
in mechanics, then, we would call the electrostatic force “conservative.”
Dividing through by Q, we have

$$V(\mathbf{b}) - V(\mathbf{a}) = \frac{W}{Q}. \tag{2.38}$$

In words, the potential difference between points a and b is equal to the work it
takes, per unit charge, to carry a particle from a to b. In particular, if you want
to bring Q in from far away and stick it at point r, the work you must do is

$$W = Q[V(\mathbf{r}) - V(\infty)],$$

so, if you have set the reference point at infinity,

$$W = QV(\mathbf{r}). \tag{2.39}$$


In this sense, potential is potential energy (the work required to create the 
system) per unit charge (just as the field is the force per unit charge). 
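
To make the path independence concrete, the following Python sketch evaluates W = -Q∫E·dl numerically along two different paths in the field of a single point charge at the origin and compares both with Q[V(b) - V(a)]. Everything here is an illustrative assumption rather than part of the text: the source charge, the end points, the particular curved path, and the simple midpoint rule used for the line integral.

```python
import math

K = 8.9875517923e9  # 1/(4*pi*eps0), in N m^2/C^2


def field(p, q_src):
    """Field at point p = (x, y, z) of a point charge q_src sitting at the origin."""
    r = math.sqrt(sum(c * c for c in p))
    return tuple(K * q_src * c / r**3 for c in p)


def potential(p, q_src):
    """Potential of the same point charge, with the reference point at infinity."""
    return K * q_src / math.sqrt(sum(c * c for c in p))


def work_along(path, Q, q_src, n=20000):
    """Approximate W = -Q * (line integral of E . dl) along path(t), 0 <= t <= 1."""
    total = 0.0
    prev = path(0.0)
    for i in range(1, n + 1):
        cur = path(i / n)
        mid = tuple(0.5 * (u + v) for u, v in zip(prev, cur))
        dl = tuple(v - u for u, v in zip(prev, cur))
        total += sum(e * d for e, d in zip(field(mid, q_src), dl))
        prev = cur
    return -Q * total


a, b = (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)  # start and end points (meters)


def straight(t):
    """Straight segment from a to b."""
    return tuple(u + t * (v - u) for u, v in zip(a, b))


def curved(t):
    """Same end points, but bulging out of the xy plane along the way."""
    x, y, z = straight(t)
    return (x, y, z + 1.5 * math.sin(math.pi * t))


q_src, Q = 1.0e-9, 1.0e-9  # source and test charges (coulombs)
W_straight = work_along(straight, Q, q_src)
W_curved = work_along(curved, Q, q_src)
W_expected = Q * (potential(b, q_src) - potential(a, q_src))
print(W_straight, W_curved, W_expected)
```

All three numbers should coincide up to discretization error. The work here comes out negative, because b is farther from the (positive) source charge than a.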


2.4.2 The Energy of a Point Charge Distribution 


How much work would it take to assemble an entire collection of point
charges? Imagine bringing in the charges, one by one, from far away
(Fig. 2.40). The first charge, q1, takes no work, since there is no field yet
to fight against. Now bring in q2. According to Eq. 2.39, this will cost you
q2 V1(r2), where V1 is the potential due to q1, and r2 is the place we're
putting q2:

$$W_2 = \frac{1}{4\pi\epsilon_0}\, q_2\left(\frac{q_1}{𝓇_{12}}\right)$$

(𝓇12 is the distance between q1 and q2 once they are in position). As you
bring in each charge, nail it down in its final location, so it doesn’t move
when you bring in the next charge. Now bring in q3; this requires work
q3 V1,2(r3), where V1,2 is the potential due to charges q1 and q2, namely,
(1/4πε0)(q1/𝓇13 + q2/𝓇23). Thus

$$W_3 = \frac{1}{4\pi\epsilon_0}\, q_3\left(\frac{q_1}{𝓇_{13}} + \frac{q_2}{𝓇_{23}}\right).$$

Similarly, the extra work to bring in q4 will be

$$W_4 = \frac{1}{4\pi\epsilon_0}\, q_4\left(\frac{q_1}{𝓇_{14}} + \frac{q_2}{𝓇_{24}} + \frac{q_3}{𝓇_{34}}\right).$$

The total work necessary to assemble the first four charges, then, is

$$W = \frac{1}{4\pi\epsilon_0}\left(\frac{q_1 q_2}{𝓇_{12}} + \frac{q_1 q_3}{𝓇_{13}} + \frac{q_1 q_4}{𝓇_{14}} + \frac{q_2 q_3}{𝓇_{23}} + \frac{q_2 q_4}{𝓇_{24}} + \frac{q_3 q_4}{𝓇_{34}}\right).$$


You see the general rule: take the product of each pair of charges, divide
by their separation distance, and add it all up:

$$W = \frac{1}{4\pi\epsilon_0}\sum_{i=1}^{n}\sum_{j>i}^{n}\frac{q_i q_j}{𝓇_{ij}}. \tag{2.40}$$

The stipulation j > i is to remind you not to count the same pair twice.
A nicer way to accomplish this is intentionally to count each pair twice, and
then divide by 2:

$$W = \frac{1}{8\pi\epsilon_0}\sum_{i=1}^{n}\sum_{j\neq i}^{n}\frac{q_i q_j}{𝓇_{ij}} \tag{2.41}$$

(we must still exclude i = j, of course). Notice that in this form the answer
plainly does not depend on the order in which you assemble the charges,
since every pair occurs in the sum.

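Finally, let’s pull out the factor q_i:

$$W = \frac{1}{2}\sum_{i=1}^{n} q_i\left(\sum_{j\neq i}^{n}\frac{1}{4\pi\epsilon_0}\frac{q_j}{𝓇_{ij}}\right).$$

The term in parentheses is the potential at point r_i (the position of q_i) due to
all the other charges (all of them, now, not just the ones that were present
at some stage during the assembly). Thus,

$$W = \frac{1}{2}\sum_{i=1}^{n} q_i V(\mathbf{r}_i). \tag{2.42}$$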


That’s how much work it takes to assemble a configuration of point charges; 
it’s also the amount of work you'd get back if you dismantled the system. In 
the meantime, it represents energy stored in the configuration (“potential” 
energy, if you insist, though for obvious reasons I prefer to avoid that word 
in this context). 
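
The equivalence of the pair sum (Eq. 2.40) and the half-sum over charges (Eq. 2.42) is easy to check numerically. The sketch below is only an illustration: the three charges and their positions are arbitrary made-up values, and the function names are hypothetical.

```python
import math
from itertools import combinations

K = 8.9875517923e9  # 1/(4*pi*eps0), in N m^2/C^2


def energy_pairs(charges, positions):
    """Eq. 2.40: sum q_i q_j / (separation) over each distinct pair, counted once."""
    return K * sum(
        charges[i] * charges[j] / math.dist(positions[i], positions[j])
        for i, j in combinations(range(len(charges)), 2)
    )


def energy_half_qV(charges, positions):
    """Eq. 2.42: (1/2) sum of q_i V(r_i), with V due to all the OTHER charges."""
    total = 0.0
    for i, (qi, ri) in enumerate(zip(charges, positions)):
        V_i = sum(
            K * qj / math.dist(ri, rj)
            for j, (qj, rj) in enumerate(zip(charges, positions))
            if j != i
        )
        total += qi * V_i
    return 0.5 * total


# Arbitrary illustrative configuration (coulombs, meters).
charges = [1.0e-9, -2.0e-9, 3.0e-9]
positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]

W_pairs = energy_pairs(charges, positions)
W_half_qV = energy_half_qV(charges, positions)
print(W_pairs, W_half_qV)
```

Shuffling the order of the lists leaves both results unchanged, which is the order-independence noted above. The energy of this particular configuration happens to be negative.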


Problem 2.32. 


(a) Three charges are situated at the corners of a square (side length a), as shown in Fig. 2.41. 
How much work does it take to bring in another charge, +q, from far away and place it in 
the fourth corner? 

(b) How much work does it take to assemble the entire configuration of four charges?

Fig. 2.41


Problem 2.33. Two positive point charges, q_A and q_B (masses m_A and m_B) are at rest,
held together by a massless string of length a. Now the string is cut, and the particles fly off
in opposite directions. How fast is each one going, when they are far apart?


Problem 2.34. Consider an infinite chain of point charges, ±q (with alternating signs), strung
out along the x-axis, each a distance a from its nearest neighbors. Find the work per particle
required to assemble this system. [Partial answer: -αq²/(4πε0 a), for some dimensionless
number α; your problem is to determine α. It is known as the Madelung constant. Calculating
the Madelung constant for two- and three-dimensional arrays is much more subtle and
difficult.]


2.4.3 The Energy of a Continuous Charge Distribution 


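For a volume charge density ρ, Eq. 2.42 becomes

$$W = \frac{1}{2}\int \rho V\,d\tau. \tag{2.43}$$

(The corresponding integrals for line and surface charges would be ∫ λV dl
and ∫ σV da.) There is a lovely way to rewrite this result, in which ρ and V
are eliminated in favor of E. First use Gauss’s law to express ρ in terms of E:

$$\rho = \epsilon_0\nabla\cdot\mathbf{E}, \quad\text{so}\quad W = \frac{\epsilon_0}{2}\int (\nabla\cdot\mathbf{E})V\,d\tau.$$

Now use integration by parts (Eq. 1.59) to transfer the derivative from
E to V:

$$W = \frac{\epsilon_0}{2}\left[-\int \mathbf{E}\cdot(\nabla V)\,d\tau + \oint V\mathbf{E}\cdot d\mathbf{a}\right].$$

But ∇V = -E, so

$$W = \frac{\epsilon_0}{2}\left(\int_{\mathcal{V}} E^2\,d\tau + \oint_{\mathcal{S}} V\mathbf{E}\cdot d\mathbf{a}\right). \tag{2.44}$$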


But what volume V is this we’re integrating over? Let’s go back to the 
formula we started with, Eq. 2.43. From its derivation, it is clear that we 
should integrate over the region where the charge is located. But actually, 
any larger volume would do just as well: the “extra” territory we throw in 
will contribute nothing to the integral, since ρ = 0 out there. With this in
mind, we return to Eq. 2.44. What happens here, as we enlarge the volume
beyond the minimum necessary to trap all the charge? Well, the integral of
E² can only increase (the integrand being positive); evidently the surface
integral must decrease correspondingly, to leave the sum intact. (In fact, at
large distances from the charge, E goes like 1/r² and V like 1/r, while the
surface area grows like r²; roughly speaking, then, the surface integral goes
down like 1/r.) Please understand: Eq. 2.44 gives you the correct energy
W, whatever volume you use (as long as it encloses all the charge), but
the contribution from the volume integral goes up, and that of the surface
integral goes down, as you take larger and larger volumes. In particular, why
not integrate over all space? Then the surface integral goes to zero, and we
are left with

$$W = \frac{\epsilon_0}{2}\int E^2\,d\tau \quad\text{(all space)}. \tag{2.45}$$


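Example 2.9. Find the energy of a uniformly charged spherical shell
of total charge q and radius R.


Solution 1. Use Eq. 2.43, in the version appropriate to surface charges:

$$W = \frac{1}{2}\int \sigma V\,da.$$

Now, the potential at the surface of this sphere is (1/4πε0)q/R, a
constant (Ex. 2.7), so

$$W = \frac{1}{8\pi\epsilon_0}\frac{q}{R}\int \sigma\,da = \frac{1}{8\pi\epsilon_0}\frac{q^2}{R}.$$


Solution 2. Use Eq. 2.45. Inside the sphere, E = 0; outside,

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{q}{r^2}\hat{\mathbf{r}}, \quad\text{so}\quad E^2 = \frac{q^2}{(4\pi\epsilon_0)^2 r^4}.$$

Therefore,

$$W = \frac{\epsilon_0}{2(4\pi\epsilon_0)^2}\int_{\text{outside}} \left(\frac{q^2}{r^4}\right)(r^2\sin\theta\,dr\,d\theta\,d\phi) = \frac{1}{32\pi^2\epsilon_0}\,q^2\,4\pi\int_R^\infty \frac{1}{r^2}\,dr = \frac{1}{8\pi\epsilon_0}\frac{q^2}{R}.$$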
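
The trade-off between the volume and surface terms of Eq. 2.44 can be watched numerically for this same shell. The sketch below is a minimal illustration under stated assumptions: it takes the integration region to be a ball of radius a ≥ R concentric with the shell, uses the shell's field and potential quoted in the example, and evaluates the radial integral with a plain midpoint rule. The function name and the numerical values are made up for the illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, in F/m


def shell_energy_terms(q, R, a, n=100000):
    """Volume and surface terms of Eq. 2.44 for a charged shell of radius R,
    integrated over a concentric ball of radius a >= R."""
    k = 1.0 / (4.0 * math.pi * EPS0)
    # Volume term: E = 0 for r < R, so only R <= r <= a contributes.
    dr = (a - R) / n
    vol = 0.0
    for i in range(n):
        r = R + (i + 0.5) * dr        # midpoint of the i-th radial slice
        E = k * q / r**2
        vol += E * E * 4.0 * math.pi * r * r * dr
    vol *= EPS0 / 2.0
    # Surface term: V and E are constant over the sphere r = a.
    surf = (EPS0 / 2.0) * (k * q / a) * (k * q / a**2) * 4.0 * math.pi * a * a
    return vol, surf


q, R = 1.0e-9, 0.1                                # illustrative: 1 nC, 10 cm
exact = q * q / (8.0 * math.pi * EPS0 * R)        # result of Ex. 2.9
results = [shell_energy_terms(q, R, a) for a in (R, 2 * R, 10 * R, 100 * R)]
for vol, surf in results:
    print(vol, surf, vol + surf, exact)
```

As a grows, the first column rises and the second falls like 1/a, while their sum stays fixed at the value found in the example.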


Problem 2.35. Find the energy stored in a uniformly charged solid sphere of radius R and 
charge q. Do it three different ways: 


(a) Use Eq. 2.43. You found the potential in Prob. 2.22. 
(b) Use Eq. 2.45. Don't forget to integrate over all space. 
(c) Use Eq. 2.44. Take a spherical volume of radius a. What happens as a → ∞?


Problem 2.36. Here is a fourth way of computing the energy of a uniformly charged solid
sphere (you might call it the method of assembly): Build it up like a snowball, layer by layer, 
each time bringing in an infinitesimal charge dq from far away and smearing it uniformly over 
the surface, thereby increasing the radius. How much work dW does it take to advance the 
radius from r to r + dr? Integrate your result to find the work necessary to create the entire 
sphere of radius R and total charge q. 


2.4.4 Comments on Electrostatic Energy 


(i) A perplexing “inconsistency.” Equation 2.45 clearly implies that the
energy of a stationary charge distribution is always positive. On the other
hand, Eq. 2.42 (from which 2.45 was derived) can be positive or negative.
For instance, according to Eq. 2.42, the energy of two equal but opposite
charges a distance 𝓇 apart is -(1/4πε0)(q²/𝓇). What’s gone wrong? Which
equation is correct?

The answer is that both are correct, but they address slightly different 
questions. Equation 2.42 does not take into account the work necessary 
to make the point charges in the first place; we started with point charges 
and simply found the work required to bring them together. This is wise 
strategy, since Eq. 2.45 indicates that the energy of a point charge is in fact 
infinite: 


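$$W = \frac{\epsilon_0}{2(4\pi\epsilon_0)^2}\int \left(\frac{q^2}{r^4}\right)(r^2\sin\theta\,dr\,d\theta\,d\phi) = \frac{q^2}{8\pi\epsilon_0}\int_0^\infty \frac{1}{r^2}\,dr = \infty.$$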


Equation 2.45 is more complete, in the sense that it tells you the total energy 
stored in a charge configuration, but Eq. 2.42 is more appropriate when 
you're dealing with point charges, because we prefer (for good reason!) 
to leave out that portion of the total energy that is attributable to the 
fabrication of the point charges themselves. In practice, after all, the point 
charges (electrons, say) are given to us ready-made; all we do is move them 
around. Since we did not put them together, and we cannot take them 
apart, it is immaterial how much work the process would involve. (Still, the 
infinite energy of a point charge is a recurring source of embarrassment 
for electromagnetic theory, afflicting the quantum version as well as the 
classical. We shall return to the problem in Chapter 11.) 

Now, you may wonder where the inconsistency crept into an ostensibly 
watertight derivation. The “flaw” lies between Eqs. 2.42 and 2.43: in the 
former, V(r_i) represents the potential due to all the other charges but not q_i,
whereas in the latter, V(r) is the full potential. For a continuous distribution,
there is no distinction, since the amount of charge right at the point r is 
vanishingly small, and its contribution to the potential is zero. But in the 
presence of point charges you'd better stick with Eq. 2.42. 

(ii) Where is the energy stored? Equations 2.43 and 2.45 offer two different
ways of calculating the same thing. The first is an integral over
the charge distribution; the second is an integral over the field. These
can involve completely different regions. For instance, in the case of the 
spherical shell (Ex. 2.9), the charge is confined to the surface, whereas the 
electric field is everywhere outside this surface. Where is the energy, then? 
Is it stored in the field, as Eq. 2.45 suggests, or is it stored in the charge, 
as Eq. 2.43 implies? At the present stage this is simply an unanswerable 
question: I can tell you what the total energy is, and I can provide you with 
several different ways to calculate it, but it is pointless to worry about where 
the energy is located. In the context of radiation theory (Chapter 11) it is 
useful (and in general relativity it is essential) to regard the energy as stored 
in the field, with a density 


$$\frac{\epsilon_0}{2}E^2 = \text{energy per unit volume}. \tag{2.46}$$

But in electrostatics one could just as well say it is stored in the charge, with
a density (1/2)ρV. The difference is purely a question of bookkeeping.¹²

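(iii) The superposition principle. Because electrostatic energy is quadratic
in the fields, it does not obey a superposition principle. The energy of a
compound system is not the sum of the energies of its parts considered
separately; there are also “cross terms”:

$$\begin{aligned} W_{\text{tot}} &= \frac{\epsilon_0}{2}\int E^2\,d\tau = \frac{\epsilon_0}{2}\int (\mathbf{E}_1 + \mathbf{E}_2)^2\,d\tau \\ &= \frac{\epsilon_0}{2}\int \left(E_1^2 + E_2^2 + 2\mathbf{E}_1\cdot\mathbf{E}_2\right)d\tau \\ &= W_1 + W_2 + \epsilon_0\int \mathbf{E}_1\cdot\mathbf{E}_2\,d\tau. \end{aligned} \tag{2.47}$$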


For example, if you double the charge everywhere, you quadruple the total 
energy. 
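
The doubling claim can be read off from Eq. 2.47 as a special case. Doubling the charge everywhere amounts to superposing two identical copies of the original distribution, so one may take E2 = E1 (hence W2 = W1) and use Eq. 2.45 to identify the cross term. This is only a worked instance of the formula above, not a new result:

$$W_{\text{tot}} = W_1 + W_1 + \epsilon_0\int \mathbf{E}_1\cdot\mathbf{E}_1\,d\tau = 2W_1 + 2\left(\frac{\epsilon_0}{2}\int E_1^2\,d\tau\right) = 4W_1.$$

Here the cross term alone contributes 2W1, as much as the two "separate" energies combined.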


Problem 2.37. Consider two concentric spherical shells, of radii a and b. Suppose the inner 
one carries a charge q, and the outer one a charge -q (both of them uniformly distributed over
the surface). Calculate the energy of this configuration, (a) using Eq. 2.45, and (b) using Eq. 2.47 
and the results of Ex. 2.9. 


Problem 2.38. Find the interaction energy (ε0 ∫ E1 · E2 dτ in Eq. 2.47) for two point
charges, q1 and q2, a distance a apart. [Hint: Put q1 at the origin and q2 on the z-axis; use
spherical coordinates, and do the r integral first.]


12 Ever since Maxwell, physicists have agonized over the “location” of electrostatic energy. 
Feynman, in his inimitable way, lays the matter to rest: “Who cares? What’s the meaning 
of such a question? ... If we restrict ourselves to electrostatics there is really no way to tell 
where the energy is located.” R. P. Feynman, R. B. Leighton, and M. Sands, The Feynman 
Lectures on Physics, Vol. 2, Addison-Wesley, Reading, MA (1967), Chap. 8, p. 8.


2.5 Conductors


2.5.1 Basic Properties 


In an insulator, such as glass or rubber, each electron is on a short leash, 
attached to a particular atom. In a metallic conductor, by contrast, one or 
more electrons per atom are free to roam. (In liquid conductors such as salt 
water, it is ions that do the moving.) A perfect conductor would contain an 
unlimited supply of free charges. In real life there are no perfect conductors, 
but metals come pretty close, for most purposes. 

From this definition, the basic electrostatic properties of ideal conduc- 
tors immediately follow: 

(i) E = 0 inside a conductor. Why? Because if there were any field, those
free charges would move, and it wouldn’t be electrostatics any more.
Hmm... that’s hardly a satisfactory explanation; maybe all it proves is that
you can’t have electrostatics when conductors are present. We had better
examine what happens when you put a conductor into an external electric
field E0 (Fig. 2.42). Initially, the field will drive any free positive charges to
the right, and negative ones to the left. (In practice, it’s the negative charges,
the electrons, that do the moving, but when they depart, the right-hand side
is left with a net positive charge, the stationary nuclei, so it doesn’t really
matter which charges move; the effect is the same.) When they come to the
edge of the material, the charges pile up: plus on the right-hand side, minus
on the left. Now, these induced charges produce a field of their own, E1,
which, as you can see from the figure, is in the opposite direction to E0. That’s
the crucial point, for it means that the field of the induced charges tends to
cancel the original field. Charge will continue to flow until this cancelation
is complete, and the resultant field inside the conductor is precisely zero.¹³
The whole process is practically instantaneous.




13 Outside the conductor the field is not zero, for here E0 and E1 do not tend to cancel.


(ii) ρ = 0 inside a conductor. This follows from Gauss’s law: ∇ · E = ρ/ε0.
If E is zero, so also is ρ. There is still charge around, but exactly as much
plus as minus, so the net charge density in the interior is zero.

(iii) Any net charge resides on the surface. That’s the only place left. 

(iv) A conductor is an equipotential. For if a and b are any two points
within (or at the surface of) a given conductor, V(b) - V(a) = -∫_a^b E · dl = 0,
and hence V(a) = V(b).

(v) E is perpendicular to the surface, just outside a conductor. Otherwise, 
as in (i), charge will immediately flow around the surface until it kills off 
the tangential component (Fig. 2.43). (Perpendicular to the surface, charge 
cannot flow, of course, since it is confined to the conducting object.) 

I think it is astonishing that the charge on a conductor flows to the 
surface. Because of their mutual repulsion, the charges naturally spread 
out as much as possible, but for all of them to go to the surface seems like 
a waste of the interior space. Surely we could do better, from the point of 
view of making each charge as far as possible from its neighbors, to sprinkle 
some of them throughout the volume .... Well, it simply is not so. You do 
best to put all the charge on the surface, and this is true regardless of the
size or shape of the conductor.¹⁴

The problem can also be phrased in terms of energy. Like any other 
free dynamical system, the charge on a conductor will seek the con- 
figuration that minimizes its potential energy. Property (iii) asserts that 
the electrostatic energy of a solid object (with specified shape and total 
charge) is a minimum when that charge is spread over the surface. For 
instance, the energy of a sphere is (1/8πε0)(q²/R) if the charge is uniformly
distributed over the surface, as we found in Ex. 2.9, but it is greater,
(3/20πε0)(q²/R), if the charge is uniformly distributed throughout the
volume (Prob. 2.35).


14 By the way, the one- and two-dimensional analogs are quite different: The charge on a 
conducting disk does not all go to the perimeter (R. Friedberg, Am. J. Phys. 61, 1084 (1993)), 
nor does the charge on a conducting needle go to the two ends (D. J. Griffiths and Y. Li, 
Am. J. Phys. 64, 706 (1996)) — see Prob. 2.58. Moreover, if the exponent of r in Coulomb’s 
law were not precisely 2, the charge on a solid conductor would not all go to the surface — 
see D. J. Griffiths and D. Z. Uvanovic, Am. J. Phys. 69, 435 (2001), and Prob. 2.55g.


2.5.2 Induced Charges


If you hold a charge +q near an uncharged spherical conductor (Fig. 2.44),
the two will attract one another. The reason for this is that q will pull minus 
charges over to the near side and repel plus charges to the far side. (Another 
way to think of it is that the charge moves around in such a way as to kill 
off the field of q for points inside the conductor, where the total field must 
be zero.) Since the negative induced charge is closer to q, there is a net force 
of attraction. (In Chapter 3 we shall calculate this force explicitly, for the 
case of a spherical conductor.) 

When I speak of the field, charge, or potential “inside” a conductor, I 
mean in the “meat” of the conductor; if there is some hollow cavity in 
the conductor, and within that cavity you put some charge, then the field 
in the cavity will not be zero. But in a remarkable way the cavity and its 
contents are electrically isolated from the outside world by the surrounding 
conductor (Fig. 2.45). No external fields penetrate the conductor; they are 
canceled at the outer surface by the induced charge there. Similarly, the 
field due to charges within the cavity is canceled, for all exterior points, by 
the induced charge on the inner surface. However, the compensating charge 
left over on the outer surface of the conductor effectively “communicates” 
the presence of q to the outside world. The total charge induced on the 
cavity wall is equal and opposite to the charge inside, for if we surround 
the cavity with a Gaussian surface, all points of which are in the conductor 
(Fig. 2.45), ∮ E · da = 0, and hence (by Gauss’s law) the net enclosed charge
must be zero. But Q_enc = q + q_induced, so q_induced = -q. And if the conductor
as a whole is electrically neutral, there must be a charge +q on its outer 
surface. 


Example 2.10. An uncharged spherical conductor, centered at the origin,
has a cavity of some weird shape carved out of it (Fig. 2.46). Somewhere
within the cavity is a charge q. Question: What is the field outside
the sphere?




Solution. At first glance, it would appear that the answer depends on
the shape of the cavity and the location of the charge. But that’s wrong:
the answer is

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{q}{r^2}\hat{\mathbf{r}}$$

regardless. The conductor conceals from us all information concerning
the nature of the cavity, revealing only the total charge it contains.
How can this be? Well, the charge +q induces an opposite
charge -q on the wall of the cavity, which distributes itself in such
a way that its field cancels that of q, for all points exterior to the
cavity. Since the conductor carries no net charge, this leaves +q to
distribute itself uniformly over the surface of the sphere. (It’s uniform
because the asymmetrical influence of the point charge +q is
negated by that of the induced charge -q on the inner surface.)
For points outside the sphere, then, the only thing that survives is
the field of the leftover +q, uniformly distributed over the outer
surface.

It may occur to you that in one respect this argument is open to 
challenge: There are actually three fields at work here: E_q, E_induced, and
E_leftover. All we know for certain is that the sum of the three is zero
inside the conductor, yet I claimed that the first two alone cancel, while 
the third is separately zero there. Moreover, even if the first two cancel 
within the conductor, who is to say they still cancel for points outside? 
They do not, after all, cancel for points inside the cavity. I cannot give 
you a completely satisfactory answer at the moment, but this much 
at least is true: There exists a way of distributing —q over the inner 
surface so as to cancel the field of q at all points exterior to the cavity. 
For that same cavity could have been carved out of a huge spherical 
conductor with a radius of 27 miles, or light years, or whatever. In 
that case, the leftover +q on the outer surface is simply too far away 
to produce a significant field, and the other two fields would have to 
accomplish the cancelation by themselves. So we know they can do 
it... but are we sure they choose to? Perhaps for small spheres nature
prefers some complicated three-way cancelation. Nope: As we'll see in
the uniqueness theorems of Chapter 3, electrostatics is very stingy with 
its options; there is always precisely one way — no more — of distributing 
the charge on a conductor so as to make the field inside zero. Having 
found a possible way, we are guaranteed that no alternative exists, even 
in principle. 


If a cavity surrounded by conducting material is itself empty of charge, 
then the field within the cavity is zero. For any field line would have to 
begin and end on the cavity wall, going from a plus charge to a minus 
charge (Fig. 2.47). Letting that field line be part of a closed loop, the rest 
of which is entirely inside the conductor (where E = 0), the integral ∮ E · dl
is distinctly positive, in violation of Eq. 2.19. It follows that E = 0 within 
an empty cavity, and there is in fact no charge on the surface of the cavity. 
(This is why you are relatively safe inside a metal car during a thunderstorm 
— you may get cooked, if lightning strikes, but you will not be electrocuted. 
The same principle applies to the placement of sensitive apparatus inside 
a grounded Faraday cage, to shield out stray electric fields. In practice, the 
enclosure doesn’t even have to be a solid conductor — chicken wire will 
often suffice.) 


Problem 2.39. A metal sphere of radius R, carrying charge q, is surrounded by a thick 
concentric metal shell (inner radius a, outer radius b, as in Fig. 2.48). The shell carries no net 
charge. 


(a) Find the surface charge density σ at R, at a, and at b.

(b) Find the potential at the center, using infinity as the reference point. 

(c) Now the outer surface is touched to a grounding wire, which drains off charge and lowers 
its potential to zero (same as at infinity). How do your answers to (a) and (b) change? 


Problem 2.40. Two spherical cavities, of radii a and b, are hollowed out from the interior of a 
(neutral) conducting sphere of radius R (Fig. 2.49). At the center of each cavity a point charge 
is placed; call these charges q_a and q_b.


(a) Find the surface charge densities σ_a, σ_b, and σ_R.
(b) What is the field outside the conductor? 




(c) What is the field within each cavity?

(d) What is the force on q_a and q_b?

(e) Which of these answers would change if a third charge, q_c, were brought near the conductor
(but outside it)?


Fig. 2.48


Problem 2.41. 


(a) A point charge q is inside a cavity in an uncharged conductor (Fig. 2.45). Is the force on q
necessarily zero?¹⁵
(b) Is the force between a point charge and a nearby uncharged conductor always attractive?¹⁶


2.5.3 Surface Charge and the Force on a Conductor 


Because the field inside a conductor is zero, Eq. 2.33 requires that the field
immediately outside is

$$\mathbf{E} = \frac{\sigma}{\epsilon_0}\hat{\mathbf{n}}, \tag{2.48}$$

consistent with our earlier conclusion that the field is normal to the surface.
In terms of potential, Eq. 2.36 yields

$$\sigma = -\epsilon_0\frac{\partial V}{\partial n}. \tag{2.49}$$

These equations enable you to calculate the surface charge on a conductor,
if you can determine E or V; we shall use them frequently in the next
chapter.

In the presence of an electric field, any surface charge will experience a
force; the force per unit area, f, is σE. But there’s a problem here, for the
electric field is discontinuous at a surface charge, so what are we supposed to
use: E_above, E_below, or something in between? The answer is that we should
use the average of the two:

$$\mathbf{f} = \sigma\mathbf{E}_{\text{average}} = \frac{1}{2}\sigma(\mathbf{E}_{\text{above}} + \mathbf{E}_{\text{below}}). \tag{2.50}$$


15 This problem was suggested by Nelson Christensen. 
16 See M. Levin and S. G. Johnson, Am. J. Phys. 79, 843 (2011).

Why the average? The reason is very simple, though the telling makes it
sound complicated: Let’s focus our attention on a tiny patch of surface 
surrounding the point in question (Fig. 2.50). (Make it small enough so it is 
essentially flat and the surface charge on it essentially constant.) The total 
field consists of two parts — that attributable to the patch itself, and that 
due to everything else (other regions of the surface, as well as any external 
sources that may be present): 


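$$\mathbf{E} = \mathbf{E}_{\text{patch}} + \mathbf{E}_{\text{other}}.$$

Now, the patch cannot exert a force on itself, any more than you can lift
yourself by standing in a basket and pulling up on the handles. The force on
the patch, then, is due exclusively to E_other, and this suffers no discontinuity
(if we removed the patch, the field in the “hole” would be perfectly smooth).
The discontinuity is due entirely to the charge on the patch, which puts out
a field (σ/2ε0) on either side, pointing away from the surface. Thus,

$$\mathbf{E}_{\text{above}} = \mathbf{E}_{\text{other}} + \frac{\sigma}{2\epsilon_0}\hat{\mathbf{n}}, \qquad \mathbf{E}_{\text{below}} = \mathbf{E}_{\text{other}} - \frac{\sigma}{2\epsilon_0}\hat{\mathbf{n}},$$

and hence

$$\mathbf{E}_{\text{other}} = \frac{1}{2}(\mathbf{E}_{\text{above}} + \mathbf{E}_{\text{below}}) = \mathbf{E}_{\text{average}}.$$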

Averaging is really just a device for removing the contribution of the patch 
itself. 

That argument applies to any surface charge; in the particular case of
a conductor, the field is zero inside and (σ/ε0)n̂ outside (Eq. 2.48), so the
average is (σ/2ε0)n̂, and the force per unit area is

$$\mathbf{f} = \frac{1}{2\epsilon_0}\sigma^2\hat{\mathbf{n}}. \tag{2.51}$$

This amounts to an outward electrostatic pressure on the surface, tending
to draw the conductor into the field, regardless of the sign of σ. Expressing
the pressure in terms of the field just outside the surface,

$$P = \frac{\epsilon_0}{2}E^2. \tag{2.52}$$
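
To get a feeling for the size of this pressure, here is a small Python sketch that evaluates Eqs. 2.51 and 2.52 for one illustrative case. The field strength used, about 3 × 10⁶ V/m (roughly the breakdown field of dry air), is an assumption chosen for the example, and the function names are hypothetical.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, in F/m


def pressure_from_field(E):
    """Eq. 2.52: outward pressure in terms of the field just outside the surface."""
    return 0.5 * EPS0 * E**2


def pressure_from_sigma(sigma):
    """Magnitude of Eq. 2.51: pressure in terms of the surface charge density."""
    return sigma**2 / (2.0 * EPS0)


E_outside = 3.0e6           # V/m, illustrative (about the breakdown field of air)
sigma = EPS0 * E_outside    # Eq. 2.48 relates sigma to the field just outside
P_field = pressure_from_field(E_outside)
P_sigma = pressure_from_sigma(sigma)
print(P_field, P_sigma)     # in pascals
```

Both routes give the same pressure, a few tens of pascals, which is tiny compared with atmospheric pressure (about 10⁵ Pa). Flipping the sign of sigma changes nothing, since only σ² enters.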
Problem 2.42. Two large metal plates (each of area A) are held a small distance d apart.
Suppose we put a charge Q on each plate; what is the electrostatic pressure on the plates? 


Problem 2.43. A metal sphere of radius R carries a total charge Q. What is the force of 
repulsion between the “northern” hemisphere and the “southern” hemisphere? 


2.5.4 Capacitors 


Suppose we have two conductors, and we put charge +Q on one and −Q on
the other (Fig. 2.51). Since V is constant over a conductor, we can speak
unambiguously of the potential difference between them:

\[ V = V_+ - V_- = -\int_{(-)}^{(+)} \mathbf{E}\cdot d\mathbf{l}. \]

We don’t know how the charge distributes itself over the two conductors,
and calculating the field would be a nightmare if their shapes are complicated,
but this much we do know: E is proportional to Q, for E is given by
Coulomb’s law:

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\int \frac{\rho}{𝓇^2}\,\hat{\boldsymbol{𝓇}}\,d\tau, \]

so if you double ρ, you double E. [Wait a minute! How do we know that
doubling Q (and also −Q) simply doubles ρ? Maybe the charge moves
around into a completely different configuration, quadrupling ρ in some
places and halving it in others, just so the total charge on each conductor
is doubled. The fact is that this concern is unwarranted: doubling Q does
double ρ everywhere; it doesn’t shift the charge around. The proof of this
will come in Chapter 3; for now you'll just have to trust me.]

Since E is proportional to Q, so also is V. The constant of proportion- 
ality is called the capacitance of the arrangement: 


\[ C \equiv \frac{Q}{V}. \tag{2.53} \]


Capacitance is a purely geometrical quantity, determined by the sizes, 
shapes, and separation of the two conductors. In SI units, C is measured 
in farads (F); a farad is a coulomb-per-volt. Actually, this turns out to be 
inconveniently large; more practical units are the microfarad (10⁻⁶ F) and
the picofarad (10⁻¹² F).

[Fig. 2.51: two conductors, labeled +Q and −Q]

Notice that V is, by definition, the potential of the positive conductor
minus that of the negative one; likewise, Q is the charge on the positive 
conductor. Accordingly, capacitance is an intrinsically positive quantity. 
(By the way, you will occasionally hear someone speak of the capacitance of 
a single conductor. In this case the “second conductor,” with the negative 
charge, is an imaginary spherical shell of infinite radius surrounding the 
one conductor. It contributes nothing to the field, so the capacitance is 
given by Eq. 2.53, where V is the potential with infinity as the reference 
point.) 


Example 2.11. Find the capacitance of a parallel-plate capacitor con- 
sisting of two metal surfaces of area A held a distance d apart 
(Fig. 2.52). 


Solution. If we put +Q on the top and −Q on the bottom, they
will spread out uniformly over the two surfaces, provided the area
is reasonably large and the separation small.¹⁷ The surface charge
density, then, is σ = Q/A on the top plate, and so the field, according
to Ex. 2.6, is (1/ε₀)Q/A. The potential difference between the plates is
therefore

\[ V = \frac{Q}{A\epsilon_0}\,d, \]

and hence

\[ C = \frac{A\epsilon_0}{d}. \tag{2.54} \]

If, for instance, the plates are square, with sides 1 cm long, and they are
held 1 mm apart, then the capacitance is 9 × 10⁻¹³ F.
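
The number quoted for the square plates is easy to check from Eq. 2.54. Here is a minimal Python sketch; the function name is an illustrative assumption, and the plate dimensions are the ones in the example.

```python
EPS0 = 8.8541878128e-12  # permittivity of free space, in F/m


def parallel_plate_capacitance(area, separation):
    """Capacitance (F) of an ideal parallel-plate capacitor, C = eps0 A / d (Eq. 2.54)."""
    return EPS0 * area / separation


# Square plates with 1 cm sides, held 1 mm apart.
c = parallel_plate_capacitance(area=0.01**2, separation=1.0e-3)
print(f"C = {c:.2e} F")
```

The result, about 8.85 × 10⁻¹³ F, rounds to the 9 × 10⁻¹³ F quoted above. Edge effects are ignored, just as in the example.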


Example 2.12. Find the capacitance of two concentric spherical metal 
shells, with radii a and b. 


Solution. Place charge +Q on the inner sphere, and -Q on the outer 
one. The field between the spheres is 


\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{Q}{r^2}\,\hat{\mathbf{r}}, \]

so the potential difference between them is

\[ V = -\int_b^a \mathbf{E}\cdot d\mathbf{l} = -\frac{Q}{4\pi\epsilon_0}\int_b^a \frac{1}{r^2}\,dr = \frac{Q}{4\pi\epsilon_0}\left(\frac{1}{a} - \frac{1}{b}\right). \]

As promised, V is proportional to Q; the capacitance is

\[ C = \frac{Q}{V} = 4\pi\epsilon_0\,\frac{ab}{(b-a)}. \]


17 The exact solution is not easy, even for the simpler case of circular plates. See G. T. Carlson
and B. L. Illman, Am. J. Phys. 62, 1099 (1994).


To “charge up” a capacitor, you have to remove electrons from the 
positive plate and carry them to the negative plate. In doing so, you fight 
against the electric field, which is pulling them back toward the positive 
conductor and pushing them away from the negative one. How much work 
does it take, then, to charge the capacitor up to a final amount Q? Suppose 
that at some intermediate stage in the process the charge on the positive 
plate is q, so that the potential difference is q/C. According to Eq. 2.38, the
work you must do to transport the next piece of charge, dq, is

\[ dW = \left(\frac{q}{C}\right) dq. \]

The total work necessary, then, to go from q = 0 to q = Q, is

\[ W = \int_0^Q \left(\frac{q}{C}\right) dq = \frac{1}{2}\frac{Q^2}{C}, \]

or, since Q = CV,

\[ W = \frac{1}{2}CV^2, \tag{2.55} \]


where V is the final potential of the capacitor. 
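
The charging argument can be mimicked numerically by adding up the increments dW = (q/C) dq. The Python sketch below does just that; the function name, the number of steps, and the sample values of C and Q are assumptions chosen for illustration.

```python
import math


def charging_work(capacitance, q_final, steps=10_000):
    """Sum the increments dW = (q / C) dq from q = 0 up to q = q_final."""
    dq = q_final / steps
    work = 0.0
    for k in range(steps):
        q_mid = (k + 0.5) * dq          # charge already on the plate, at mid-step
        work += (q_mid / capacitance) * dq
    return work


C = 1.0e-6    # assumed capacitance, farads
Q = 2.0e-6    # assumed final charge, coulombs
V = Q / C     # final potential difference, volts

print(charging_work(C, Q), 0.5 * Q**2 / C, 0.5 * C * V**2)
```

The three printed numbers should agree: the step-by-step sum, Q²/2C, and Eq. 2.55.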


Problem 2.44. Find the capacitance per unit length of two coaxial metal cylindrical tubes, of 
radii a and b (Fig. 2.53). 




Fig. 2.53 


Problem 2.45. Suppose the plates of a parallel-plate capacitor move closer together by an
infinitesimal distance ε, as a result of their mutual attraction.


(a) Use Eq. 2.52 to express the work done by electrostatic forces, in terms of the field E, and the
area of the plates, A.

(b) Use Eq. 2.46 to express the energy lost by the field in this process.


(This problem is supposed to be easy, but it contains the embryo of an alternative derivation of 
Eq. 2.52, using conservation of energy.) 


More Problems on Chapter 2 


Problem 2.46. Find the electric field at a height z above the center of a square sheet (side
length a) carrying a uniform surface charge σ. Check your result for the limiting cases a → ∞
and z ≫ a. [Answer: (σ/2ε₀){(4/π) tan⁻¹ √(1 + (a²/2z²)) − 1}.]


Problem 2.47. If the electric field in some region is given (in spherical coordinates) by the 
expression 


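\[ \mathbf{E}(\mathbf{r}) = \frac{k}{r}\left[3\,\hat{\mathbf{r}} + 2\sin\theta\cos\theta\sin\phi\,\hat{\boldsymbol{\theta}} + \sin\theta\cos\phi\,\hat{\boldsymbol{\phi}}\right], \]

for some constant k, what is the charge density? [Answer: 3kε₀(1 + cos 2θ sin φ)/r².]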


Problem 2.48. Find the net force that the southern hemisphere of a uniformly charged solid 
sphere exerts on the northern hemisphere. Express your answer in terms of the radius R and 
the total charge Q. [Answer: (1/4πε₀)(3Q²/16R²).]


Problem 2.49. An inverted hemispherical bowl of radius R carries a uniform surface charge
density σ. Find the potential difference between the “north pole” and the center. [Answer:
(Rσ/2ε₀)(√2 − 1).]


Problem 2.50. A sphere of radius R carries a charge density ρ(r) = kr (where k is a
constant). Find the energy of the configuration. Check your answer by calculating it in at least
two different ways. [Answer: πk²R⁷/7ε₀.]


Problem 2.51. The electric potential of some configuration is given by the expression

\[ V(\mathbf{r}) = A\,\frac{e^{-\lambda r}}{r}, \]

where A and λ are constants. Find the electric field E(r), the charge density ρ(r), and the
total charge Q. [Answer: ρ = ε₀A(4πδ³(r) − λ²e^{−λr}/r).]


Problem 2.52. Find the potential on the rim of a uniformly charged disk (radius R, charge
density σ). [Hint: First show that V = k(σR/πε₀), for some dimensionless number k,
which you can express as an integral. Then evaluate k analytically, if you can, or by
computer.]


Problem 2.53. Two infinitely long wires running parallel to the x-axis carry uniform charge 
densities +λ and −λ (Fig. 2.54).

[Fig. 2.54]

(a) Find the potential at any point (x, y, z), using the origin as your reference.
(b) Show that the equipotential surfaces are circular cylinders, and locate the axis and radius
of the cylinder corresponding to a given potential V₀.


Problem 2.54. In a vacuum diode, electrons “boil” off a hot cathode, at potential zero, and
accelerate across a gap to the anode, which is held at positive potential V₀. The cloud of moving
electrons within the gap (called space charge) quickly builds up to the point where it reduces
the field at the surface of the cathode to zero. From then on, a steady current I flows between
the plates.

Suppose the plates are large relative to the separation (A ≫ d² in Fig. 2.55), so that edge
effects can be neglected. Then V, ρ, and v (the speed of the electrons) are all functions of x
alone.

[Fig. 2.55: vacuum diode, showing the cathode (V = 0), the anode (V₀), and an electron in the gap]

(a) Write Poisson's equation for the region between the plates.

(b) Assuming the electrons start from rest at the cathode, what is their speed at point x, where
the potential is V(x)?

(c) In the steady state, I is independent of x. What, then, is the relation between ρ and v?

(d) Use these three results to obtain a differential equation for V, by eliminating ρ and v.

(e) Solve this equation for V as a function of x, V₀, and d. Plot V(x), and compare it to the
potential without space charge. Also, find ρ and v as functions of x.

(f) Show that

\[ I = K V_0^{3/2}, \tag{2.56} \]

and find the constant K. (Equation 2.56 is called the Child–Langmuir law. It holds for
other geometries as well, whenever space charge limits the current. Notice that the space- 
charge-limited diode is nonlinear — it does not obey Ohm’s law.) 


Problem 2.55. Imagine that new and extraordinarily precise measurements have revealed an
error in Coulomb's law. The actual force of interaction between two point charges is found to be

\[ \mathbf{F} = \frac{1}{4\pi\epsilon_0}\frac{q_1 q_2}{𝓇^2}\left(1 + \frac{𝓇}{\lambda}\right)e^{-𝓇/\lambda}\,\hat{\boldsymbol{𝓇}}, \]

where λ is a new constant of nature (it has dimensions of length, obviously, and is a huge
number, say half the radius of the known Universe, so the correction is small, which is why
no one ever noticed the discrepancy before). You are charged with the task of reformulating
electrostatics to accommodate the new discovery. Assume the principle of superposition still
holds.


(a) What is the electric field of a charge distribution ρ (replacing Eq. 2.8)?

(b) Does this electric field admit a scalar potential? Explain briefly how you reached your
conclusion. (No formal proof necessary, just a persuasive argument.)

(c) Find the potential of a point charge q (the analog to Eq. 2.26). (If your answer to (b) was
“no,” better go back and change it!) Use ∞ as your reference point.

(d) For a point charge q at the origin, show that

\[ \oint_{S} \mathbf{E}\cdot d\mathbf{a} + \frac{1}{\lambda^2}\int_{\mathcal{V}} V\,d\tau = \frac{1}{\epsilon_0}\,q, \]

where S is the surface and 𝒱 the volume of any sphere centered at q.

(e) Show that this result generalizes:

\[ \oint_{S} \mathbf{E}\cdot d\mathbf{a} + \frac{1}{\lambda^2}\int_{\mathcal{V}} V\,d\tau = \frac{1}{\epsilon_0}\,Q_{\text{enc}}, \]

for any charge distribution. (This is the next best thing to Gauss’s law, in the new
“electrostatics.”)

(f) Draw the triangle diagram (like Fig. 2.35) for this world, putting in all the appropriate
formulas. (Think of Poisson's equation as the formula for ρ in terms of V, and Gauss’s law
(differential form) as an equation for ρ in terms of E.)

(g) Show that some of the charge on a conductor distributes itself (uniformly!) over the
volume, with the remainder on the surface. [Hint: E is still zero, inside a conductor.]


Problem 2.56. Suppose an electric field E(x, y, z) has the form 
\[ E_x = ax, \quad E_y = 0, \quad E_z = 0, \]

where a is a constant. What is the charge density? How do you account for the fact that the
field points in a particular direction, when the charge density is uniform? [This is a more subtle
problem than it looks, and worthy of careful thought.]

Problem 2.57. All of electrostatics follows from the 1/r² character of Coulomb's law, together
with the principle of superposition. An analogous theory can therefore be constructed for
Newton's law of universal gravitation. What is the gravitational energy of a sphere, of mass M
and radius R, assuming the density is uniform? Use your result to estimate the gravitational
energy of the Sun (look up the relevant numbers). Note that the energy is negative: masses
attract, whereas (like) electric charges repel. As matter “falls in,” to create the Sun, its kinetic
energy is converted into other forms (typically thermal), and is eventually released as radiation.
The Sun radiates at a rate of 3.86 × 10²⁶ W; if all this came from gravitational energy, how
long would the Sun last? [The Sun is in fact much older than that, so evidently this is not the
source of its energy.¹⁸]


Problem 2.58. We know that the charge on a conductor goes to the surface, but just how it 
distributes itself there is not easy to determine. One famous example in which the surface charge 
density can be calculated explicitly is the ellipsoid: 


\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1. \]

In this case,¹⁹

\[ \sigma = \frac{Q}{4\pi abc}\left(\frac{x^2}{a^4} + \frac{y^2}{b^4} + \frac{z^2}{c^4}\right)^{-1/2}, \tag{2.57} \]

where Q is the total charge. By choosing appropriate values for a, b, and c, obtain (from
Eq. 2.57) (a) the net (both sides) surface charge density σ(r) on a circular disk of radius R;
(b) the net surface charge density σ(x) on an infinite conducting “ribbon” in the xy plane,
which straddles the y-axis from x = −a to x = a (let Λ be the total charge per unit length
of ribbon); (c) the net charge per unit length λ(x) on a conducting “needle,” running from
x = −a to x = a. In each case, sketch the graph of your result.


Problem 2.59. 


(a) Consider an equilateral triangle, inscribed in a circle of radius a, with a point charge q at 
each vertex. The electric field is zero (obviously) at the center, but (surprisingly) there are 
three other points inside the triangle where the field is zero. Where are they? [Answer: r =
0.285a; you'll probably need a computer to get it.]

(b) For a regular n-sided polygon there are n points (in addition to the center) where the field
is zero.²⁰ Find their distance from the center for n = 4 and n = 5. What do you suppose
happens as n → ∞?


18 Lord Kelvin used this argument to counter Darwin’s theory of evolution, which called for
a much older Earth. Of course, we now know that the source of the Sun’s energy is nuclear
fusion, not gravity.

19 For the derivation (which is a real tour de force), see W. R. Smythe, Static and Dynamic
Electricity, 3rd ed., Hemisphere, New York (1989), Sect. 5.02.

20 S. D. Baker, Am. J. Phys. 52, 265 (1984); D. Kiang and D. A. Tindall, Am. J. Phys. 53, 593
(1985).


Problem 2.60. Prove or disprove (with a counterexample) the following:


Theorem. Suppose a conductor carrying a net charge Q, when placed in an external
electric field E_e, experiences a force F; if the external field is now reversed (E_e → −E_e),
the force also reverses (F → −F).


What if we stipulate that the external field is uniform? 


Problem 2.61. A point charge q is at the center of an uncharged spherical conducting shell, of 
inner radius a and outer radius b. Question: How much work would it take to move the charge 
out to infinity (through a tiny hole drilled in the shell)? [Answer: (q²/8πε₀)(1/a − 1/b).]


Problem 2.62. What is the minimum-energy configuration for a system of N equal point
charges placed on or inside a circle of radius R?²¹ Because the charge on a conductor goes
to the surface, you might think the N charges would arrange themselves (uniformly) around
the circumference. Show (to the contrary) that for N = 12 it is better to place 11 on the
circumference and 1 at the center. How about for N = 11 (is the energy lower if you put all
11 around the circumference, or if you put 10 on the circumference and 1 at the center)? [Hint:
Do it numerically; you'll need at least 4 significant digits. Express all energies as multiples of
q²/4πε₀R.]


Problem 2.63. 


(a) Find the potential on the rim of a uniformly charged circular disk of radius R and charge
density σ. [Hint: Chop the disk into circular arcs of radius 𝓇 and width d𝓇, running from
−θ to +θ, and integrate over θ.]

(b) Use your result in (a) to calculate the energy of a uniformly charged disk of radius R and
total charge Q.

(c) A circular conducting disk of radius R carries a total charge Q. In Prob. 2.58a you showed
that the charge density is

\[ \sigma(r) = \frac{Q}{2\pi R\sqrt{R^2 - r^2}}. \]

Find the potential of the disk.

(d) Use your result in (c) to calculate the energy of a conducting disk of radius R and total charge
Q. Compare your answer to (b); which is smaller, and why?


21 M. G. Calkin, D. Kiang, and D. A. Tindall, Am. J. Phys. 55, 157 (1987).
22 For applications of these results see T. M. Minteer, Eur. J. Phys. 35, 035022 (2014).


Problem 2.64. Many programming languages include packages for drawing field diagrams
(like Fig. 2.12a). You input E(x, y, z), and the program generates a figure showing arrows
representing the field at an array of representative points. In Mathematica it’s called
VectorPlot. Suppose, for example, you want to sketch the field of a point charge q at the
origin; in the xy plane the field is

\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{q}{r^2}\,\hat{\mathbf{r}} = \frac{q}{4\pi\epsilon_0}\frac{\mathbf{r}}{r^3} = \frac{q}{4\pi\epsilon_0}\,\frac{x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}}}{(x^2 + y^2)^{3/2}}, \]

so, setting q/4πε₀ → 1 and using the Mathematica syntax for defining a function, you
type in

EF[{x_, y_}] := {x, y}/(x^2 + y^2)^(3/2)

and invoke

VectorPlot[EF[{x, y}], {x, -1, 1}, {y, -1, 1}]

(the terms in curly brackets specify the region of interest: −1 < x < 1 and −1 < y < 1).


(a) Try it. If you want to change the number of arrows drawn (say, to 10 per row/column), type

VectorPlot[EF[{x, y}], {x, -1, 1}, {y, -1, 1}, VectorPoints->10]

Notice that in Mathematica the strength of the field is represented by the color of the
arrows; if you prefer to indicate it by their length, insert (after a comma, as always)
VectorScaling->Automatic; to turn off the color coding altogether, also
insert VectorColorFunction->None. If you want to draw in the x- and
y-axes, insert Axes->True. Construct field diagrams for two charges, equal (as in
Fig. 2.14), and opposite (as in Fig. 2.13).

Mathematica can also draw the full three-dimensional field pattern, using

VectorPlot3D[EF[{x, y, z}], {x, -1, 1}, {y, -1, 1}, {z, -1, 1},
VectorScaling->Automatic]

Try it, for the field of a point charge at the origin. The result is rather congested; I prefer to
insert VectorMarkers->Arrow, which makes it a little easier to read.


Problem 2.65. In Prob. 2.64 you drew field vectors at representative points; suppose instead
you want to construct a field line passing through the point (x₀, y₀). In Mathematica you can
do it using StreamPlot:

StreamPlot[EF[{x, y}], {x, -1, 1}, {y, -1, 1}, StreamPoints->{{x0, y0}},
StreamScale->Full, Axes->True]


(a) For a point charge at the origin, draw the field line passing through (0.5, 0.5).

(b) Draw a typical field line for the configurations in Figs. 2.13 and 2.14.

(c) If you change the argument of StreamPoints to a number (for instance, writing
StreamPoints->20), StreamPlot will draw 20 representative field lines.
Try it for the configurations in part (b).


Problem 2.66. In Prob. 2.65 we used StreamPlot to draw electric field lines in two
dimensions; it’s quick and easy, but it doesn’t tell you what the computer is actually doing. The
basic idea is very simple: to get from one point on a field line (r(t)) to the next (r(t + dt)),
you take a short step in the direction of the field:

\[ \mathbf{r}(t + dt) = \mathbf{r}(t) + \mathbf{E}(t)\,dt. \]

Here t is any convenient parameter (perhaps arc length) that increases monotonically along
the curve. In the limit of infinitesimal step size,

\[ \frac{d\mathbf{r}}{dt} = \mathbf{E}. \]

This is an ordinary first-order differential equation for r(t), and Mathematica can solve it
(numerically) using NDSolve. First specify your electric field (EF), as in Prob. 2.64. Next,
instruct the computer to calculate the coordinates (x(t), y(t), z(t)) of the field line:

FieldLine = NDSolve[
 {
  x'[t] == EF[{x[t], y[t], z[t]}][[1]],
  y'[t] == EF[{x[t], y[t], z[t]}][[2]],
  z'[t] == EF[{x[t], y[t], z[t]}][[3]],
  x[0] == x0, y[0] == y0, z[0] == z0}, {x, y, z}, {t, 0, tmax}];

In the last line you will insert appropriate numbers for the starting point (x0, y0, z0), and
choose how far out (tmax) you want to go. Finally, to draw the field line, type 


ParametricPlot3D[{x[t],y[t],z[t]}/.FieldLine, {t,0,tmax}] 


Beware: If the field line wanders too close to a point charge, where the field is huge and rapidly 
changing, the numerical solution may be unstable, and you'll either get an error message or the 
plot will develop weird kinks. 
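
The stepping rule r(t + dt) = r(t) + E dt can also be coded by hand, which makes it plain what NDSolve is doing for you. What follows is a minimal Python sketch rather than the Mathematica the problem is keyed to. It takes t to be arc length (each step has fixed length along the unit vector E/|E|), and the names trace_field_line and point_charge_field, along with the step size, are assumptions made for illustration.

```python
import math


def point_charge_field(r, source=(0.0, 0.0, 0.0), strength=1.0):
    """Field at r of a point charge at `source`, in units where q/(4 pi eps0) = strength."""
    d = [a - b for a, b in zip(r, source)]
    dist = math.sqrt(sum(c * c for c in d))
    return [strength * c / dist**3 for c in d]


def trace_field_line(field, start, step=0.01, n_steps=100):
    """Follow a field line from `start` by repeated short steps along E/|E|."""
    points = [tuple(start)]
    r = list(start)
    for _ in range(n_steps):
        e = field(r)
        mag = math.sqrt(sum(c * c for c in e))
        if mag == 0.0:          # no direction to follow at a null point
            break
        r = [ri + step * ei / mag for ri, ei in zip(r, e)]
        points.append(tuple(r))
    return points


# Field line of a single positive charge at the origin, starting from (0.5, 0.5, 0).
line = trace_field_line(point_charge_field, (0.5, 0.5, 0.0))
print(line[0], line[-1])
```

For a single point charge the line should run straight out along the diagonal. With several charges you would pass in a function that adds up their fields, and a smaller step near each charge helps avoid the kinks mentioned above.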


(a) To make sure the code is working, reproduce the figures from Prob. 2.65a and b (but now in
three dimensions). You might start out with tmax = 10, but be prepared to adjust it
as necessary.

(b) The field lines in part (a) all lie in a plane. Construct an electric field line that does not lie in 
a plane, using a configuration of three point charges. 


Problem 2.67. A point charge q of mass m moves in an electric field E(x, y, z). It starts
out at x₀ = (x₀, y₀, z₀), with velocity v₀ = (v_x0, v_y0, v_z0), and you want to determine its
trajectory. The following Mathematica code will do the job:


Etraj[x0_, v0_, q_, m_, dt_, n_, EF_] :=
Module[{X, V, x, v, xprev, j},
 X = Table[0., {j, 1, n}];
 X[[1]] = x0;
 V = Table[0., {j, 1, n}];
 V[[1]] = v0;
 x = x0;
 v = v0;
 For[j = 2, j <= n, j++,
  xprev = x;
  x = x + v dt + 1/(2m) q EF[x] dt^2;
  v = v + 1/(2m) (q EF[xprev] + q EF[x]) dt;
  X[[j]] = x;
  V[[j]] = v;
 ];
 Return[{X, V}];
]
ShowTrajectory[pts_] :=
 Show[Graphics3D[Table[Line[{pts[[jj]], pts[[jj+1]]}],
  {jj, 1, Length[pts]-1}]]]
(a) Etraj takes as input the vectors x₀ and v₀, the values q, m, dt, n, and the electric field
EF; it returns two lists: X (the position of the charge) and V (its velocity), at a succession
of n times, separated by small intervals dt. The crucial steps are

x = x + v dt + 1/(2m) q EF[x] dt^2;
v = v + 1/(2m) (q EF[xprev] + q EF[x]) dt;

Explain what they do. To generate the plot, you type

{X1, V1} = Etraj[x0, v0, q, m, dt, n, EF];
ShowTrajectory[X1]


(b) As an example, suppose the field is produced by a (negative) point charge Q at the origin.
This is mathematically identical to the classical Kepler problem; the trajectories are ellipses
(if the total energy, kinetic plus potential, is negative), hyperbolas (if the energy is
positive), and parabolas (if it is zero). Generate an example of each case. (You might as well
set q = 1, m = 1, and Q/4πε₀ = −1.)

(c) For something a little more exotic, try two (equal, negative) source charges, at
(±0.5, 0, 0); start q at the origin, with initial velocity v = (1.02, 1.10, 0), and set
dt = 1/20,000, n = 32,000.
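
For readers working outside Mathematica, the two crucial update steps translate almost line for line. The Python sketch below is one possible translation, not the book's code: the names etraj and kepler_field, the step size, and the circular-orbit initial conditions are all assumptions. The position is advanced using the current velocity and field, and the velocity is then advanced using the average of the old and new fields.

```python
import math


def etraj(x0, v0, q, m, dt, n, field):
    """Positions and velocities of a charge q (mass m) at n times separated by dt."""
    xs, vs = [tuple(x0)], [tuple(v0)]
    x, v = list(x0), list(v0)
    for _ in range(1, n):
        e_prev = field(x)                        # field at the old position
        x = [xi + vi * dt + q * ei * dt * dt / (2.0 * m)
             for xi, vi, ei in zip(x, v, e_prev)]
        e_new = field(x)                         # field at the new position
        v = [vi + q * (ep + en) * dt / (2.0 * m)
             for vi, ep, en in zip(v, e_prev, e_new)]
        xs.append(tuple(x))
        vs.append(tuple(v))
    return xs, vs


def kepler_field(r, strength=-1.0):
    """Field of a source charge at the origin, with Q/(4 pi eps0) = strength."""
    dist = math.sqrt(sum(c * c for c in r))
    return [strength * c / dist**3 for c in r]


# Assumed test orbit: q = m = 1, starting at unit distance with unit tangential speed,
# which should give a circle of radius 1 (total energy -1/2).
positions, velocities = etraj((1.0, 0.0, 0.0), (0.0, 1.0, 0.0),
                              1.0, 1.0, 0.005, 2000, kepler_field)
print(positions[-1], velocities[-1])
```

A circular orbit makes a convenient sanity check, since the distance from the origin and the total energy should both stay very nearly constant. The ellipses, hyperbolas, and parabolas of part (b) follow by changing the initial speed.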


Potentials 


3.1 Laplace’s Equation 


3.1.1 Introduction 


The primary task of electrostatics is to find the electric field of a given 
stationary charge distribution. In principle, this purpose is accomplished 
by Coulomb’s law, in the form of Eq. 2.8: 


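\[ \mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\hat{\boldsymbol{𝓇}}}{𝓇^2}\,\rho(\mathbf{r}')\,d\tau'. \tag{3.1} \]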


Unfortunately, integrals of this type can be difficult to calculate for any but 
the simplest charge configurations. Occasionally we can get around this by 
exploiting symmetry and using Gauss’s law, but ordinarily the best strategy 
is first to calculate the potential, V, which is given by the somewhat more 
tractable Eq. 2.29: 


\[ V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{1}{𝓇}\,\rho(\mathbf{r}')\,d\tau'. \tag{3.2} \]

Still, even this integral is often too tough to handle analytically. Moreover,
in problems involving conductors, ρ itself may not be known in advance;
since charge is free to move around, the only thing we control directly is the
total charge (or perhaps the potential) of each conductor.

In such cases, it is fruitful to recast the problem in differential form, using
Poisson’s equation (2.24),

\[ \nabla^2 V = -\frac{1}{\epsilon_0}\,\rho, \tag{3.3} \]

which, together with appropriate boundary conditions, is equivalent to
Eq. 3.2. Very often, in fact, we are interested in finding the potential in a
region where ρ = 0. (If ρ = 0 everywhere, of course, then V = 0, and there
is nothing further to say; that’s not what I mean. There may be plenty of
charge elsewhere, but we’re confining our attention to places where there is
no charge.) In this case, Poisson’s equation reduces to Laplace’s equation:

\[ \nabla^2 V = 0, \tag{3.4} \]

or, written out in Cartesian coordinates,

\[ \frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} + \frac{\partial^2 V}{\partial z^2} = 0. \tag{3.5} \]

This formula is so fundamental to the subject that one might almost
say electrostatics is the study of Laplace’s equation. At the same time, it 
is a ubiquitous equation, appearing in such diverse branches of physics 
as gravitation and magnetism, the theory of heat, and the study of soap 
bubbles. In mathematics, it plays a major role in analytic function theory. 
To get a feel for Laplace’s equation and its solutions (which are called 
harmonic functions), we shall begin with the one- and two-dimensional 
versions, which are easier to picture, and illustrate all the essential properties 
of the three-dimensional case. 


3.1.2 Laplace's Equation in One Dimension 


Suppose V depends on only one variable, x. Then Laplace’s equation 
becomes 

\[ \frac{d^2 V}{dx^2} = 0. \]


The general solution is 
V(x)=mx+b, (3.6) 


the equation for a straight line. It contains two undetermined constants (m 
and b), as is appropriate for a second-order (ordinary) differential equation. 
They are determined, in any particular case, by the boundary conditions of 
that problem. For instance, it might be specified that V = 4 at x = 1, and 
V = 0 at x = 5. In that case, m = −1 and b = 5, so V = −x + 5 (Fig. 3.1).
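
Fixing m and b from two boundary values takes only a couple of lines of arithmetic. The Python sketch below reproduces the example in the text; the function names are illustrative assumptions. It also checks that the value at a point equals the mean of the values at equal distances on either side.

```python
def solve_laplace_1d(x1, v1, x2, v2):
    """Return (m, b) for V(x) = m x + b through V(x1) = v1 and V(x2) = v2."""
    m = (v2 - v1) / (x2 - x1)
    b = v1 - m * x1
    return m, b


m, b = solve_laplace_1d(1.0, 4.0, 5.0, 0.0)   # boundary values from the text


def potential(x):
    """The straight-line solution V(x) = m x + b."""
    return m * x + b


print(m, b)   # slope and intercept of the line
print(potential(3.0), 0.5 * (potential(2.2) + potential(3.8)))   # value vs. two-sided average
```

Specifying the value at each end is only one of several workable choices of boundary data. Any two independent conditions would determine the two constants equally well.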

I want to call your attention to two features of this result; they may seem 
silly and obvious in one dimension, where I can write down the general 
solution explicitly, but the analogs in two and three dimensions are powerful 
and by no means obvious: 


(1) V(x) is the average of V(x + a) and V(x − a), for any a:

\[ V(x) = \frac{1}{2}\left[V(x + a) + V(x - a)\right]. \]


Laplace’s equation is a kind of averaging instruction; it tells you to 
assign to the point x the average of the values to the left and to the 
right of x. Solutions to Laplace’s equation are, in this sense, as boring 
as they could possibly be, and yet fit the end points properly. 


(2) Laplace’s equation tolerates no local maxima or minima; extreme values
of V must occur at the end points. Actually, this is a consequence 
of item (1), for if there were a local maximum, V would be greater 
at that point than on either side, and therefore could not be the 
average. (Ordinarily, you expect the second derivative to be negative 
at a maximum and positive at a minimum. Since Laplace’s equation 
requires, on the contrary, that the second derivative is zero, it seems 
reasonable that solutions should exhibit no extrema. However, this is 
not a proof, since there exist functions that have maxima and minima 
at points where the second derivative vanishes: x⁴, for example, at the
point x = 0.) 


3.1.3 Laplace’s Equation in Two Dimensions 


If V depends on two variables, Laplace’s equation becomes 
\[ \frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} = 0. \]
This is no longer an ordinary differential equation (that is, one involving 
ordinary derivatives only); it’s a partial differential equation. As a conse- 
quence, some of the simple rules you may be familiar with do not apply. 
For instance, the general solution to this equation doesn’t contain just two 
arbitrary constants — or, for that matter, any finite number — despite the 
fact that it’s a second-order equation. Indeed, one cannot write down a 
“general solution” (at least, not in a closed form like Eq. 3.6). Nevertheless, 
it is possible to deduce certain properties common to all solutions. 

It may help to have a physical example in mind. Picture a thin rubber 
sheet (or a soap film) stretched over some support. For definiteness, suppose 
you take a cardboard box, cut a wavy line all the way around, and remove 
the top part (Fig. 3.2). Now glue a tightly stretched rubber membrane over 
the box, so that it fits like a drum head (it won’t be a flat drumhead, of 
course, unless you chose to cut the edges off straight). Now, if you lay 
out coordinates (x,y) on the bottom of the box, the height V(x, y) of 
the sheet above the point (x, y) will satisfy Laplace’s equation.¹ (The one-dimensional
analog would be a rubber band stretched between two points.
Of course, it would form a straight line.)

1 Actually, the equation satisfied by a rubber sheet is

\[ \frac{\partial}{\partial x}\left(g\,\frac{\partial V}{\partial x}\right) + \frac{\partial}{\partial y}\left(g\,\frac{\partial V}{\partial y}\right) = 0, \quad\text{where}\quad g = \left[1 + \left(\frac{\partial V}{\partial x}\right)^2 + \left(\frac{\partial V}{\partial y}\right)^2\right]^{-1/2}; \]

it reduces (approximately) to Laplace’s equation as long as the surface does not deviate too
radically from a plane.

Harmonic functions in two dimensions have the same properties we
noted in one dimension:


(1) The value of V at a point (x, y) is the average of those around the
point. More precisely, if you draw a circle of any radius R about the
point (x, y), the average value of V on the circle is equal to the value at
the center:

\[ V(x, y) = \frac{1}{2\pi R}\oint_{\text{circle}} V\,dl. \]

(This, incidentally, suggests the method of relaxation, on which many 
computer solutions to Laplace’s equation are based: Starting with 
specified values for V at the boundary, and reasonable guesses for V on 
a grid of interior points, the first pass reassigns to each point the average 
of its nearest neighbors. The second pass repeats the process, using the 
corrected values, and so on. After a few iterations, the numbers begin to 
settle down, so that subsequent passes produce negligible changes, and 
a numerical solution to Laplace’s equation, with the given boundary 
values, has been achieved.²)

(2) V has no local maxima or minima; all extrema occur at the boundaries.
(As before, this follows from property 1.) Again, Laplace’s equation 
picks the most featureless function possible, consistent with the bound- 
ary conditions: no hills, no valleys, just the smoothest conceivable 
surface. For instance, if you put a ping-pong ball on the stretched 
rubber sheet of Fig. 3.2, it will roll over to one side and fall off — it 
will not find a “pocket” somewhere to settle into, for Laplace’s equation 
allows no such dents in the surface. From a geometrical point of view, 
just as a straight line is the shortest distance between two points, so 
a harmonic function in two dimensions minimizes the surface area 
spanning the given boundary line. 
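
The method of relaxation mentioned under property (1) fits in a dozen lines. Here is a minimal Python sketch on a square grid, assuming "nearest neighbors" means the four adjacent grid points and that each pass uses only the values from the previous pass. The function name, the tolerance, and the test boundary values are illustrative assumptions.

```python
def relax(grid, tol=1e-8, max_sweeps=10_000):
    """Relax the interior of a rectangular grid toward a solution of Laplace's equation.

    The outer rows and columns of `grid` hold the fixed boundary values; each
    sweep replaces every interior value by the average of its four nearest
    neighbors. Returns the relaxed grid and the number of sweeps used.
    """
    rows, cols = len(grid), len(grid[0])
    for sweep in range(1, max_sweeps + 1):
        new = [row[:] for row in grid]
        largest_change = 0.0
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                new[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                    + grid[i][j - 1] + grid[i][j + 1])
                largest_change = max(largest_change, abs(new[i][j] - grid[i][j]))
        grid = new
        if largest_change < tol:      # the numbers have settled down
            return grid, sweep
    return grid, max_sweeps


# Assumed test case: V equals the column index on the boundary, with a guess of zero inside.
n = 6
start = [[float(j) if i in (0, n - 1) or j in (0, n - 1) else 0.0
          for j in range(n)] for i in range(n)]
solution, sweeps = relax(start)
print(sweeps, [round(v, 4) for v in solution[2]])
```

With these boundary values the exact answer is the tilted plane V = x, so every interior entry should relax to its column index. Nothing inside rises above the largest boundary value or falls below the smallest, as property (2) requires.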


3.1.4 Laplace's Equation in Three Dimensions 


In three dimensions I can neither provide you with an explicit solution (as 
in one dimension) nor offer a suggestive physical example to guide your 
intuition (as I did in two dimensions). Nevertheless, the same two properties 
remain true, and this time I will sketch a proof.³


2 See Prob. 3.65.

3 For a proof that does not rely on Coulomb’s law (only on Laplace’s equation), see Prob. 3.39.


(1) The value of V at point r is the average value of V over a spherical
surface of radius R centered at r:

\[ V(\mathbf{r}) = \frac{1}{4\pi R^2}\oint_{\text{sphere}} V\,da. \]


(2) As a consequence, V can have no local maxima or minima; the extreme 
values of V must occur at the boundaries. (For if V had a local 
maximum at r, then by the very nature of maximum I could draw a 
sphere around r over which all values of V — and a fortiori the average — 
would be less than at r.) 


Proof. Let’s begin by calculating the average potential over a spherical
surface of radius R due to a single point charge q located outside
the sphere. We might as well center the sphere at the origin and choose
coordinates so that q lies on the z-axis (Fig. 3.3). The potential at a
point on the surface is

\[ V = \frac{1}{4\pi\epsilon_0}\frac{q}{𝓇}, \]

where

\[ 𝓇^2 = z^2 + R^2 - 2zR\cos\theta \]

(by the law of cosines), so

\[ V_{\text{ave}} = \frac{1}{4\pi R^2}\frac{q}{4\pi\epsilon_0}\int \left[z^2 + R^2 - 2zR\cos\theta\right]^{-1/2} R^2\sin\theta\,d\theta\,d\phi \]

\[ \phantom{V_{\text{ave}}} = \frac{q}{4\pi\epsilon_0}\frac{1}{2zR}\sqrt{z^2 + R^2 - 2zR\cos\theta}\,\Big|_0^{\pi} \]

\[ \phantom{V_{\text{ave}}} = \frac{q}{4\pi\epsilon_0}\frac{1}{2zR}\left[(z + R) - (z - R)\right] = \frac{1}{4\pi\epsilon_0}\frac{q}{z}. \]

But this is precisely the potential due to q at the center of the sphere!
By the superposition principle, the same goes for any collection of
charges outside the sphere: their average potential over the sphere is
equal to the net potential they produce at the center. □
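
The result of the proof is easy to test numerically, even for a charge that does not sit on the z-axis. The Python sketch below averages 1/𝓇 over a sphere centered at the origin by brute force and compares it with the potential at the center. The function name, the grid resolution, and the charge position are assumptions, and units are chosen so that q/4πε₀ = 1.

```python
import math


def average_potential(charge_pos, radius, n_theta=200, n_phi=200):
    """Average of 1/distance over a sphere of the given radius centered at the origin.

    The surface integral is approximated by the midpoint rule in the angles
    theta and phi, with the usual area weight sin(theta).
    """
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            point = (radius * math.sin(theta) * math.cos(phi),
                     radius * math.sin(theta) * math.sin(phi),
                     radius * math.cos(theta))
            total += math.sin(theta) * d_theta * d_phi / math.dist(point, charge_pos)
    return total / (4.0 * math.pi)    # divide by the total solid angle


charge = (1.5, 1.0, 2.0)              # assumed position, outside the unit sphere
print(average_potential(charge, 1.0), 1.0 / math.dist(charge, (0.0, 0.0, 0.0)))
```

The two printed numbers should agree to several decimal places. Moving the charge inside the sphere breaks the agreement, which is the subject of Prob. 3.1.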
Problem 3.1. Find the average potential over a spherical surface of radius R due to a point
charge q located inside (same as above, in other words, only with z < R). (In this case, of course,
Laplace's equation does not hold within the sphere.) Show that, in general,

\[ V_{\text{ave}} = V_{\text{center}} + \frac{Q_{\text{enc}}}{4\pi\epsilon_0 R}, \]

where V_center is the potential at the center due to all the external charges, and Q_enc is the
total enclosed charge.


Problem 3.2. In one sentence, justify Earnshaw’s theorem: A charged particle cannot be held 
in stable equilibrium (in otherwise empty space) by electrostatic forces alone. As an example, 
consider the cubical arrangement of eight fixed charges in Fig. 3.4. It looks, off hand, as though 
a positive charge at the center would be suspended in midair, since it is repelled away from 
each corner. Where is the leak in this “electrostatic bottle”? [To harness nuclear fusion as a
practical energy source it is necessary to heat a plasma (soup of charged particles) to fantastic 
temperatures — so hot that contact would vaporize any ordinary pot. Earnshaw’s theorem says 
that electrostatic containment is also out of the question. Fortunately, it is possible to confine a 
hot plasma magnetically.] 


Problem 3.3. Find the general solution to Laplace's equation in spherical coordinates, for the 
case where V depends only on r. Do the same for cylindrical coordinates, assuming V depends 
only on s. 


Problem 3.4. 


(a) Show that the average electric field over a spherical surface, due to charges outside the 
sphere, is the same as the field at the center. 
(b) What is the average due to charges inside the sphere?


3.1.5 Boundary Conditions and Uniqueness Theorems 


Laplace’s equation does not by itself determine V; in addition, suitable 
boundary conditions must be imposed. This raises a delicate question: 
What are appropriate boundary conditions, sufficient to determine the 
answer and yet not so strong as to generate inconsistencies? The one-dimensional
case is easy, for here the general solution V = mx + b contains
two arbitrary constants, and we therefore require two boundary conditions.
We might, for instance, specify the value of the function at each end, or 
we might give the value of the function and its derivative at one end, or 
the value at one end and the derivative at the other, and so on. But we 
cannot get away with just the value or just the derivative at one end — this is 
insufficient information. Nor would it do to specify the derivatives at both 
ends — this would either be redundant (if the two are equal) or inconsistent 
(if they are not). 

In two or three dimensions we are confronted by a partial differen- 
tial equation, and it is not so obvious what would constitute acceptable 
boundary conditions. Is the shape of a taut rubber membrane, for instance, 
uniquely determined by the frame over which it is stretched, or, like a 
canning jar lid, can it snap from one stable configuration to another? 
The answer, as I think your intuition would suggest, is that V is uniquely 
determined by its value at the boundary (canning jar lids evidently do 
not obey Laplace’s equation). However, other boundary conditions can 
also be used (see Prob. 3.5). The proof that a proposed set of boundary 
conditions will suffice is usually presented in the form of a uniqueness 
theorem. There are many such theorems for electrostatics, all sharing the 
same basic format; I’ll show you the two most useful ones.⁴


First Uniqueness Theorem. The solution to Laplace’s equation in some
volume 𝒱 is uniquely determined if V is specified on the boundary
surface 𝒮.⁵


Proof. In Fig. 3.5 I have drawn such a region and its boundary. 
(There could also be “islands” inside, so long as V is given on all 
their surfaces; also, the outer boundary could be at infinity, where 
V is ordinarily taken to be zero.) Suppose there were two solutions to 
Laplace’s equation: 


$$\nabla^2 V_1 = 0 \quad\text{and}\quad \nabla^2 V_2 = 0,$$


[Fig. 3.5: V wanted in this volume (𝒱); V specified on this surface (𝒮).]


4 I do not propose to prove the existence of solutions here — that’s a much more difficult job. 
In context, the existence is generally clear on physical grounds. 

5 I’m not saying it is constant over the surface; it may well vary from point to point, but at
each point it takes on the specified value.

both of which assume the specified value on the surface. I want to
prove that they must be equal. The trick is to look at their difference:

$$V_3 \equiv V_1 - V_2.$$

This obeys Laplace’s equation,

$$\nabla^2 V_3 = \nabla^2 V_1 - \nabla^2 V_2 = 0,$$

and it takes the value zero on all boundaries (since V₁ and V₂ are equal
there). But Laplace’s equation allows no local maxima or minima;
all extrema occur on the boundaries. So the maximum and minimum
of V₃ are both zero. Therefore V₃ must be zero everywhere, and hence

$$V_1 = V_2.$$ □


Example 3.1. Show that the potential is constant inside an enclosure 
completely surrounded by a conductor, provided there is no charge 
within the enclosure. 


Solution. The potential on the cavity wall is some constant, V₀ (that’s
item (iv), in Section 2.5.1), so the potential inside is a function that
satisfies Laplace’s equation and has the constant value V₀ at the
boundary. It doesn’t take a genius to think of one solution to this
problem: V = V₀ everywhere. The uniqueness theorem guarantees that
this is the only solution. (It follows that the field inside an empty cavity 
is zero — the same result we found in Section 2.5.2 on rather different 
grounds.) 


The uniqueness theorem is a license to your imagination. It doesn’t 
matter how you come by your solution; if (a) it satisfies Laplace’s equation 
and (b) it has the correct value on the boundaries, then it’s right. You'll see 
the power of this argument when we come to the method of images. 
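
One way to see uniqueness at work is a crude relaxation calculation. The sketch below is an illustration only: it assumes a small square grid, a Gauss-Seidel update (each interior value replaced by the average of its four neighbors, a discrete stand-in for the averaging property of Section 3.1), and made-up boundary data. Two wildly different starting guesses should settle to the same interior values, and a constant boundary value should give a constant interior, as in Ex. 3.1.

```python
def relax(boundary, guess, n=12, sweeps=500):
    """Gauss-Seidel relaxation for Laplace's equation on an n-by-n grid.

    boundary(i, j) returns the fixed value of V on the edge of the grid;
    guess is the (arbitrary) starting value at every interior point.
    """
    V = [[guess for _ in range(n)] for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i in (0, n - 1) or j in (0, n - 1):
                V[i][j] = boundary(i, j)
    for _ in range(sweeps):
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                # Replace each interior value by the mean of its neighbors.
                V[i][j] = 0.25 * (V[i - 1][j] + V[i + 1][j]
                                  + V[i][j - 1] + V[i][j + 1])
    return V


def interior(V):
    """Flatten the interior (non-boundary) values of the grid."""
    n = len(V)
    return [V[i][j] for i in range(1, n - 1) for j in range(1, n - 1)]


def hot_edge(i, j):
    """Hypothetical boundary data: one edge held at 1, the rest grounded."""
    return 1.0 if i == 0 else 0.0


# Same boundary values, two very different starting guesses.
sol_a = interior(relax(hot_edge, guess=0.0))
sol_b = interior(relax(hot_edge, guess=50.0))
max_difference = max(abs(a - b) for a, b in zip(sol_a, sol_b))

# Ex. 3.1 in miniature: constant boundary value 5 gives V = 5 inside.
cavity = interior(relax(lambda i, j: 5.0, guess=0.0))
print(max_difference, min(cavity), max(cavity))
```

Notice also that every interior value in the first case lies strictly between the extreme boundary values, 0 and 1, in keeping with the absence of local maxima and minima.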

Incidentally, it is easy to improve on the first uniqueness theorem: 
I assumed there was no charge inside the region in question, so the potential 
obeyed Laplace’s equation, but we may as well throw in some charge (in 
which case V obeys Poisson’s equation). The argument is the same, only 
this time 

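$$\nabla^2 V_1 = -\frac{1}{\epsilon_0}\rho, \qquad \nabla^2 V_2 = -\frac{1}{\epsilon_0}\rho,$$

so

$$\nabla^2 V_3 = \nabla^2 V_1 - \nabla^2 V_2 = -\frac{1}{\epsilon_0}\rho + \frac{1}{\epsilon_0}\rho = 0.$$

Once again the difference (V₃ ≡ V₁ − V₂) satisfies Laplace’s equation and
has the value zero on all boundaries, so V₃ = 0 and hence V₁ = V₂.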


Corollary. The potential in a volume 𝒱 is uniquely determined if (a)
the charge density throughout the region, and (b) the value of V on
all boundaries, are specified.


3.1.6 Conductors and the Second Uniqueness Theorem


The simplest way to set the boundary conditions for an electrostatic prob- 
lem is to specify the value of V on all surfaces surrounding the region of 
interest. And this situation often occurs in practice: In the laboratory, we 
have conductors connected to batteries, which maintain a given potential, 
or to ground, which is the experimentalist’s word for V = 0. However, 
there are other circumstances in which we do not know the potential at the 
boundary, but rather the charges on various conducting surfaces. Suppose I 
put charge Q₁ on the first conductor, Q₂ on the second, and so on. I’m not
telling you how the charge distributes itself over each conducting surface,
because as soon as I put it on, it moves around in a way I do not control.
And for good measure, let’s say there is some specified charge density ρ
in the region between the conductors. Is the electric field now uniquely 
determined? Or are there perhaps a number of different ways the charges 
could arrange themselves on their respective conductors, each leading to a 
different field? 


Second Uniqueness Theorem. In a volume 𝒱 surrounded by conductors
and containing a specified charge density ρ, the electric field
is uniquely determined if the total charge on each conductor is
given (Fig. 3.6). (The region as a whole can be bounded by another
conductor, or else unbounded.)


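Proof. Suppose there are two fields satisfying the conditions of the
problem. Both obey Gauss’s law in differential form in the space
between the conductors:

$$\nabla\cdot\mathbf{E}_1 = \frac{1}{\epsilon_0}\rho, \qquad \nabla\cdot\mathbf{E}_2 = \frac{1}{\epsilon_0}\rho.$$

And both obey Gauss’s law in integral form for a Gaussian surface
enclosing each conductor:

$$\oint_{i\text{th conducting surface}} \mathbf{E}_1\cdot d\mathbf{a} = \frac{1}{\epsilon_0} Q_i, \qquad \oint_{i\text{th conducting surface}} \mathbf{E}_2\cdot d\mathbf{a} = \frac{1}{\epsilon_0} Q_i.$$

[Fig. 3.6: integration surfaces; the outer boundary could be at infinity.]

Likewise, for the outer boundary (whether this is just inside an
enclosing conductor or at infinity),

$$\oint_{\text{outer boundary}} \mathbf{E}_1\cdot d\mathbf{a} = \frac{1}{\epsilon_0} Q_{\text{tot}}, \qquad \oint_{\text{outer boundary}} \mathbf{E}_2\cdot d\mathbf{a} = \frac{1}{\epsilon_0} Q_{\text{tot}}.$$

As before, we examine the difference

$$\mathbf{E}_3 \equiv \mathbf{E}_1 - \mathbf{E}_2,$$

which obeys

$$\nabla\cdot\mathbf{E}_3 = 0 \tag{3.7}$$

in the region between the conductors, and

$$\oint \mathbf{E}_3\cdot d\mathbf{a} = 0 \tag{3.8}$$

over each boundary surface.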

Now there is one final piece of information we must exploit:
Although we do not know how the charge Qᵢ distributes itself over the
ith conductor, we do know that each conductor is an equipotential,
and hence V₃ is a constant (not necessarily the same constant) over
each conducting surface. (It need not be zero, for the potentials V₁ and
V₂ may not be equal; all we know for sure is that both are constant
over any given conductor.) Next comes a trick. Invoking product rule
number 5 (inside front cover), we find that

$$\nabla\cdot(V_3\mathbf{E}_3) = V_3(\nabla\cdot\mathbf{E}_3) + \mathbf{E}_3\cdot(\nabla V_3) = -(E_3)^2.$$

Here I have used Eq. 3.7, and E₃ = −∇V₃. Integrating this over 𝒱, and
applying the divergence theorem to the left-hand side,

$$\int_{\mathcal{V}} \nabla\cdot(V_3\mathbf{E}_3)\,d\tau = \oint_{\mathcal{S}} V_3\mathbf{E}_3\cdot d\mathbf{a} = -\int_{\mathcal{V}} (E_3)^2\,d\tau.$$

The surface integral covers all boundaries of the region in question:
the conductors and outer boundary. Now V₃ is a constant over each
surface (if the outer boundary is infinity, V₃ = 0 there), so it comes
outside each integral, and what remains is zero, according to Eq. 3.8.
Therefore,

$$\int_{\mathcal{V}} (E_3)^2\,d\tau = 0.$$

But this integrand is never negative; the only way the integral can
vanish is if E₃ = 0 everywhere. Consequently, E₁ = E₂, and the
theorem is proved. □




This proof was not easy, and there is a real danger that the theorem 
itself will seem more plausible to you than the proof. In case you think 
the second uniqueness theorem is “obvious,” consider this example of 
Purcell’s: Figure 3.7 shows a simple electrostatic configuration, consisting 
of four conductors with charges ±Q, situated so that the plusses are near the
minuses. It all looks very comfortable. Now, what happens if we join them
in pairs, by tiny wires, as indicated in Fig. 3.8? Since the positive charges are
very near negative charges (which is where they like to be) you might well
guess that nothing will happen; the configuration looks stable.

Well, that sounds reasonable, but it’s wrong. The configuration in 
Fig. 3.8 is impossible. For there are now effectively two conductors, and 
the total charge on each is zero. One possible way to distribute zero charge 
over these conductors is to have no accumulation of charge anywhere, and 
hence zero field everywhere (Fig. 3.9). By the second uniqueness theorem, 


this must be the solution: the charge will flow down the tiny wires, canceling 
itself off. 


Problem 3.5. Prove that the field is uniquely determined when the charge density ρ is given
(throughout 𝒱) and either V or its normal derivative ∂V/∂n is specified on each boundary
surface. [Do not assume the boundaries are conductors, or that V is constant over any given
surface. Of course, any stipulated ∂V/∂n must be consistent with Gauss’s law; if, for instance,
∂V/∂n is given on all surfaces, then the charge enclosed by every surface has been stipulated,
and it had better be the case that the outer boundary encloses the sum of all the internal charges;
otherwise no solution exists at all. You won't need this fact to solve the problem, but I mention
it just in case you are wondering whether any old ∂V/∂n defines a possible configuration; it
does not.]


Problem 3.6. A more elegant proof of the second uniqueness theorem uses Green's identity 
(Prob. 1.61c), with T = U = V₃. Supply the details.


3.2 The Method of Images

3.2.1 The Classic Image Problem 


Suppose a point charge q is held a distance d above an infinite grounded 
conducting plane (Fig. 3.10). Question: What is the potential in the region 
above the plane? It’s not just (1/4πε₀)q/𝓇, for q will induce a certain
amount of negative charge on the nearby surface of the conductor; the total 
potential is due in part to q directly, and in part to this induced charge. But 
how can we possibly calculate the potential, when we don’t know how much 
charge is induced or how it is distributed? 

From a mathematical point of view, our problem is to solve Poisson’s 
equation in the region z > 0, with a single point charge q at (0, 0, d), subject 
to the following boundary conditions: 


(1) V = 0 when z = 0 (since the conducting plane is grounded), and
(2) V → 0 far from the charge (that is, for x² + y² + z² ≫ d²).


The first uniqueness theorem (actually, its corollary) guarantees that there is 
only one function that meets these requirements. If by trick or clever guess 
we can discover such a function, it’s got to be the answer. 

Trick: Forget about the actual problem; we’re going to study a completely 
different situation. This new configuration consists of two point charges, +q 
at (0, 0, d) and −q at (0, 0, −d), and no conducting plane (Fig. 3.11). For this
configuration, I can easily write down the potential: 


$$V(x, y, z) = \frac{1}{4\pi\epsilon_0}\left[\frac{q}{\sqrt{x^2 + y^2 + (z - d)^2}} - \frac{q}{\sqrt{x^2 + y^2 + (z + d)^2}}\right]. \tag{3.9}$$

(The denominators represent the distances from (x, y, z) to the charges +q
and −q, respectively.) It follows that


(1) V = 0 when z = 0,
(2) V → 0 for x² + y² + z² ≫ d²,


and the only charge in the region z > 0 is the point charge +q at (0, 0, d).
But these are precisely the conditions of the original problem! Evidently,
the second configuration happens to produce exactly the same potential
as the first configuration, in the “upper” region z > 0. (The “lower” region, 
z < 0, is completely different, but who cares? The upper part is all we need.) 
Conclusion: The potential of a point charge above an infinite grounded 
conductor is given by Eq. 3.9, for z > 0. 

Notice the crucial role played by the uniqueness theorem in this argu- 
ment: without it, no one would believe this solution, since it was obtained 
for a completely different charge distribution. But the uniqueness theorem 
certifies it: if it satisfies Poisson’s equation in the region of interest, and 
assumes the correct value at the boundaries, then it must be right. 
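
The two boundary conditions are easy to spot-check numerically. Here is a minimal sketch in Python; the function name and the sample values of q and d are illustrative assumptions, and the formula coded is simply Eq. 3.9.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def image_potential(x, y, z, q, d):
    """Eq. 3.9: potential of +q at (0, 0, d) and -q at (0, 0, -d).
    It describes the conductor problem only in the region z >= 0."""
    k = 1.0 / (4.0 * math.pi * EPS0)
    r_plus = math.sqrt(x * x + y * y + (z - d) ** 2)
    r_minus = math.sqrt(x * x + y * y + (z + d) ** 2)
    return k * (q / r_plus - q / r_minus)


q, d = 1.0e-9, 0.5                     # assumed sample values (SI units)

# Condition (1): V = 0 everywhere on the plane z = 0.
on_plane = [image_potential(x, y, 0.0, q, d)
            for x in (-2.0, 0.0, 1.5) for y in (-1.0, 0.3)]

# Condition (2): V becomes small far from the charge.
far_away = image_potential(300.0, 400.0, 500.0, q, d)

# Close to the real charge, the potential is dominated by q itself.
near_charge = image_potential(0.0, 0.0, d + 0.01, q, d)
print(on_plane, far_away, near_charge)
```

Evaluating the same function at z < 0 returns a number, of course, but it has nothing to do with the original problem; only the upper region is certified by the uniqueness theorem.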


3.2.2 Induced Surface Charge 


Now that we know the potential, it is a straightforward matter to compute 
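the surface charge σ induced on the conductor. According to Eq. 2.49,

$$\sigma = -\epsilon_0 \frac{\partial V}{\partial n},$$

where ∂V/∂n is the normal derivative of V at the surface. In this case the
normal direction is the z direction, so

$$\sigma = -\epsilon_0 \left.\frac{\partial V}{\partial z}\right|_{z=0}.$$

From Eq. 3.9,

$$\frac{\partial V}{\partial z} = \frac{1}{4\pi\epsilon_0}\left\{\frac{-q(z - d)}{[x^2 + y^2 + (z - d)^2]^{3/2}} + \frac{q(z + d)}{[x^2 + y^2 + (z + d)^2]^{3/2}}\right\},$$

so⁶

$$\sigma(x, y) = \frac{-qd}{2\pi(x^2 + y^2 + d^2)^{3/2}}. \tag{3.10}$$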


As expected, the induced charge is negative (assuming q is positive) and 
greatest at x = y = 0. 
While we're at it, let’s compute the total induced charge 


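$$Q = \int \sigma \, da.$$

This integral, over the xy plane, could be done in Cartesian coordinates,
with da = dx dy, but it’s a little easier to use polar coordinates (r, φ), with
r² = x² + y² and da = r dr dφ. Then

$$\sigma(r) = \frac{-qd}{2\pi(r^2 + d^2)^{3/2}},$$

and

$$Q = \int_0^{2\pi}\!\!\int_0^{\infty} \frac{-qd}{2\pi(r^2 + d^2)^{3/2}}\, r \, dr \, d\phi = \left.\frac{qd}{\sqrt{r^2 + d^2}}\right|_0^{\infty} = -q. \tag{3.11}$$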


6 For an entirely different derivation of this result, see Prob. 3.40.

The total charge induced on the plane is −q, as (with benefit of hindsight)
you can perhaps convince yourself it had to be. 
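
For the skeptical, the integral in Eq. 3.11 can be done numerically. The sketch below is an illustration under stated assumptions: it uses a midpoint rule in r, cuts the integral off at a finite radius r_max (a computer cannot reach infinity), and works in arbitrary units with q = d = 1.

```python
import math


def induced_charge_within(q, d, r_max, steps=200000):
    """Midpoint-rule integral of sigma(r) * 2*pi*r dr from 0 to r_max,
    with sigma(r) taken from Eq. 3.10 in polar form."""
    dr = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * dr
        sigma = -q * d / (2.0 * math.pi * (r * r + d * d) ** 1.5)
        total += sigma * 2.0 * math.pi * r * dr
    return total


q, d = 1.0, 1.0                        # arbitrary units (assumed)
r_max = 1000.0 * d
q_numeric = induced_charge_within(q, d, r_max)
# The same integral done analytically, but stopped at r_max.
q_partial = -q * (1.0 - d / math.sqrt(r_max * r_max + d * d))
print(q_numeric, q_partial)
```

Even at a radius of 1000 d, about a tenth of a percent of the induced charge is still unaccounted for; the distribution has a long tail, though the total is −q all the same.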


3.2.3 Force and Energy 


The charge q is attracted toward the plane, because of the negative induced 
charge. Let’s calculate the force of attraction. Since the potential in the 
vicinity of q is the same as in the analog problem (the one with +q and 
−q but no conductor), so also is the field and, therefore, the force:

$$\mathbf{F} = -\frac{1}{4\pi\epsilon_0} \frac{q^2}{(2d)^2}\,\hat{\mathbf{z}}. \tag{3.12}$$

Beware: It is easy to get carried away, and assume that everything is the
same in the two problems. Energy, however, is not the same. With the two
point charges and no conductor, Eq. 2.42 gives

$$W = -\frac{1}{4\pi\epsilon_0} \frac{q^2}{2d}. \tag{3.13}$$

But for a single charge and conducting plane, the energy is half of this:

$$W = -\frac{1}{4\pi\epsilon_0} \frac{q^2}{4d}. \tag{3.14}$$

Why half? Think of the energy as stored in the fields (Eq. 2.45):

$$W = \frac{\epsilon_0}{2} \int E^2 \, d\tau.$$


In the two-charge analog (Fig. 3.11), both the upper region (z > 0) and the 
lower region (z < 0) contribute — and by symmetry they contribute equally. 
But in the original configuration (Fig. 3.10), only the upper region contains 
a nonzero field, and hence the energy is half as great.⁷

Of course, one could also determine the energy by calculating the work it 
takes to bring q in from infinity. The force required (to oppose the electrical 
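force in Eq. 3.12) is (1/4πε₀)(q²/4z²) ẑ, so

$$\begin{aligned}
W &= \int_{\infty}^{d} \mathbf{F}\cdot d\mathbf{l} = \frac{1}{4\pi\epsilon_0} \int_{\infty}^{d} \frac{q^2}{4z^2}\, dz \\
&= \frac{1}{4\pi\epsilon_0} \left(-\frac{q^2}{4z}\right)\Bigg|_{\infty}^{d} = -\frac{1}{4\pi\epsilon_0} \frac{q^2}{4d}.
\end{aligned}$$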

As I move q toward the conductor, I do work only on q. It is true that induced 
charge is moving in over the conductor, but this costs me nothing, since the 
whole conductor is at potential zero. By contrast, if I simultaneously bring 
in two point charges (with no conductor), I do work on both of them, and 
the total is (again) twice as great. 
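
A quick consistency check on the factor of two: the force on q should be minus the slope of the energy with respect to its height. The sketch below (illustrative function names and sample values; the formulas are Eqs. 3.12, 3.13, and 3.14) differentiates Eq. 3.14 numerically by a central difference and compares the result with Eq. 3.12.

```python
import math

EPS0 = 8.8541878128e-12                # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)


def force_on_charge(q, d):
    """z-component of the force on q at height d (Eq. 3.12)."""
    return -K * q * q / (2.0 * d) ** 2


def energy_with_plane(q, d):
    """Energy of q a distance d above the grounded plane (Eq. 3.14)."""
    return -K * q * q / (4.0 * d)


def energy_two_charges(q, d):
    """Energy of +q and -q a distance 2d apart, no conductor (Eq. 3.13)."""
    return -K * q * q / (2.0 * d)


q, d, h = 1.0e-9, 0.5, 1.0e-6          # assumed sample values (SI units)

# Force as minus the slope of the energy (central difference in d).
force_from_energy = -(energy_with_plane(q, d + h)
                      - energy_with_plane(q, d - h)) / (2.0 * h)
ratio = energy_with_plane(q, d) / energy_two_charges(q, d)
print(force_from_energy, force_on_charge(q, d), ratio)
```

Differentiating Eq. 3.13 with respect to d instead would give twice the force, which is just the statement that in the two-charge problem changing d moves both charges.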


7 For a generalization of this result, see M. M. Taddei, T. N. C. Mendes, and C. Farina, Eur.
J. Phys. 30, 965 (2009), and Prob. 3.45b.


3.2.4 Other Image Problems


The method just described is not limited to a single point charge; any 
stationary charge distribution near a grounded conducting plane can be 
treated in the same way, by introducing its mirror image — hence the name 
method of images. (Remember that the image charges have the opposite sign; 
this is what guarantees that the xy plane will be at potential zero.) There are 
also some exotic problems that can be handled in similar fashion; the nicest 
of these is the following. 


Example 3.2. A point charge q is situated a distance a from the center 
of a grounded conducting sphere of radius R < a (Fig. 3.12). Find the 
potential outside the sphere. 


Solution. Examine the completely different configuration, consisting
of the point charge q together with another point charge

$$q' = -\frac{R}{a}\, q, \tag{3.15}$$

placed a distance

$$b = \frac{R^2}{a} \tag{3.16}$$

to the right of the center of the sphere (Fig. 3.13). No conductor now,
just the two point charges. The potential of this configuration is

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\left(\frac{q}{\mathscr{r}} + \frac{q'}{\mathscr{r}'}\right), \tag{3.17}$$

where 𝓇 and 𝓇′ are the distances from q and q′, respectively. Now, it
happens (see Prob. 3.8) that this potential vanishes at all points on 
the sphere (r= R), and therefore fits the boundary conditions for our 
original problem, in the exterior region. Conclusion: Equation 3.17 is 
the potential of a point charge near a grounded conducting sphere. 


8 This solution is due to William Thomson (later Lord Kelvin), who published it in 1848, 
when he was just 24. It was apparently inspired by a theorem of Apollonius (200 BC) that 
says the locus of points with a fixed ratio of distances from two given points is a sphere. 
See J. C. Maxwell, Treatise on Electricity and Magnetism, Vol. 1, Dover, New York (1954), 
p. 245. I thank Gabriel Karl for this interesting history.

(Notice that b is less than R, so the “image” charge q′ is safely inside
the sphere; you cannot put image charges in the region where you are
calculating V, since that would change ρ, and you'd be solving Poisson’s
equation with the wrong source.) In particular, the force of attraction
between the charge and the sphere is

$$F = \frac{1}{4\pi\epsilon_0} \frac{qq'}{(a - b)^2} = -\frac{1}{4\pi\epsilon_0} \frac{q^2 R a}{(a^2 - R^2)^2}. \tag{3.18}$$


The method of images is delightfully simple ... when it works. But it 
is as much an art as a science, for you must somehow think up just the 
right “auxiliary” configuration, and for most shapes this is forbiddingly 
complicated, if not impossible. 
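
For the sphere of Ex. 3.2, it is reassuring to confirm numerically that the image charge really does ground the surface. The sketch below is illustrative only: the function names and the sample values of q, a, and R are assumptions, and the distances are computed with the law of cosines, taking the z-axis along the line through q.

```python
import math

EPS0 = 8.8541878128e-12                # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)


def image_for_sphere(q, a, R):
    """Image charge and its distance from the center (Eqs. 3.15, 3.16)."""
    return -(R / a) * q, R * R / a


def potential(r, theta, q, a, R):
    """Eq. 3.17 at spherical coordinates (r, theta), with q at z = a."""
    q_image, b = image_for_sphere(q, a, R)
    dist_q = math.sqrt(r * r + a * a - 2.0 * r * a * math.cos(theta))
    dist_image = math.sqrt(r * r + b * b - 2.0 * r * b * math.cos(theta))
    return K * (q / dist_q + q_image / dist_image)


q, a, R = 1.0e-9, 2.0, 1.0             # assumed sample values (SI units)
q_image, b = image_for_sphere(q, a, R)
scale = K * q / R                      # natural size of the potential

# V should vanish (to rounding error) all over the sphere r = R.
on_sphere = [potential(R, math.pi * i / 8.0, q, a, R) for i in range(9)]

# Two ways of writing the force of attraction (Eq. 3.18).
force_image = K * q * q_image / (a - b) ** 2
force_closed = -K * q * q * R * a / (a * a - R * R) ** 2
print(max(abs(v) for v in on_sphere) / scale, force_image, force_closed)
```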


Problem 3.7. Find the force on the charge +q in Fig. 3.14. (The xy plane is a grounded
conductor.) 


Problem 3.8.


(a) Using the law of cosines, show that Eq. 3.17 can be written as

$$V(r, \theta) = \frac{1}{4\pi\epsilon_0}\left[\frac{q}{\sqrt{r^2 + a^2 - 2ra\cos\theta}} - \frac{q}{\sqrt{R^2 + (ra/R)^2 - 2ra\cos\theta}}\right], \tag{3.19}$$

where r and θ are the usual spherical polar coordinates, with the z-axis along the line
through q. In this form, it is obvious that V = 0 on the sphere, r = R.

(b) Find the induced surface charge on the sphere, as a function of θ. Integrate this to get the
total induced charge. (What should it be?)

(c) Calculate the energy of this configuration.


Problem 3.9. Find the force on a point charge q, a distance a from the center of a grounded 
spherical conducting shell of inner radius R, if a < R.


Problem 3.10. In Ex. 3.2 we assumed that the conducting sphere was grounded (V = 0). But 
with the addition of a second image charge, the same basic model will handle the case of a 
sphere at any potential V₀ (relative, of course, to infinity). What charge should you use, and
where should you put it? Find the force of attraction between a point charge q and a neutral
conducting sphere.


Problem 3.11. Find the force between a point charge q and a nearby spherical conductor
which also carries (total) charge q (use the method of Prob. 3.10). Show that this force is actually
attractive (!) if the separation distance a is less than a critical value a_c. What is that critical value?
[Answer: R(1 + √5)/2.]


Problem 3.12. A uniform line charge λ is placed on an infinite straight wire, a distance d
above a grounded conducting plane. (Let's say the wire runs parallel to the x-axis and directly 
above it, and the conducting plane is the xy plane.) 


(a) Find the potential in the region above the plane. [Hint: Refer to Prob. 2.53.] 
(b) Find the charge density σ induced on the conducting plane.


Problem 3.13. Two semi-infinite grounded conducting planes meet at right angles. In the 
region between them, there is a point charge q, situated as shown in Fig. 3.15. Set up the image 
configuration, and calculate the potential in this region. What charges do you need, and where 
should they be located? What is the force on q? How much work did it take to bring q in from 
infinity? Suppose the planes met at some angle other than 90°; would you still be able to solve 
the problem by the method of images? If not, for what particular angles does the method work? 


[Fig. 3.15]    [Fig. 3.16]


Problem 3.14. Two long, straight copper pipes, each of radius R, are held a distance 2d apart. 
One is at potential V₀, the other at −V₀ (Fig. 3.16). Find the potential everywhere. [Hint:
Exploit the result of Prob. 2.53.] 


3.3 Separation of Variables 


In this section we attack Laplace’s equation directly, using the method of 
separation of variables, which is the physicist’s favorite tool for solving par- 
tial differential equations. The method is applicable in circumstances where 
the potential (V) or the charge density (σ) is specified on the boundaries of
some region, and we are asked to find the potential in the interior. The basic 
strategy is very simple: we look for solutions that are products of functions, 
each of which depends on only one of the coordinates. The algebraic details, 
however, can be formidable, so I’m going to develop the method through a 
sequence of examples. We’ll start with Cartesian coordinates and then do 
spherical coordinates (I’ll leave the cylindrical case for you to tackle on your 
own, in Prob. 3.26).


Example 3.3. Two infinite grounded metal plates lie parallel to the
xz plane, one at y = 0, the other at y = a (Fig. 3.17). The left end, at
x = 0, is closed off with an infinite strip insulated from the two plates,
and maintained at a specified potential V₀(y). Find the potential inside
this “slot.”


Solution. The configuration is independent of z, so this is really a two- 
dimensional problem. In mathematical terms, we must solve Laplace’s 
equation, 


$$\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} = 0, \tag{3.20}$$

subject to the boundary conditions

(i) V = 0 when y = 0,
(ii) V = 0 when y = a,
(iii) V = V₀(y) when x = 0,
(iv) V → 0 as x → ∞.    (3.21)


(The latter, although not explicitly stated in the problem, is necessary 
on physical grounds: as you get farther and farther away from the “hot” 
strip at x = 0, the potential should drop to zero.) Since the potential is 
specified on all boundaries, the answer is uniquely determined. 

The first step is to look for solutions in the form of products: 


$$V(x, y) = X(x)\,Y(y). \tag{3.22}$$


On the face of it, this is an absurd restriction — the overwhelming 
majority of solutions to Laplace’s equation do not have such a form. 
For example, V(x,y) = (5x + 6y) satisfies Eq. 3.20, but you can’t 
express it as the product of a function of x times a function of y. 
Obviously, we’re only going to get a tiny subset of all possible solutions 
by this means, and it would be a miracle if one of them happened to 
fit the boundary conditions of our problem. But hang on, because the 
solutions we do get are very special, and it turns out that by pasting 
them together we can construct the general solution.

Anyway, putting Eq. 3.22 into Eq. 3.20, we obtain

$$Y \frac{d^2 X}{dx^2} + X \frac{d^2 Y}{dy^2} = 0.$$

The next step is to “separate the variables” (that is, collect all the
x-dependence into one term and all the y-dependence into another).
Typically, this is accomplished by dividing through by V:

$$\frac{1}{X} \frac{d^2 X}{dx^2} + \frac{1}{Y} \frac{d^2 Y}{dy^2} = 0. \tag{3.23}$$

Here the first term depends only on x and the second only on y; in
other words, we have an equation of the form

$$f(x) + g(y) = 0. \tag{3.24}$$

Now, there’s only one way this could possibly be true: f and g must
both be constant. For if f(x) changed, as you vary x, then by holding y
fixed and fiddling with x, the sum f(x) + g(y) would change, in violation
of Eq. 3.24, which says it’s always zero. (That’s a simple but somehow
rather elusive argument; don’t accept it without due thought, because
the whole method rides on it.)
It follows from Eq. 3.23, then, that

$$\frac{1}{X} \frac{d^2 X}{dx^2} = C_1 \quad\text{and}\quad \frac{1}{Y} \frac{d^2 Y}{dy^2} = C_2, \quad\text{with } C_1 + C_2 = 0. \tag{3.25}$$

One of these constants is positive, the other negative (or perhaps both
are zero). In general, one must investigate all the possibilities; however,
in this particular problem we need C₁ positive (call it k²) and C₂
negative, for reasons that will appear in a moment. Thus

$$\frac{d^2 X}{dx^2} = k^2 X, \qquad \frac{d^2 Y}{dy^2} = -k^2 Y. \tag{3.26}$$

Notice what has happened: a partial differential equation (3.20) has
been converted into two ordinary differential equations (3.26). The
advantage of this is obvious: ordinary differential equations are a lot
easier to solve. Indeed,

$$X(x) = Ae^{kx} + Be^{-kx}, \qquad Y(y) = C\sin ky + D\cos ky,$$

so

$$V(x, y) = (Ae^{kx} + Be^{-kx})(C\sin ky + D\cos ky). \tag{3.27}$$
This is the appropriate separable solution to Laplace’s equation; it
remains to impose the boundary conditions, and see what they tell us
about the constants. To begin at the end, condition (iv) requires that A
equal zero.⁹ Absorbing B into C and D, we are left with

$$V(x, y) = e^{-kx}(C\sin ky + D\cos ky).$$

Condition (i) now demands that D equal zero, so

$$V(x, y) = Ce^{-kx}\sin ky. \tag{3.28}$$

Meanwhile, (ii) yields sin ka = 0, from which it follows that

$$k = \frac{n\pi}{a} \quad (n = 1, 2, 3, \ldots). \tag{3.29}$$


9 I’m assuming k is positive, but this involves no loss of generality; negative k gives the
same solution (Eq. 3.27), only with the constants shuffled (A ↔ B, C → −C). Occasionally
(though not in this example), k = 0 must also be included (see Prob. 3.59).


(At this point you can see why I chose C₁ positive and C₂ negative: if X
were sinusoidal, we could never arrange for it to go to zero at infinity,
and if Y were exponential we could not make it vanish at both 0 and
a. Incidentally, n = 0 is no good, for in that case the potential vanishes
everywhere. And we have already excluded negative n’s.)
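
If you would like to see a separable solution pass inspection, here is a minimal numerical sketch. It assumes a five-point finite-difference estimate of the Laplacian (a stand-in for the exact derivatives) and arbitrary sample values a = 1, n = 2; the function names are illustrative.

```python
import math


def separable_solution(x, y, n, a, C=1.0):
    """Eq. 3.28 with k = n*pi/a (Eq. 3.29)."""
    k = n * math.pi / a
    return C * math.exp(-k * x) * math.sin(k * y)


def laplacian_2d(f, x, y, h=1.0e-3):
    """Five-point finite-difference estimate of d2f/dx2 + d2f/dy2."""
    return (f(x + h, y) + f(x - h, y) + f(x, y + h) + f(x, y - h)
            - 4.0 * f(x, y)) / (h * h)


a, n = 1.0, 2                          # assumed sample values


def V(x, y):
    return separable_solution(x, y, n, a)


# The two second derivatives are each of size k^2 |V|; their sum should
# be far smaller (zero, up to the finite-difference error).
residual = laplacian_2d(V, 0.2, 0.3)
term_size = (n * math.pi / a) ** 2 * abs(V(0.2, 0.3))

# Boundary conditions (i), (ii), and (iv).
at_y0 = V(0.4, 0.0)
at_ya = V(0.4, a)
far_out = V(50.0, 0.3)
print(residual, term_size, at_y0, at_ya, far_out)
```

Condition (iii) is conspicuously absent from the check: a single separable solution can match the potential at x = 0 only in very special cases.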

That’s as far as we can go, using separable solutions, and unless
V₀(y) just happens to have the form sin(nπy/a) for some integer n, we
simply can’t fit the final boundary condition at x = 0. But now comes
the critical step that redeems the method: separation of variables has
yielded an infinite family of solutions (one for each n), and whereas
none of them by itself satisfies the final boundary condition, it is
possible to combine them in a way that does. Laplace’s equation is
linear, in the sense that if V₁, V₂, V₃, ... satisfy it, so does any linear
combination, V = α₁V₁ + α₂V₂ + α₃V₃ + ⋯, where α₁, α₂, ... are
arbitrary constants. For

$$\nabla^2 V = \alpha_1 \nabla^2 V_1 + \alpha_2 \nabla^2 V_2 + \cdots = 0\,\alpha_1 + 0\,\alpha_2 + \cdots = 0.$$


Exploiting this fact, we can patch together the separable solutions 
(Eq. 3.28) to construct a much more general solution: 


$$V(x, y) = \sum_{n=1}^{\infty} C_n e^{-n\pi x/a} \sin(n\pi y/a). \tag{3.30}$$

This still satisfies three of the boundary conditions; but can we (by astute
choice of the coefficients Cₙ) fit the final boundary condition (iii),

$$V(0, y) = \sum_{n=1}^{\infty} C_n \sin(n\pi y/a) = V_0(y)? \tag{3.31}$$

Well, you may recognize this sum: it’s a Fourier sine series. And
Dirichlet’s theorem¹⁰ guarantees that virtually any function V₀(y) (it
can even have a finite number of discontinuities) can be expanded in
such a series.
But how do we actually determine the coefficients Cₙ, buried as they
are in that infinite sum? The device for accomplishing this is so lovely
it deserves a name. I call it Fourier’s trick, though it seems Euler had

10 M. Boas, Mathematical Methods in the Physical Sciences, 3rd ed., John Wiley, New York (2006), p. 356.

used essentially the same idea somewhat earlier. Here’s how it goes:
multiply Eq. 3.31 by sin(n′πy/a) (where n′ is a positive integer), and
integrate from 0 to a:

$$\sum_{n=1}^{\infty} C_n \int_0^a \sin(n\pi y/a)\sin(n'\pi y/a)\, dy = \int_0^a V_0(y)\sin(n'\pi y/a)\, dy. \tag{3.32}$$

You can work out the integral on the left for yourself; the answer is

$$\int_0^a \sin(n\pi y/a)\sin(n'\pi y/a)\, dy = \begin{cases} 0, & \text{if } n' \neq n, \\ a/2, & \text{if } n' = n. \end{cases} \tag{3.33}$$

Thus all the terms in the series drop out, save only the one where n = n′,
and the left-hand side of Eq. 3.32 reduces to (a/2)Cₙ′. Conclusion:¹¹

$$C_n = \frac{2}{a} \int_0^a V_0(y)\sin(n\pi y/a)\, dy. \tag{3.34}$$
0 


That does it: Eq. 3.30 is the solution, with coefficients given by
Eq. 3.34. As a concrete example, suppose the strip at x = 0 is a metal
plate with constant potential V₀ (remember, it’s insulated from the
grounded plates at y = 0 and y = a). Then

$$C_n = \frac{2V_0}{a} \int_0^a \sin(n\pi y/a)\, dy = \frac{2V_0}{n\pi}(1 - \cos n\pi) = \begin{cases} 0, & \text{if $n$ is even,} \\ \dfrac{4V_0}{n\pi}, & \text{if $n$ is odd.} \end{cases} \tag{3.35}$$

Thus

$$V(x, y) = \frac{4V_0}{\pi} \sum_{n=1,3,5,\ldots} \frac{1}{n}\, e^{-n\pi x/a} \sin(n\pi y/a). \tag{3.36}$$


Figure 3.18 is a plot of this potential; Fig. 3.19 shows how the first few 
terms in the Fourier series combine to make a better and better approx- 
imation to the constant Vo: (a) is n = 1 only, (b) includes n up to 5, (c) 
is the sum of the first 10 terms, and (d) is the sum of the first 100 terms. 
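
The coefficients and the slow convergence at x = 0 can both be reproduced with a few lines of code. This is a minimal sketch, assuming a midpoint-rule evaluation of Eq. 3.34 and sample values a = V₀ = 1; the function names are illustrative.

```python
import math


def fourier_coefficient(v0, n, a, steps=20000):
    """Eq. 3.34 by the midpoint rule:
    (2/a) * integral from 0 to a of v0(y) sin(n pi y / a) dy."""
    dy = a / steps
    total = 0.0
    for i in range(steps):
        y = (i + 0.5) * dy
        total += v0(y) * math.sin(n * math.pi * y / a) * dy
    return 2.0 * total / a


def partial_sum(x, y, a, v0_const, n_terms):
    """First n_terms nonzero (odd-n) terms of Eq. 3.36."""
    total = 0.0
    for m in range(n_terms):
        n = 2 * m + 1
        total += (math.exp(-n * math.pi * x / a)
                  * math.sin(n * math.pi * y / a) / n)
    return 4.0 * v0_const / math.pi * total


a, V0 = 1.0, 1.0                       # assumed sample values

# Numerical coefficients for n = 1..5, to compare with Eq. 3.35.
coeffs = [fourier_coefficient(lambda y: V0, n, a) for n in (1, 2, 3, 4, 5)]
expected = [4.0 * V0 / (n * math.pi) if n % 2 == 1 else 0.0
            for n in (1, 2, 3, 4, 5)]

# Error at the middle of the strip (x = 0, y = a/2) using 1, 3, 10,
# and 100 nonzero terms of the series.
errors = [abs(partial_sum(0.0, a / 2.0, a, V0, m) - V0)
          for m in (1, 3, 10, 100)]
print(coeffs, errors)
```

The errors shrink only like one over the number of terms at x = 0; for x > 0 the exponential factors make the series converge far more rapidly.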


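[Figs. 3.18 and 3.19; recoverable axis label: y/a.]


11 For aesthetic reasons I’ve dropped the prime; Eq. 3.34 holds for n = 1, 2, 3, ..., and it
doesn’t matter (obviously) what letter you use for the “dummy” index.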


Incidentally, the infinite series in Eq. 3.36 can be summed explicitly (try
your hand at it, if you like); the result is¹²

$$V(x, y) = \frac{2V_0}{\pi} \tan^{-1}\left(\frac{\sin(\pi y/a)}{\sinh(\pi x/a)}\right). \tag{3.37}$$


In this form, it is easy to check that Laplace’s equation is obeyed and 
the four boundary conditions (Eq. 3.21) are satisfied. 
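
A numerical comparison of the series with the closed form makes a nice check. The sketch below is illustrative (function names, sample points, and a = V₀ = 1 are assumptions); it evaluates Eq. 3.36, truncated after 100 nonzero terms, against Eq. 3.37 at a few interior points, and then probes the boundary conditions with the closed form.

```python
import math


def series_potential(x, y, a, V0, n_terms=100):
    """Eq. 3.36 truncated after n_terms odd-n terms."""
    total = 0.0
    for m in range(n_terms):
        n = 2 * m + 1
        total += (math.exp(-n * math.pi * x / a)
                  * math.sin(n * math.pi * y / a) / n)
    return 4.0 * V0 / math.pi * total


def closed_form_potential(x, y, a, V0):
    """Eq. 3.37; intended for x > 0, where sinh(pi x / a) is nonzero."""
    return 2.0 * V0 / math.pi * math.atan(
        math.sin(math.pi * y / a) / math.sinh(math.pi * x / a))


a, V0 = 1.0, 1.0                       # assumed sample values
points = [(0.1, 0.5), (0.3, 0.4), (0.5, 0.9), (1.0, 0.25)]
differences = [abs(series_potential(x, y, a, V0)
                   - closed_form_potential(x, y, a, V0))
               for x, y in points]

# Boundary behavior of the closed form.
grounded_low = closed_form_potential(0.3, 0.0, a, V0)     # (i)  y = 0
grounded_high = closed_form_potential(0.3, a, a, V0)      # (ii) y = a
near_strip = closed_form_potential(1.0e-9, 0.5, a, V0)    # (iii) x -> 0
far_out = closed_form_potential(20.0, 0.5, a, V0)         # (iv) x large
print(differences, grounded_low, grounded_high, near_strip, far_out)
```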


The success of this method hinged on two extraordinary properties of
the separable solutions (Eqs. 3.28 and 3.29): completeness and orthogonality.
A set of functions fₙ(y) is said to be complete if any other function f(y)
can be expressed as a linear combination of them:

$$f(y) = \sum_{n=1}^{\infty} C_n f_n(y). \tag{3.38}$$

The functions sin(nπy/a) are complete on the interval 0 ≤ y ≤ a. It was
this fact, guaranteed by Dirichlet’s theorem, that assured us Eq. 3.31 could
be satisfied, given the proper choice of the coefficients Cₙ. (The proof of
completeness, for a particular set of functions, is an extremely difficult
business, and I’m afraid physicists tend to assume it’s true and leave the
checking to others.) A set of functions is orthogonal if the integral of the
product of any two distinct members of the set is zero:

$$\int_0^a f_n(y)\, f_{n'}(y)\, dy = 0 \quad\text{for } n' \neq n. \tag{3.39}$$
0 


12 Are you perhaps disappointed that separation of variables delivers a “solution” (Eq. 3.36) in
the form of an infinite series? Why not a tidy “closed form,” like Eq. 3.37? (It is rare, actually,
that the series can be summed explicitly.) In truth, there is less to this distinction than meets
the eye. Mathematicians like to tell us that sin x (for instance) is “really” shorthand for the
series

$$x - \frac{1}{3!}x^3 + \frac{1}{5!}x^5 - \cdots.$$

A (convergent) series is a perfectly good way to define a function, and the only question is
whether or not that series has a familiar name. The ones that occur frequently have been
given names (trigonometric functions, Bessel functions, hypergeometric functions, and so
on), but that’s nothing deep, just a matter of convenience.

The sine functions are orthogonal (Eq. 3.33); this is the property on which
Fourier’s trick is based, allowing us to kill off all terms but one in the infinite
series and thereby solve for the coefficients $C_n$. (Proof of orthogonality
is generally quite simple, either by direct integration or by analysis of the
differential equation from which the functions came.)
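
The orthogonality relation (Eq. 3.33) and the coefficient formula (Eq. 3.34) are easy to test numerically. The following is a minimal sketch using a hand-written Simpson's rule; the helper names `simpson`, `sine_overlap`, and `fourier_coefficient` are hypothetical, and the step count is an arbitrary choice.

```python
import math

def simpson(f, lo, hi, n=2000):
    """Composite Simpson's rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def sine_overlap(n, n_prime, a=1.0):
    """Left-hand side of Eq. 3.33, evaluated numerically."""
    return simpson(lambda y: math.sin(n * math.pi * y / a)
                   * math.sin(n_prime * math.pi * y / a), 0.0, a)

def fourier_coefficient(v0, n, a=1.0):
    """C_n from Eq. 3.34 for a boundary potential v0(y)."""
    return 2.0 / a * simpson(lambda y: v0(y) * math.sin(n * math.pi * y / a), 0.0, a)

print(sine_overlap(2, 5))                     # distinct indices: should be near 0
print(sine_overlap(3, 3, a=2.0))              # equal indices: should be near a/2
print(fourier_coefficient(lambda y: 1.0, 3))  # constant V_0 = 1: compare Eq. 3.35
```

For a constant boundary potential the numerical coefficients should reproduce Eq. 3.35: zero for even n, and 4V_0/nπ for odd n.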


Example 3.4. Two infinitely long grounded metal plates, again at y = 0
and y = a, are connected at $x = \pm b$ by metal strips maintained at a
constant potential $V_0$, as shown in Fig. 3.20 (a thin layer of insulation
at each corner prevents them from shorting out). Find the potential
inside the resulting rectangular pipe.


Solution. Once again, the configuration is independent of z. Our
problem is to solve Laplace’s equation

$$\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} = 0,$$

subject to the boundary conditions

(i) V = 0 when y = 0,
(ii) V = 0 when y = a,
(iii) V = V_0 when x = b,
(iv) V = V_0 when x = −b.        (3.40)


The argument runs as before, up to Eq. 3.27:

$$V(x, y) = \left(A e^{kx} + B e^{-kx}\right)\left(C \sin ky + D \cos ky\right).$$

This time, however, we cannot set A = 0; the region in question does
not extend to $x = \infty$, so $e^{kx}$ is perfectly acceptable. On the other hand,
the situation is symmetric with respect to x, so V(−x, y) = V(x, y), and
it follows that A = B. Using

$$e^{kx} + e^{-kx} = 2\cosh kx,$$

and absorbing 2A into C and D, we have

$$V(x, y) = \cosh kx\,\left(C \sin ky + D \cos ky\right).$$

Boundary conditions (i) and (ii) require, as before, that D = 0 and
$k = n\pi/a$, so

$$V(x, y) = C \cosh(n\pi x/a)\,\sin(n\pi y/a). \tag{3.41}$$


Because V(x, y) is even in x, it will automatically meet condition 
(iv) if it fits (iii). It remains, therefore, to construct the general linear 
combination, 


$$V(x, y) = \sum_{n=1}^{\infty} C_n \cosh(n\pi x/a)\,\sin(n\pi y/a),$$

and pick the coefficients $C_n$ in such a way as to satisfy condition (iii):

$$V(b, y) = \sum_{n=1}^{\infty} C_n \cosh(n\pi b/a)\,\sin(n\pi y/a) = V_0.$$

This is the same problem in Fourier analysis that we faced before; I
quote the result from Eq. 3.35:

$$C_n \cosh(n\pi b/a) = \begin{cases} 0 & \text{if } n \text{ is even}, \\ 4V_0/n\pi & \text{if } n \text{ is odd}. \end{cases}$$

Conclusion: The potential in this case is given by

$$V(x, y) = \frac{4V_0}{\pi} \sum_{n=1,3,5,\ldots} \frac{1}{n}\, \frac{\cosh(n\pi x/a)}{\cosh(n\pi b/a)}\, \sin(n\pi y/a). \tag{3.42}$$


This function is shown in Fig. 3.21. 
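
For readers who want to plot or spot-check Eq. 3.42, here is a hedged sketch of one way to evaluate the series. The names `cosh_ratio` and `v_pipe` are illustrative, and rewriting the ratio of hyperbolic cosines in terms of exponentials is my own precaution against overflow at large n, not something the text prescribes.

```python
import math

def cosh_ratio(u, v):
    """cosh(u)/cosh(v), written with decaying exponentials to avoid overflow."""
    u, v = abs(u), abs(v)
    return math.exp(u - v) * (1.0 + math.exp(-2.0 * u)) / (1.0 + math.exp(-2.0 * v))

def v_pipe(x, y, a=1.0, b=1.0, v0=1.0, n_terms=2000):
    """Partial sum of Eq. 3.42 over the first n_terms odd values of n (|x| <= b)."""
    total = 0.0
    for j in range(n_terms):
        n = 2 * j + 1
        k = n * math.pi / a
        total += cosh_ratio(k * x, k * b) * math.sin(k * y) / n
    return 4.0 * v0 / math.pi * total

print(v_pipe(1.0, 0.5))                # on the strip x = b: close to V_0
print(v_pipe(0.0, 0.5, a=1.0, b=0.5))  # center of a square pipe (2b = a)
```

Two checks are worth noting. On the strip x = b the series converges only slowly (like an alternating harmonic series), so many terms are needed there. For a square cross section (2b = a), a symmetry-plus-superposition argument suggests the center value should be V_0/2, which the second line can be compared against.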




Example 3.5. An infinitely long rectangular metal pipe (sides a and b) 
is grounded, but one end, at x = 0, is maintained at a specified potential 
Vo(y, z), as indicated in Fig. 3.22. Find the potential inside the pipe. 


Solution. This is a fully three-dimensional problem,

$$\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} + \frac{\partial^2 V}{\partial z^2} = 0, \tag{3.43}$$

[Fig. 3.22]


subject to the boundary conditions 


(i) V = 0 when y = 0,
(ii) V = 0 when y = a,
(iii) V = 0 when z = 0,
(iv) V = 0 when z = b,
(v) V → 0 as x → ∞,
(vi) V = V_0(y, z) when x = 0.        (3.44)

As always, we look for solutions that are products:

$$V(x, y, z) = X(x)\,Y(y)\,Z(z). \tag{3.45}$$

Putting this into Eq. 3.43, and dividing by V, we find

$$\frac{1}{X}\frac{d^2 X}{dx^2} + \frac{1}{Y}\frac{d^2 Y}{dy^2} + \frac{1}{Z}\frac{d^2 Z}{dz^2} = 0.$$

It follows that

$$\frac{1}{X}\frac{d^2 X}{dx^2} = C_1, \quad \frac{1}{Y}\frac{d^2 Y}{dy^2} = C_2, \quad \frac{1}{Z}\frac{d^2 Z}{dz^2} = C_3, \quad \text{with } C_1 + C_2 + C_3 = 0.$$

Our previous experience (Ex. 3.3) suggests that $C_1$ must be positive, $C_2$
and $C_3$ negative. Setting $C_2 = -k^2$ and $C_3 = -l^2$, we have $C_1 = k^2 + l^2$,
and hence

$$\frac{d^2 X}{dx^2} = (k^2 + l^2)X, \quad \frac{d^2 Y}{dy^2} = -k^2 Y, \quad \frac{d^2 Z}{dz^2} = -l^2 Z. \tag{3.46}$$

Once again, separation of variables has turned a partial differential
equation into ordinary differential equations. The solutions are

$$X(x) = A e^{\sqrt{k^2 + l^2}\,x} + B e^{-\sqrt{k^2 + l^2}\,x},$$
$$Y(y) = C \sin ky + D \cos ky,$$
$$Z(z) = E \sin lz + F \cos lz.$$

Boundary condition (v) implies A = 0, (i) gives D = 0, and (iii) yields
F = 0, whereas (ii) and (iv) require that $k = n\pi/a$ and $l = m\pi/b$, where
n and m are positive integers. Combining the remaining constants, we
are left with

$$V(x, y, z) = C e^{-\pi\sqrt{(n/a)^2 + (m/b)^2}\,x} \sin(n\pi y/a)\,\sin(m\pi z/b). \tag{3.47}$$


This solution meets all the boundary conditions except (vi). It contains 
two unspecified integers (n and m), and the most general linear combi- 
nation is a double sum: 


$$V(x, y, z) = \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} C_{n,m}\, e^{-\pi\sqrt{(n/a)^2 + (m/b)^2}\,x} \sin(n\pi y/a)\,\sin(m\pi z/b). \tag{3.48}$$

We hope to fit the remaining boundary condition,

$$V(0, y, z) = \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} C_{n,m} \sin(n\pi y/a)\,\sin(m\pi z/b) = V_0(y, z), \tag{3.49}$$

by appropriate choice of the coefficients $C_{n,m}$. To determine these
constants, we multiply by $\sin(n'\pi y/a)\,\sin(m'\pi z/b)$, where $n'$ and $m'$ are
arbitrary positive integers, and integrate:

$$\sum_{n=1}^{\infty} \sum_{m=1}^{\infty} C_{n,m} \int_0^a \sin(n\pi y/a)\,\sin(n'\pi y/a)\,dy \int_0^b \sin(m\pi z/b)\,\sin(m'\pi z/b)\,dz = \int_0^a\!\!\int_0^b V_0(y, z)\,\sin(n'\pi y/a)\,\sin(m'\pi z/b)\,dy\,dz.$$

Quoting Eq. 3.33, the left-hand side is $(ab/4)\,C_{n',m'}$, so

$$C_{n,m} = \frac{4}{ab} \int_0^a\!\!\int_0^b V_0(y, z)\,\sin(n\pi y/a)\,\sin(m\pi z/b)\,dy\,dz. \tag{3.50}$$


Equation 3.48, with the coefficients given by Eq. 3.50, is the solution 
to our problem. 

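For instance, if the end of the tube is a conductor at constant
potential $V_0$,

$$C_{n,m} = \frac{4V_0}{ab} \int_0^a \sin(n\pi y/a)\,dy \int_0^b \sin(m\pi z/b)\,dz = \begin{cases} 0 & \text{if } n \text{ or } m \text{ is even}, \\ \dfrac{16V_0}{\pi^2 nm} & \text{if } n \text{ and } m \text{ are odd}. \end{cases} \tag{3.51}$$

In this case,

$$V(x, y, z) = \frac{16V_0}{\pi^2} \sum_{n,m=1,3,5,\ldots} \frac{1}{nm}\, e^{-\pi\sqrt{(n/a)^2 + (m/b)^2}\,x} \sin(n\pi y/a)\,\sin(m\pi z/b). \tag{3.52}$$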


Notice that the successive terms decrease rapidly; a reasonable approx- 
imation would be obtained by keeping only the first few. 
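
To see how quickly the terms of Eq. 3.52 fall off, one can compare a heavily truncated double sum with a longer one. The sketch below is only illustrative: the function name `v_end_cap`, the cutoff parameter `max_odd`, and the sample point are assumptions of mine.

```python
import math

def v_end_cap(x, y, z, a=1.0, b=1.0, v0=1.0, max_odd=41):
    """Partial sum of Eq. 3.52 over odd n, m up to max_odd."""
    total = 0.0
    for n in range(1, max_odd + 1, 2):
        for m in range(1, max_odd + 1, 2):
            decay = math.exp(-math.pi * math.sqrt((n / a) ** 2 + (m / b) ** 2) * x)
            total += (decay * math.sin(n * math.pi * y / a)
                      * math.sin(m * math.pi * z / b) / (n * m))
    return 16.0 * v0 / math.pi ** 2 * total

point = (0.5, 0.5, 0.5)
few = v_end_cap(*point, max_odd=3)    # only the four terms with n, m in {1, 3}
many = v_end_cap(*point, max_odd=41)  # 21 x 21 terms
print(few, many, abs(few - many))
```

At a point half a side length down the pipe, the four-term sum should already agree with the longer sum to within a fraction of a percent of V_0; the agreement degrades as x approaches 0, where the exponential factors no longer help.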


Problem 3.15. Find the potential in the infinite slot of Ex. 3.3 if the boundary at x = 0 consists
of two metal strips: one, from y = 0 to y = a/2, is held at a constant potential $V_0$, and the
other, from y = a/2 to y = a, is at potential $-V_0$.

Problem 3.16. For the infinite slot (Ex. 3.3), determine the charge density $\sigma(y)$ on the strip
at x = 0, assuming it is a conductor at constant potential $V_0$.


Problem 3.17. A rectangular pipe, running parallel to the z-axis (from $-\infty$ to $+\infty$), has three
grounded metal sides, at y = 0, y = a, and x = 0. The fourth side, at x = b, is maintained
at a specified potential $V_0(y)$.

(a) Develop a general formula for the potential inside the pipe.

(b) Find the potential explicitly, for the case $V_0(y) = V_0$ (a constant).


Problem 3.18. A cubical box (side length a) consists of five metal plates, which are welded
together and grounded (Fig. 3.23). The top is made of a separate sheet of metal, insulated
from the others, and held at a constant potential $V_0$. Find the potential inside the box. [What
should the potential at the center (a/2, a/2, a/2) be? Check numerically that your formula is
consistent with this value.]¹³


3.3.2 Spherical Coordinates 


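In the examples considered so far, Cartesian coordinates were clearly
appropriate, since the boundaries were planes. For round objects, spherical
coordinates are more natural. In the spherical system, Laplace’s equation
reads

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2 \frac{\partial V}{\partial r}\right) + \frac{1}{r^2 \sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\, \frac{\partial V}{\partial\theta}\right) + \frac{1}{r^2 \sin^2\theta}\frac{\partial^2 V}{\partial\phi^2} = 0. \tag{3.53}$$

I shall assume the problem has azimuthal symmetry, so that V is independent
of $\phi$;¹⁴ in that case, Eq. 3.53 reduces to

$$\frac{\partial}{\partial r}\left(r^2 \frac{\partial V}{\partial r}\right) + \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\, \frac{\partial V}{\partial\theta}\right) = 0. \tag{3.54}$$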


13 This cute test was suggested by J. Castro.

14 The general case, for $\phi$-dependent potentials, is treated in all the graduate texts. See, for
instance, J. D. Jackson, Classical Electrodynamics, 3rd ed., John Wiley, New York (1999),
Chap. 3.

As before, we look for solutions that are products:

$$V(r, \theta) = R(r)\,\Theta(\theta). \tag{3.55}$$

Putting this into Eq. 3.54, and dividing by V,

$$\frac{1}{R}\frac{d}{dr}\left(r^2 \frac{dR}{dr}\right) + \frac{1}{\Theta \sin\theta}\frac{d}{d\theta}\left(\sin\theta\, \frac{d\Theta}{d\theta}\right) = 0. \tag{3.56}$$

Since the first term depends only on r, and the second only on $\theta$, it follows
that each must be a constant:

$$\frac{1}{R}\frac{d}{dr}\left(r^2 \frac{dR}{dr}\right) = l(l+1), \qquad \frac{1}{\Theta \sin\theta}\frac{d}{d\theta}\left(\sin\theta\, \frac{d\Theta}{d\theta}\right) = -l(l+1). \tag{3.57}$$

Here $l(l+1)$ is just a fancy way of writing the separation constant; you'll
see in a minute why this is convenient.

As always, separation of variables has converted a partial differential
equation (3.54) into ordinary differential equations (3.57). The radial
equation,

$$\frac{d}{dr}\left(r^2 \frac{dR}{dr}\right) = l(l+1)R, \tag{3.58}$$

has the general solution

$$R(r) = A r^l + \frac{B}{r^{l+1}}, \tag{3.59}$$

as you can easily check; A and B are the two arbitrary constants to be
expected in the solution of a second-order differential equation. But the
angular equation,

$$\frac{d}{d\theta}\left(\sin\theta\, \frac{d\Theta}{d\theta}\right) = -l(l+1)\sin\theta\; \Theta, \tag{3.60}$$

is not so simple. The solutions are Legendre polynomials in the variable
$\cos\theta$:

$$\Theta(\theta) = P_l(\cos\theta). \tag{3.61}$$

The function $P_l(x)$ is most conveniently defined by the Rodrigues formula:

$$P_l(x) \equiv \frac{1}{2^l\, l!} \left(\frac{d}{dx}\right)^{l} (x^2 - 1)^l. \tag{3.62}$$

The first few Legendre polynomials are listed in Table 3.1. Notice that $P_l(x)$
is (as the name suggests) an $l$th-order polynomial in x; it contains only even
powers if $l$ is even, and odd powers if $l$ is odd. The factor in front, $(1/2^l\, l!)$,
was chosen (historically) in order that

$$P_l(1) = 1. \tag{3.63}$$

Table 3.1. Legendre polynomials.

$P_0(x) = 1$
$P_1(x) = x$
$P_2(x) = (3x^2 - 1)/2$
$P_3(x) = (5x^3 - 3x)/2$
$P_4(x) = (35x^4 - 30x^2 + 3)/8$
$P_5(x) = (63x^5 - 70x^3 + 15x)/8$
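
The Rodrigues formula (Eq. 3.62) can be applied mechanically to polynomial coefficient lists. The following is a small sketch under that assumption; the function name `legendre_coeffs` and the list-of-coefficients representation are my own choices for illustration, with exact rational arithmetic used so that the entries of Table 3.1 can be compared directly.

```python
from fractions import Fraction
from math import comb, factorial

def legendre_coeffs(l):
    """Coefficients c[k] of x**k in P_l(x), built from the Rodrigues formula."""
    # Binomial expansion: (x^2 - 1)^l = sum_j C(l, j) (-1)^(l - j) x^(2j).
    coeffs = [Fraction(0)] * (2 * l + 1)
    for j in range(l + 1):
        coeffs[2 * j] = Fraction(comb(l, j) * (-1) ** (l - j))
    # Differentiate l times: each pass maps c[k] x^k to k c[k] x^(k-1).
    for _ in range(l):
        coeffs = [k * c for k, c in enumerate(coeffs)][1:]
    # Overall factor 1 / (2^l l!).
    norm = Fraction(1, 2 ** l * factorial(l))
    return [norm * c for c in coeffs]

for l in range(6):
    print(l, legendre_coeffs(l))
```

Summing the coefficients evaluates the polynomial at x = 1, which gives a quick test of the normalization in Eq. 3.63.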


The Rodrigues formula obviously works only for nonnegative integer
values of $l$. Moreover, it provides us with only one solution. But Eq. 3.60
is second order, and it should possess two independent solutions, for every
value of $l$. It turns out that these “other solutions” blow up at $\theta = 0$ and/or
$\theta = \pi$, and are therefore unacceptable on physical grounds.¹⁵ For instance,
the second solution for $l = 0$ is

$$\Theta(\theta) = \ln\left(\tan\frac{\theta}{2}\right). \tag{3.64}$$

You might want to check for yourself that this satisfies Eq. 3.60.

In the case of azimuthal symmetry, then, the most general separable
solution to Laplace’s equation, consistent with minimal physical
requirements, is

$$V(r, \theta) = \left(A r^l + \frac{B}{r^{l+1}}\right) P_l(\cos\theta).$$

(There was no need to include an overall constant in Eq. 3.61 because it can
be absorbed into A and B at this stage.) As before, separation of variables
yields an infinite set of solutions, one for each $l$. The general solution is a
linear combination of separable solutions:

$$V(r, \theta) = \sum_{l=0}^{\infty} \left(A_l r^l + \frac{B_l}{r^{l+1}}\right) P_l(\cos\theta). \tag{3.65}$$


The following examples illustrate the power of this important result. 


Example 3.6. The potential $V_0(\theta)$ is specified on the surface of a
hollow sphere, of radius R. Find the potential inside the sphere.


Solution. In this case, $B_l = 0$ for all $l$; otherwise the potential would
blow up at the origin. Thus,

$$V(r, \theta) = \sum_{l=0}^{\infty} A_l r^l P_l(\cos\theta). \tag{3.66}$$

At r = R this must match the specified function $V_0(\theta)$:

$$V(R, \theta) = \sum_{l=0}^{\infty} A_l R^l P_l(\cos\theta) = V_0(\theta). \tag{3.67}$$


Can this equation be satisfied, for an appropriate choice of coefficients
$A_l$? Yes: the Legendre polynomials (like the sines) constitute a
complete set of functions, on the interval $-1 \le x \le 1$ ($0 \le \theta \le \pi$).
How do we determine the constants? Again, by Fourier’s trick, since the
Legendre polynomials (like the sines) are orthogonal functions:¹⁶

$$\int_{-1}^{1} P_l(x)\,P_{l'}(x)\,dx = \int_0^{\pi} P_l(\cos\theta)\,P_{l'}(\cos\theta)\,\sin\theta\,d\theta = \begin{cases} 0 & \text{if } l' \neq l, \\ \dfrac{2}{2l+1} & \text{if } l' = l. \end{cases} \tag{3.68}$$

Thus, multiplying Eq. 3.67 by $P_{l'}(\cos\theta)\sin\theta$ and integrating, we have

$$A_{l'} R^{l'}\, \frac{2}{2l'+1} = \int_0^{\pi} V_0(\theta)\,P_{l'}(\cos\theta)\,\sin\theta\,d\theta,$$

or

$$A_l = \frac{2l+1}{2R^l} \int_0^{\pi} V_0(\theta)\,P_l(\cos\theta)\,\sin\theta\,d\theta. \tag{3.69}$$

15 In rare cases where the z-axis is excluded, these “other solutions” do have to be considered.


Equation 3.66 is the solution to our problem, with the coefficients 
given by Eq. 3.69. 

It can be difficult to evaluate integrals of the form 3.69 analytically,
and in practice it is often easier to solve Eq. 3.67 “by eyeball.”¹⁷ For
instance, suppose we are told that the potential on the sphere is

$$V_0(\theta) = k \sin^2(\theta/2), \tag{3.70}$$

where k is a constant. Using the half-angle formula, we rewrite this as

$$V_0(\theta) = \frac{k}{2}(1 - \cos\theta) = \frac{k}{2}\left[P_0(\cos\theta) - P_1(\cos\theta)\right].$$

Putting this into Eq. 3.67, we read off immediately that $A_0 = k/2$,
$A_1 = -k/(2R)$, and all other $A_l$'s vanish. Therefore,

$$V(r, \theta) = \frac{k}{2}\left[r^0 P_0(\cos\theta) - \frac{r^1}{R} P_1(\cos\theta)\right] = \frac{k}{2}\left(1 - \frac{r}{R}\cos\theta\right). \tag{3.71}$$
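
When “eyeballing” is not possible, Eq. 3.69 can be evaluated numerically. The sketch below does this for the boundary potential of Eq. 3.70 and compares with the coefficients read off above. The helper names are hypothetical, the Legendre polynomials are generated with the standard three-term recurrence (a substitute for the Rodrigues formula, not something derived in the text), and the values k = 2, R = 3 are arbitrary.

```python
import math

def legendre(l, x):
    """P_l(x) from the recurrence (n+1) P_{n+1} = (2n+1) x P_n - n P_{n-1}."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    return p

def simpson(f, lo, hi, n=2000):
    """Composite Simpson's rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

def interior_coefficient(v0, l, radius):
    """A_l from Eq. 3.69 for a surface potential v0(theta)."""
    integrand = lambda th: v0(th) * legendre(l, math.cos(th)) * math.sin(th)
    return (2 * l + 1) / (2.0 * radius ** l) * simpson(integrand, 0.0, math.pi)

k, R = 2.0, 3.0                                  # arbitrary sample values
v0 = lambda th: k * math.sin(th / 2.0) ** 2      # Eq. 3.70
for l in range(4):
    print(l, interior_coefficient(v0, l, R))
```

The output should be close to A_0 = k/2 and A_1 = −k/(2R), with the higher coefficients vanishing to within the integration error.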


Example 3.7. The potential $V_0(\theta)$ is again specified on the surface of
a sphere of radius R, but this time we are asked to find the potential
outside, assuming there is no charge there.


Solution. In this case it’s the $A_l$'s that must be zero (else V would not
go to zero at $\infty$), so

$$V(r, \theta) = \sum_{l=0}^{\infty} \frac{B_l}{r^{l+1}} P_l(\cos\theta). \tag{3.72}$$

16 M. Boas, footnote 9, Sect. 12.7.

17 This is certainly true whenever $V_0(\theta)$ can be expressed as a polynomial in $\cos\theta$. The
degree of the polynomial tells us the highest $l$ we require, and the leading coefficient
determines the corresponding $A_l$. Subtracting off $A_l R^l P_l(\cos\theta)$ and repeating the process,
we systematically work our way down to $A_0$. Notice that if $V_0$ is an even function of $\cos\theta$,
then only even terms will occur in the sum (and likewise for odd functions).

At the surface of the sphere, we require that

$$V(R, \theta) = \sum_{l=0}^{\infty} \frac{B_l}{R^{l+1}} P_l(\cos\theta) = V_0(\theta).$$

Multiplying by $P_{l'}(\cos\theta)\sin\theta$ and integrating (exploiting, again, the
orthogonality relation 3.68), we have

$$\frac{B_{l'}}{R^{l'+1}}\, \frac{2}{2l'+1} = \int_0^{\pi} V_0(\theta)\,P_{l'}(\cos\theta)\,\sin\theta\,d\theta,$$

or

$$B_l = \frac{2l+1}{2} R^{l+1} \int_0^{\pi} V_0(\theta)\,P_l(\cos\theta)\,\sin\theta\,d\theta. \tag{3.73}$$
0 


Equation 3.72, with the coefficients given by Eq. 3.73, is the solution 
to our problem. 


Example 3.8. An uncharged metal sphere of radius R is placed in an
otherwise uniform electric field $\mathbf{E} = E_0\hat{\mathbf{z}}$. The field will push positive
charge to the “northern” surface of the sphere, and (symmetrically)
negative charge to the “southern” surface (Fig. 3.24). This induced
charge, in turn, distorts the field in the neighborhood of the sphere.
Find the potential in the region outside the sphere.


Solution. The sphere is an equipotential; we might as well set
$V(R, \theta) = 0$. Then by symmetry the entire xy plane is at potential
zero. This time, however, V does not go to zero at large z. In fact, far
from the sphere the field is $E_0\hat{\mathbf{z}}$, and hence

$$V \to -E_0 z + C.$$


Since V = 0 in the equatorial plane, the constant C must be zero.
Accordingly, the boundary conditions for this problem are

(i) V = 0 when r = R,
(ii) $V \to -E_0 r\cos\theta$ for $r \gg R$.        (3.74)

We must fit these boundary conditions with a function of the form
3.65.
The first condition yields

$$A_l R^l + \frac{B_l}{R^{l+1}} = 0,$$

or

$$B_l = -A_l R^{2l+1}, \tag{3.75}$$

so

$$V(r, \theta) = \sum_{l=0}^{\infty} A_l \left(r^l - \frac{R^{2l+1}}{r^{l+1}}\right) P_l(\cos\theta).$$

For $r \gg R$, the second term in parentheses is negligible, and therefore
condition (ii) requires that

$$\sum_{l=0}^{\infty} A_l r^l P_l(\cos\theta) = -E_0 r\cos\theta.$$

Evidently, only one term is present: $l = 1$. In fact, since $P_1(\cos\theta) = \cos\theta$,
we can read off immediately

$A_1 = -E_0$, all other $A_l$'s zero; $B_1 = E_0 R^3$, all other $B_l$'s zero.

Conclusion:

$$V(r, \theta) = -E_0\left(r - \frac{R^3}{r^2}\right)\cos\theta. \tag{3.76}$$

The first term ($-E_0 r\cos\theta$) is due to the external field; the
contribution attributable to the induced charge is

$$E_0 \frac{R^3}{r^2}\cos\theta.$$

If you want to know the induced charge density, it can be calculated
in the usual way:

$$\sigma(\theta) = -\epsilon_0 \left.\frac{\partial V}{\partial r}\right|_{r=R} = \epsilon_0 E_0 \left.\left(1 + 2\frac{R^3}{r^3}\right)\cos\theta\,\right|_{r=R} = 3\epsilon_0 E_0 \cos\theta. \tag{3.77}$$

As expected, it is positive in the “northern” hemisphere ($0 \le \theta \le \pi/2$)
and negative in the “southern” ($\pi/2 \le \theta \le \pi$).
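
A quick numerical sanity check of Eqs. 3.76 and 3.77 is sketched below. The function names, the one-sided finite difference used for the radial derivative (chosen because Eq. 3.76 holds only for r ≥ R), and the sample values of E_0 and R are all assumptions made for illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m

def potential(r, theta, e0, radius):
    """Eq. 3.76: potential outside the grounded sphere (r >= radius)."""
    return -e0 * (r - radius ** 3 / r ** 2) * math.cos(theta)

def induced_sigma(theta, e0, radius, h=1e-6):
    """Surface charge -eps0 dV/dr at r = R, via a forward difference in r."""
    dr = radius * h
    dv_dr = (potential(radius + dr, theta, e0, radius)
             - potential(radius, theta, e0, radius)) / dr
    return -EPS0 * dv_dr

e0, R, theta = 100.0, 0.5, 0.3               # arbitrary sample values (SI units)
print(potential(R, theta, e0, R))            # should vanish on the sphere
print(induced_sigma(theta, e0, R))           # numerical estimate
print(3.0 * EPS0 * e0 * math.cos(theta))     # Eq. 3.77
```

The forward difference is only first-order accurate, so agreement with Eq. 3.77 to a few parts in a million is about what should be expected with this step size.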


Example 3.9. A specified charge density $\sigma_0(\theta)$ is glued over the surface
of an empty spherical shell of radius R. Find the resulting potential
inside and outside the sphere.


Solution. You could, of course, do this by direct integration:

$$V = \frac{1}{4\pi\epsilon_0} \int \frac{\sigma_0}{\mathscr{r}}\, da,$$

but separation of variables is often easier. For the interior region, we
have

$$V(r, \theta) = \sum_{l=0}^{\infty} A_l r^l P_l(\cos\theta) \quad (r \le R) \tag{3.78}$$

(no $B_l$ terms: they blow up at the center); in the exterior region,

$$V(r, \theta) = \sum_{l=0}^{\infty} \frac{B_l}{r^{l+1}} P_l(\cos\theta) \quad (r \ge R) \tag{3.79}$$

(no $A_l$ terms: they don’t go to zero at infinity). These two functions
must be joined together by the appropriate boundary conditions at the
surface itself. First, the potential is continuous at r = R (Eq. 2.34):

$$\sum_{l=0}^{\infty} A_l R^l P_l(\cos\theta) = \sum_{l=0}^{\infty} \frac{B_l}{R^{l+1}} P_l(\cos\theta). \tag{3.80}$$

It follows that the coefficients of like Legendre polynomials are equal:

$$B_l = A_l R^{2l+1}. \tag{3.81}$$


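(To prove that formally, multiply both sides of Eq. 3.80 by
$P_{l'}(\cos\theta)\sin\theta$ and integrate from 0 to $\pi$, using the orthogonality
relation 3.68.) Second, the radial derivative of V suffers a discontinuity
at the surface (Eq. 2.36):

$$\left.\left(\frac{\partial V_{\text{out}}}{\partial r} - \frac{\partial V_{\text{in}}}{\partial r}\right)\right|_{r=R} = -\frac{1}{\epsilon_0}\sigma_0(\theta). \tag{3.82}$$

Thus

$$-\sum_{l=0}^{\infty} (l+1)\frac{B_l}{R^{l+2}} P_l(\cos\theta) - \sum_{l=0}^{\infty} l A_l R^{l-1} P_l(\cos\theta) = -\frac{1}{\epsilon_0}\sigma_0(\theta),$$

or, using Eq. 3.81,

$$\sum_{l=0}^{\infty} (2l+1) A_l R^{l-1} P_l(\cos\theta) = \frac{1}{\epsilon_0}\sigma_0(\theta). \tag{3.83}$$

From here, the coefficients can be determined using Fourier’s trick:

$$A_l = \frac{1}{2\epsilon_0 R^{l-1}} \int_0^{\pi} \sigma_0(\theta)\,P_l(\cos\theta)\,\sin\theta\,d\theta. \tag{3.84}$$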

Equations 3.78 and 3.79 constitute the solution to our problem, with 
the coefficients given by Eqs. 3.81 and 3.84. 
For instance, if

$$\sigma_0(\theta) = k\cos\theta = k P_1(\cos\theta), \tag{3.85}$$

for some constant k, then all the $A_l$’s are zero except for $l = 1$, and

$$A_1 = \frac{k}{2\epsilon_0} \int_0^{\pi} \left[P_1(\cos\theta)\right]^2 \sin\theta\,d\theta = \frac{k}{3\epsilon_0}.$$

The potential inside the sphere is therefore

$$V(r, \theta) = \frac{k}{3\epsilon_0}\, r\cos\theta \quad (r \le R), \tag{3.86}$$

whereas outside the sphere,

$$V(r, \theta) = \frac{kR^3}{3\epsilon_0}\, \frac{1}{r^2}\cos\theta \quad (r \ge R). \tag{3.87}$$

In particular, if $\sigma_0(\theta)$ is the induced charge on a metal sphere in
an external field $E_0\hat{\mathbf{z}}$, so that $k = 3\epsilon_0 E_0$ (Eq. 3.77), then the potential
inside is $E_0 r\cos\theta = E_0 z$, and the field is $-E_0\hat{\mathbf{z}}$, which is exactly right
to cancel off the external field, as of course it should be. Outside the
sphere the potential due to this surface charge is

$$E_0 \frac{R^3}{r^2}\cos\theta,$$

consistent with our conclusion in Ex. 3.8.


Problem 3.19. Derive $P_3(x)$ from the Rodrigues formula, and check that $P_3(\cos\theta)$
satisfies the angular equation (3.60) for $l = 3$. Check that $P_3$ and $P_1$ are orthogonal by explicit
integration.


Problem 3.20.

(a) Suppose the potential is a constant $V_0$ over the surface of the sphere. Use the results of
Exs. 3.6 and 3.7 to find the potential inside and outside the sphere. (Of course, you know the
answers in advance; this is just a consistency check on the method.)

(b) Find the potential inside and outside a spherical shell that carries a uniform surface charge
$\sigma_0$, using the results of Ex. 3.9.


Problem 3.21. The potential at the surface of a sphere (radius R) is given by

$$V_0 = k\cos 3\theta,$$

where k is a constant. Find the potential inside and outside the sphere, as well as the surface
charge density $\sigma(\theta)$ on the sphere. (Assume there's no charge inside or outside the sphere.)


Problem 3.22. Suppose the potential $V_0(\theta)$ at the surface of a sphere is specified, and there
is no charge inside or outside the sphere. Show that the charge density on the sphere is given by

$$\sigma(\theta) = \frac{\epsilon_0}{2R} \sum_{l=0}^{\infty} (2l+1)^2\, C_l\, P_l(\cos\theta), \tag{3.88}$$

where

$$C_l = \int_0^{\pi} V_0(\theta)\,P_l(\cos\theta)\,\sin\theta\,d\theta. \tag{3.89}$$


Problem 3.23. Find the potential outside a charged metal sphere (charge Q, radius R) placed
in an otherwise uniform electric field $\mathbf{E}_0$. Explain clearly where you are setting the zero of
potential.




Problem 3.24. In Prob. 2.26, you found the potential on the axis of a uniformly charged disk:

$$V(r, 0) = \frac{\sigma}{2\epsilon_0}\left(\sqrt{r^2 + R^2} - r\right).$$

(a) Use this, together with the fact that $P_l(1) = 1$, to evaluate the first three terms in the
expansion (Eq. 3.72) for the potential of the disk at points off the axis, assuming r > R.

(b) Find the potential for r < R by the same method, using Eq. 3.66. [Note: You must break
the interior region up into two hemispheres, above and below the disk. Do not assume the
coefficients $A_l$ are the same in both hemispheres.]


Problem 3.25. A spherical shell of radius R carries a uniform surface charge $\sigma_0$ on the
“northern” hemisphere and a uniform surface charge $-\sigma_0$ on the “southern” hemisphere. Find the
potential inside and outside the sphere, calculating the coefficients explicitly up to $A_6$ and $B_6$.


Problem 3.26. Solve Laplace’s equation by separation of variables in cylindrical coordinates,
assuming there is no dependence on z (cylindrical symmetry). [Make sure you find all solutions
to the radial equation; in particular, your result must accommodate the case of an infinite line
charge, for which (of course) we already know the answer.]


Problem 3.27. Find the potential outside an infinitely long metal pipe, of radius R, placed at
right angles to an otherwise uniform electric field $\mathbf{E}_0$. Find the surface charge induced on the
pipe. [Use your result from Prob. 3.26.]


Problem 3.28. Charge density

$$\sigma(\phi) = a\sin 5\phi$$

(where a is a constant) is glued over the surface of an infinite cylinder of radius R (Fig. 3.25).
Find the potential inside and outside the cylinder. [Use your result from Prob. 3.26.]


3.4 Multipole Expansion 


3.4.1 Approximate Potentials at Large Distances 


If you are very far away from a localized charge distribution, it “looks” like
a point charge, and the potential is, to good approximation, $(1/4\pi\epsilon_0)Q/r$,
where Q is the total charge. We have often used this as a check on formulas
for V. But what if Q is zero? You might reply that the potential is then
approximately zero, and of course, you're right, in a sense (indeed, the
potential at large r is pretty small even if Q is not zero). But we’re looking
for something a bit more informative than that.


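Example 3.10. A (physical) electric dipole consists of two equal and
opposite charges ($\pm q$) separated by a distance d. Find the approximate
potential at points far from the dipole.


Solution. Let $\mathscr{r}_-$ be the distance from $-q$ and $\mathscr{r}_+$ the distance from $+q$
(Fig. 3.26). Then

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\left(\frac{q}{\mathscr{r}_+} - \frac{q}{\mathscr{r}_-}\right),$$

and (from the law of cosines)

$$\mathscr{r}_\pm^2 = r^2 + (d/2)^2 \mp rd\cos\theta = r^2\left(1 \mp \frac{d}{r}\cos\theta + \frac{d^2}{4r^2}\right).$$

We're interested in the distant zone, where $r \gg d$, so the third term is
negligible, and the binomial expansion yields

$$\frac{1}{\mathscr{r}_\pm} \cong \frac{1}{r}\left(1 \mp \frac{d}{r}\cos\theta\right)^{-1/2} \cong \frac{1}{r}\left(1 \pm \frac{d}{2r}\cos\theta\right).$$

Thus

$$\frac{1}{\mathscr{r}_+} - \frac{1}{\mathscr{r}_-} \cong \frac{d}{r^2}\cos\theta,$$

and hence

$$V(\mathbf{r}) \cong \frac{1}{4\pi\epsilon_0}\, \frac{qd\cos\theta}{r^2}. \tag{3.90}$$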


The potential of a dipole goes like $1/r^2$ at large r; as we might
have anticipated, it falls off more rapidly than the potential of a point
charge. If we put together a pair of equal and opposite dipoles to make
a quadrupole, the potential goes like $1/r^3$; for back-to-back quadrupoles
(an octopole), it goes like $1/r^4$; and so on. Figure 3.27 summarizes
this hierarchy; for completeness I have included the electric monopole
(point charge), whose potential, of course, goes like 1/r.

[Fig. 3.27: Monopole (V ~ 1/r), Dipole (V ~ 1/r²), Quadrupole (V ~ 1/r³), Octopole (V ~ 1/r⁴)]


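Example 3.10 involves a specific charge configuration. I propose now to
develop a systematic expansion for the potential of any localized charge
distribution, in powers of 1/r. Figure 3.28 defines the relevant variables; the
potential at $\mathbf{r}$ is given by

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{1}{\mathscr{r}}\, \rho(\mathbf{r}')\, d\tau'. \tag{3.91}$$

Using the law of cosines,

$$\mathscr{r}^2 = r^2 + (r')^2 - 2rr'\cos\alpha = r^2\left[1 + \left(\frac{r'}{r}\right)^2 - 2\left(\frac{r'}{r}\right)\cos\alpha\right],$$

where $\alpha$ is the angle between $\mathbf{r}$ and $\mathbf{r}'$. Thus

$$\mathscr{r} = r\sqrt{1 + \epsilon}, \tag{3.92}$$

with

$$\epsilon \equiv \left(\frac{r'}{r}\right)\left(\frac{r'}{r} - 2\cos\alpha\right).$$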


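For points well outside the charge distribution, $\epsilon$ is much less than 1, and
this invites a binomial expansion:

$$\frac{1}{\mathscr{r}} = \frac{1}{r}(1 + \epsilon)^{-1/2} = \frac{1}{r}\left(1 - \frac{1}{2}\epsilon + \frac{3}{8}\epsilon^2 - \frac{5}{16}\epsilon^3 + \cdots\right), \tag{3.93}$$

or, in terms of r, $r'$, and $\alpha$,

$$\frac{1}{\mathscr{r}} = \frac{1}{r}\left[1 - \frac{1}{2}\left(\frac{r'}{r}\right)\left(\frac{r'}{r} - 2\cos\alpha\right) + \frac{3}{8}\left(\frac{r'}{r}\right)^2\left(\frac{r'}{r} - 2\cos\alpha\right)^2 - \frac{5}{16}\left(\frac{r'}{r}\right)^3\left(\frac{r'}{r} - 2\cos\alpha\right)^3 + \cdots\right]$$

$$= \frac{1}{r}\left[1 + \left(\frac{r'}{r}\right)(\cos\alpha) + \left(\frac{r'}{r}\right)^2\left(\frac{3\cos^2\alpha - 1}{2}\right) + \left(\frac{r'}{r}\right)^3\left(\frac{5\cos^3\alpha - 3\cos\alpha}{2}\right) + \cdots\right].$$

In the last step, I have collected together like powers of $(r'/r)$; surprisingly,
their coefficients (the terms in parentheses) are Legendre polynomials! The
remarkable result¹⁸ is that

$$\frac{1}{\mathscr{r}} = \frac{1}{r} \sum_{n=0}^{\infty} \left(\frac{r'}{r}\right)^n P_n(\cos\alpha). \tag{3.94}$$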


Substituting this back into Eq. 3.91, and noting that r is a constant, as far
as the integration is concerned, I conclude that

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0} \sum_{n=0}^{\infty} \frac{1}{r^{n+1}} \int (r')^n P_n(\cos\alpha)\, \rho(\mathbf{r}')\, d\tau', \tag{3.95}$$

or, more explicitly,

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\left[\frac{1}{r}\int \rho(\mathbf{r}')\,d\tau' + \frac{1}{r^2}\int r'\cos\alpha\, \rho(\mathbf{r}')\,d\tau' + \frac{1}{r^3}\int (r')^2\left(\frac{3}{2}\cos^2\alpha - \frac{1}{2}\right)\rho(\mathbf{r}')\,d\tau' + \cdots\right]. \tag{3.96}$$

This is the desired result: the multipole expansion of V in powers of
1/r. The first term (n = 0) is the monopole contribution (it goes like 1/r);
the second (n = 1) is the dipole (it goes like $1/r^2$); the third is quadrupole;
the fourth octopole; and so on. Remember that $\alpha$ is the angle between $\mathbf{r}$
and $\mathbf{r}'$, so the integrals depend on the direction to the field point. If you are
interested in the potential along the $z'$-axis (or, putting it the other way
around, if you orient your $\mathbf{r}'$ coordinates so the $z'$-axis lies along $\mathbf{r}$), then
$\alpha$ is the usual polar angle $\theta'$.

As it stands, Eq. 3.95 is exact, but it is useful primarily as an approx- 
imation scheme: the lowest nonzero term in the expansion provides the 
approximate potential at large r, and the successive terms tell us how to 
improve the approximation if greater precision is required. 
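
The expansion of 1/𝓇 in Legendre polynomials (Eq. 3.94), on which the multipole series rests, is easy to test numerically. The sketch below compares truncated sums with the exact inverse separation from the law of cosines. The function names are hypothetical, the sample geometry is arbitrary, and the Legendre polynomials are generated with the standard three-term recurrence rather than the Rodrigues formula.

```python
import math

def legendre(l, x):
    """P_l(x) from the recurrence (n+1) P_{n+1} = (2n+1) x P_n - n P_{n-1}."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    return p

def inverse_separation_exact(r, r_prime, alpha):
    """1 / script-r from the law of cosines."""
    return 1.0 / math.sqrt(r * r + r_prime ** 2 - 2.0 * r * r_prime * math.cos(alpha))

def inverse_separation_series(r, r_prime, alpha, n_max):
    """Right-hand side of Eq. 3.94, truncated after the term n = n_max."""
    ratio = r_prime / r
    return sum(ratio ** n * legendre(n, math.cos(alpha)) for n in range(n_max + 1)) / r

r, r_prime, alpha = 2.0, 0.5, 1.1   # arbitrary sample values with r' < r
exact = inverse_separation_exact(r, r_prime, alpha)
for n_max in (0, 1, 2, 5, 30):
    print(n_max, abs(inverse_separation_series(r, r_prime, alpha, n_max) - exact))
```

Each additional term reduces the error by roughly a factor of r′/r, which is the sense in which successive multipoles refine the approximation at large r.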


Problem 3.29. A sphere of radius R, centered at the origin, carries charge density

$$\rho(r, \theta) = k\,\frac{R}{r^2}(R - 2r)\sin\theta,$$

where k is a constant, and r, $\theta$ are the usual spherical coordinates. Find the approximate
potential for points on the z-axis, far from the sphere.

18 This suggests a second way of defining the Legendre polynomials (the first being the
Rodrigues formula); $1/\mathscr{r}$ is called the generating function for Legendre polynomials.


Problem 3.30. A circular ring in the xy plane (radius R, centered at the origin) carries a
uniform line charge $\lambda$. Find the first three terms (n = 0, 1, 2) in the multipole expansion
for $V(r, \theta)$.


3.4.2 The Monopole and Dipole Terms 


Ordinarily, the multipole expansion is dominated (at large r) by the
monopole term:

$$V_{\text{mon}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\, \frac{Q}{r}, \tag{3.97}$$

where $Q = \int \rho\, d\tau$ is the total charge of the configuration. This is just what
we expect for the approximate potential at large distances from the charge.
For a point charge at the origin, $V_{\text{mon}}$ is the exact potential, not merely a
first approximation at large r; in this case, all the higher multipoles vanish.

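If the total charge is zero, the dominant term in the potential will be the
dipole (unless, of course, it also vanishes):

$$V_{\text{dip}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\, \frac{1}{r^2} \int r'\cos\alpha\, \rho(\mathbf{r}')\, d\tau'.$$

Since $\alpha$ is the angle between $\mathbf{r}'$ and $\mathbf{r}$ (Fig. 3.28),

$$r'\cos\alpha = \hat{\mathbf{r}} \cdot \mathbf{r}',$$

and the dipole potential can be written more succinctly:

$$V_{\text{dip}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\, \frac{1}{r^2}\, \hat{\mathbf{r}} \cdot \int \mathbf{r}'\rho(\mathbf{r}')\, d\tau'.$$

This integral (which does not depend on $\mathbf{r}$) is called the dipole moment of
the distribution:

$$\mathbf{p} \equiv \int \mathbf{r}'\rho(\mathbf{r}')\, d\tau', \tag{3.98}$$

and the dipole contribution to the potential simplifies to

$$V_{\text{dip}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\, \frac{\mathbf{p} \cdot \hat{\mathbf{r}}}{r^2}. \tag{3.99}$$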

The dipole moment is determined by the geometry (size, shape, and
density) of the charge distribution. Equation 3.98 translates in the usual way
(Section 2.1.4) for point, line, and surface charges. Thus, the dipole moment
of a collection of point charges is

$$\mathbf{p} = \sum_{i=1}^{n} q_i \mathbf{r}'_i. \tag{3.100}$$

[Fig. 3.29]

For a physical dipole (equal and opposite charges, $\pm q$),

$$\mathbf{p} = q\mathbf{r}'_+ - q\mathbf{r}'_- = q(\mathbf{r}'_+ - \mathbf{r}'_-) = q\mathbf{d}, \tag{3.101}$$

where $\mathbf{d}$ is the vector from the negative charge to the positive one
(Fig. 3.29).

Is this consistent with what we got in Ex. 3.10? Yes: if you put Eq. 3.101
into Eq. 3.99, you recover Eq. 3.90. Notice, however, that this is only the
approximate potential of the physical dipole; evidently there are higher
multipole contributions. Of course, as you go farther and farther away,
$V_{\text{dip}}$ becomes a better and better approximation, since the higher terms
die off more rapidly with increasing r. By the same token, at a fixed r the
dipole approximation improves as you shrink the separation d. To construct
a perfect (ideal, pure, point) dipole whose potential is given exactly by
Eq. 3.99, you’d have to let d approach zero. Unfortunately, you then lose the
dipole term too, unless you simultaneously arrange for q to go to infinity.
A physical dipole becomes a pure dipole, then, in the rather artificial limit
$d \to 0$, $q \to \infty$, with the product qd = p held fixed. When someone uses
the word “dipole,” you can’t always tell whether they mean a physical dipole
(with finite separation between the charges) or an ideal (point) dipole. If in
doubt, assume that d is small enough (compared to r) that you can safely
apply Eq. 3.99.
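
To get a feel for how good the dipole approximation is, one can compare the exact potential of a physical dipole with Eq. 3.99 at several distances. This is a minimal sketch: the function names, the placement of the charges at z = ±d/2, and the numerical values of q and d are assumptions chosen for illustration.

```python
import math

K = 8.9875517923e9  # 1 / (4 pi eps0) in SI units

def v_exact(r, theta, q, d):
    """Exact potential of charges +q at z = +d/2 and -q at z = -d/2."""
    r_plus = math.sqrt(r * r + (d / 2.0) ** 2 - r * d * math.cos(theta))
    r_minus = math.sqrt(r * r + (d / 2.0) ** 2 + r * d * math.cos(theta))
    return K * q * (1.0 / r_plus - 1.0 / r_minus)

def v_dipole(r, theta, q, d):
    """Dipole term alone (Eq. 3.99 with p = q d along z)."""
    return K * q * d * math.cos(theta) / r ** 2

def relative_error(r, theta, q, d):
    exact = v_exact(r, theta, q, d)
    return abs(v_dipole(r, theta, q, d) - exact) / abs(exact)

q, d = 1.0e-9, 0.01   # arbitrary sample values: 1 nC, 1 cm
for multiple in (2, 10, 100):
    print(multiple, relative_error(multiple * d, 0.0, q, d))
```

On the axis the relative error should shrink roughly like (d/r)², reflecting the fact that for this symmetric arrangement the next surviving term in the expansion is two powers of 1/r beyond the dipole.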

Dipole moments are vectors, and they add accordingly: if you have two
dipoles, $\mathbf{p}_1$ and $\mathbf{p}_2$, the total dipole moment is $\mathbf{p}_1 + \mathbf{p}_2$. For instance, with
four charges at the corners of a square, as shown in Fig. 3.30, the net
dipole moment is zero. You can see this by combining the charges in pairs
(vertically, ↓ + ↑ = 0, or horizontally, → + ← = 0) or by adding up the
four contributions individually, using Eq. 3.100. This is a quadrupole, as I
indicated earlier, and its potential is dominated by the quadrupole term in
the multipole expansion.

Problem 3.31. Four particles (one of charge q, one of charge 3q, and two of charge −2q)
are placed as shown in Fig. 3.31, each a distance a from the origin. Find a simple approximate
formula for the potential, valid at points far from the origin. (Express your answer in spherical
coordinates.)

[Fig. 3.31]


Problem 3.32. In Ex. 3.9, we derived the exact potential for a spherical shell of radius R, which
carries a surface charge $\sigma = k\cos\theta$.

(a) Calculate the dipole moment of this charge distribution.
(b) Find the approximate potential, at points far from the sphere, and compare the exact answer
(Eq. 3.87). What can you conclude about the higher multipoles?


Problem 3.33. For the dipole in Ex. 3.10, expand $1/\mathscr{r}_\pm$ to order $(d/r)^3$, and use this to
determine the quadrupole and octopole terms in the potential.


3.4.3 Origin of Coordinates in Multipole Expansions 


I mentioned earlier that a point charge at the origin constitutes a “pure”
monopole. If it is not at the origin, it’s no longer a pure monopole. For
instance, the charge in Fig. 3.32 has a dipole moment $\mathbf{p} = qd\,\hat{\mathbf{y}}$, and a
corresponding dipole term in its potential. The monopole potential $(1/4\pi\epsilon_0)q/r$
is not quite correct for this configuration; rather, the exact potential is
$(1/4\pi\epsilon_0)q/\mathscr{r}$. The multipole expansion is, remember, a series in inverse
powers of r (the distance from the origin), and when we expand $1/\mathscr{r}$, we
get all powers, not just the first.

So moving the origin (or, what amounts to the same thing, moving the 
charge) can radically alter a multipole expansion. The monopole moment 
Q does not change, since the total charge is obviously independent of the 
coordinate system. (In Fig. 3.32, the monopole term was unaffected when 
we moved g away from the origin — it’s just that it was no longer the whole 
story: a dipole term — and for that matter all higher poles — appeared as 
well.) Ordinarily, the dipole moment does change when you shift the origin, 
but there is an important exception: if the total charge is zero, then the dipole 
moment is independent of the choice of origin. For suppose we displace the 
origin by an amount a (Fig. 3.33). The new dipole moment is then 
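
\[
\bar{\mathbf{p}} = \int \bar{\mathbf{r}}'\,\rho(\mathbf{r}')\,d\tau'
= \int (\mathbf{r}' - \mathbf{a})\,\rho(\mathbf{r}')\,d\tau'
= \int \mathbf{r}'\,\rho(\mathbf{r}')\,d\tau' - \mathbf{a}\int \rho(\mathbf{r}')\,d\tau'
= \mathbf{p} - Q\mathbf{a}.
\]


Fig. 3.34  (a)  (b)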


In particular, if Q = 0, then p̄ = p. So if someone asks for the dipole moment
in Fig. 3.34a, you can answer with confidence “qd,” but if you’re asked for
the dipole moment in Fig. 3.34b, the appropriate response would be “With 
respect to what origin?” 
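
The rule p̄ = p − Qa is easy to test on point charges. Here is a small Python sketch, offered only as an illustration: the helper names, the charge positions, and the shift vector are all assumptions chosen for the example, not values taken from the figures.

```python
def total_charge(charges):
    """Monopole moment Q: the sum of the charges."""
    return sum(q for q, _ in charges)

def dipole_moment(charges, origin=(0.0, 0.0, 0.0)):
    """Dipole moment p = sum of q_i (r_i - origin), for point charges."""
    p = [0.0, 0.0, 0.0]
    for q, position in charges:
        for k in range(3):
            p[k] += q * (position[k] - origin[k])
    return tuple(p)

q, d = 1.0, 2.0
neutral_pair = [(+q, (0.0, 0.0, d / 2)), (-q, (0.0, 0.0, -d / 2))]  # Q = 0
charged_pair = [(+q, (0.0, 0.0, d / 2)), (+q, (0.0, 0.0, -d / 2))]  # Q = 2q

shift = (1.0, -2.0, 0.5)  # displacement a of the origin (arbitrary)

for name, charges in (("neutral", neutral_pair), ("charged", charged_pair)):
    p_old = dipole_moment(charges)
    p_new = dipole_moment(charges, origin=shift)
    Q = total_charge(charges)
    # Prediction from the formula above: p_new = p_old - Q a
    predicted = tuple(p_old[k] - Q * shift[k] for k in range(3))
    print(name, p_old, p_new, predicted)
```

For the neutral pair the two dipole moments agree (both are qd in the z direction); for the charged pair the answer depends on the origin, exactly as p − Qa predicts.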


Problem 3.34. Two point charges, 3q and −q, are separated by a distance a. For each of the
arrangements in Fig. 3.35, find (i) the monopole moment, (ii) the dipole moment (with respect 
to the origin of coordinates), and (iii) the approximate potential (in spherical coordinates) at 
large r (include both the monopole and dipole contributions). 


(a) (b) (c) 
Fig. 3.35 
3.4.4 The Electric Field of a Dipole 


So far we have worked only with potentials. Now I would like to calculate 
the electric field of a (perfect) dipole. If we choose coordinates so that p is
at the origin and points in the z direction (Fig. 3.36), then the potential at
r, θ is (Eq. 3.99)

\[
V_{\text{dip}}(r,\theta) = \frac{\hat{\mathbf{r}}\cdot\mathbf{p}}{4\pi\epsilon_0 r^2} = \frac{p\cos\theta}{4\pi\epsilon_0 r^2}. \tag{3.102}
\]


To get the field, we take the negative gradient of V: 


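\[
\begin{aligned}
E_r &= -\frac{\partial V}{\partial r} = \frac{2p\cos\theta}{4\pi\epsilon_0 r^3},\\
E_\theta &= -\frac{1}{r}\frac{\partial V}{\partial\theta} = \frac{p\sin\theta}{4\pi\epsilon_0 r^3},\\
E_\phi &= -\frac{1}{r\sin\theta}\frac{\partial V}{\partial\phi} = 0.
\end{aligned}
\]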

Thus 
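\[
\mathbf{E}_{\text{dip}}(r,\theta) = \frac{p}{4\pi\epsilon_0 r^3}\,\bigl(2\cos\theta\,\hat{\mathbf{r}} + \sin\theta\,\hat{\boldsymbol{\theta}}\bigr). \tag{3.103}
\]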


This formula makes explicit reference to a particular coordinate system 
(spherical) and assumes a particular orientation for p (along z). It can be 
recast in a coordinate-free form, analogous to the potential in Eq. 3.99 — 
see Prob. 3.38. 

Notice that the dipole field falls off as the inverse cube of r; the monopole 
field (Q/4πε₀r²) r̂ goes as the inverse square, of course. Quadrupole fields go
like 1/r⁴, octopole like 1/r⁵, and so on. (This reflects the fact that monopole
potentials fall off like 1/r, dipole like 1/r², quadrupole like 1/r³, and so on;
the gradient introduces another factor of 1/r.) 
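
As an illustration of the inverse-cube behavior, the following Python sketch evaluates Eq. 3.103 and compares it, on the z axis, with the exact field of a "physical" dipole (charges ±q separated by d). It is only a sketch under stated assumptions: the function names and the sample values of q and d are made up for the example.

```python
import math

EPS0 = 8.8541878128e-12            # vacuum permittivity, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # 1/(4 pi eps0)

def pure_dipole_field(p, r, theta):
    """Return (E_r, E_theta) for a pure dipole p along z, per Eq. 3.103."""
    prefactor = K * p / r**3
    return 2.0 * prefactor * math.cos(theta), prefactor * math.sin(theta)

def physical_dipole_axis_field(q, d, z):
    """Exact E_z on the z axis (z > d/2) for +q at z = +d/2 and -q at z = -d/2."""
    return K * q * (1.0 / (z - d / 2)**2 - 1.0 / (z + d / 2)**2)

q, d = 1.0e-9, 1.0e-3   # assumed sample values
p = q * d

# Doubling r reduces the pure dipole field by a factor 2**3 = 8.
e_near, _ = pure_dipole_field(p, 0.1, 0.0)
e_far, _ = pure_dipole_field(p, 0.2, 0.0)
falloff = e_near / e_far

# On the axis, (physical field)/(pure field) approaches 1 as z/d grows.
ratios = {}
for multiple in (2, 10, 100):
    z = multiple * d
    pure, _ = pure_dipole_field(p, z, 0.0)
    ratios[multiple] = physical_dipole_axis_field(q, d, z) / pure
print(falloff, ratios)
```

The ratio differs noticeably from 1 at z = 2d but is within about half a percent at z = 10d, which is the sense in which Eq. 3.103 is a large-r approximation to a physical dipole.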

Figure 3.37a shows the field lines of a “pure” dipole (Eq. 3.103). For 
comparison, I have also sketched the field lines for a “physical” dipole, 
in Fig. 3.37b. Notice how similar the two pictures become if you blot 
out the central region; up close, however, they are entirely different. Only 
for points r ≫ d does Eq. 3.103 represent a valid approximation to
the field of a physical dipole. As I mentioned earlier, this régime can be
reached either by going to large r or by squeezing the charges very close
together.¹⁹


19 Even in the limit, there remains an infinitesimal region at the origin where the field of a 
physical dipole points in the “wrong” direction, as you can see by “walking” down the z- 
axis in Fig. 3.35b. If you want to explore this subtle and important point, see Prob. 3.52. 
Fig. 3.37  (a) Field of a “pure” dipole  (b) Field of a “physical” dipole


Problem 3.35. A “pure” dipole p is situated at the origin, pointing in the z direction. 


(a) What is the force on a point charge q at (a, 0, 0) (Cartesian coordinates)? 
(b) What is the force on q at (0, 0, a)?
(c) How much work would it take to move q from (a, 0, 0) to (0, 0, a)? 


Problem 3.36. Three point charges are located as shown in Fig. 3.38, each a distance a from 
the origin. Find the approximate electric field at points far from the origin. Express your answer 
in spherical coordinates, and include the two lowest orders in the multipole expansion. 



Fig. 3.38 


Problem 3.37. A solid sphere, radius R, is centered at the origin. The “northern” hemisphere 
carries a uniform charge density ρ₀, and the “southern” hemisphere a uniform charge density
−ρ₀. Find the approximate field E(r, θ) for points far from the sphere (r ≫ R).


Problem 3.38. Show that the electric field of a (perfect) dipole (Eq. 3.103) can be written in 
the coordinate-free form 


\[
\mathbf{E}_{\text{dip}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\frac{1}{r^3}\left[3(\mathbf{p}\cdot\hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{p}\right]. \tag{3.104}
\]


More Problems on Chapter 3


Problem 3.39. In Section 3.1.4 I proved that the electrostatic potential at any point P in a
charge-free region is equal to its average value over any spherical surface (radius R) centered
at P. Here's an alternative argument that does not rely on Coulomb's law, only on Laplace's
equation. We might as well set the origin at P. Let V_ave(R) be the average; first show that

\[
\frac{dV_{\text{ave}}}{dR} = \frac{1}{4\pi R^2}\oint \nabla V\cdot d\mathbf{a}
\]

(note that the R² in da cancels the 1/R² out front, so the only dependence on R is in V
itself). Now use the divergence theorem, and conclude that if V satisfies Laplace's equation,²⁰
then V_ave(R) = V_ave(0) = V(P), for all R.


Problem 3.40. Here's an alternative derivation of Eq. 3.10 (the surface charge density induced 
on a grounded conducting plane by a point charge q a distance d above the plane). This
approach²¹ (which generalizes to many other problems) does not rely on the method of images.
The total field is due in part to q, and in part to the induced surface charge. Write down the
z components of these fields, in terms of q and the as-yet-unknown σ(x, y), just below
the surface. The sum must be zero, of course, because this is inside a conductor. Use that to
determine σ.


Problem 3.41. Two infinite parallel grounded conducting planes are held a distance a apart. 
A point charge q is placed in the region between them, a distance x from one plate. Find the 
force²² on q. Check that your answer is correct for the special cases a → ∞ and x = a/2.


Problem 3.42. Consider two conducting spheres, each of radius R and charge Q; the distance 
between their centers is a > 2 R. The force between them was calculated in the mid-nineteenth 
century, by the method of images. Explain how to do it (using Ex. 3.2 and the strategy of 
Prob. 3.41). Show how to determine the image charges q₁, q₂, q₃, ..., and their respective
distances from the centers b₁ = 0, b₂, b₃, ..., but don't bother with the calculational details
(we'll save those for the next problem). Is the force always repulsive, or (as in Prob. 3.11) is there
a critical separation below which the two spheres attract?²³


Problem 3.43. 


(a) Implement the procedure you developed in the previous problem numerically, keeping (let
us say) 10 image charges in each sphere. Plot the force, from a = 2R up to a = 10R, and
compare the graph you would get if the charge Q were uniformly distributed over each
spherical surface.

(b) For the special case a = 2R (spheres just touching) derive the explicit formula for bₙ
and qₙ (the latter as a multiple of q₁). Find q₁ (as a multiple of Q), and the exact force of
repulsion between the spheres (Mathematica can sum the infinite series). The answer was
obtained by Kelvin in 1853:

\[
F = \frac{1}{4\pi\epsilon_0}\frac{Q^2}{4R^2}\,\frac{4\ln 2 - 1}{6(\ln 2)^2}.
\]


20 I thank Ted Jacobson for suggesting this proof.

21 See J. L. R. Marrero, Am. J. Phys. 78, 639 (2010).

22 Obtaining the induced surface charge is not so easy. See B. G. Dick, Am. J. Phys. 41, 1289
(1973); M. Zahn, Am. J. Phys. 44, 1132 (1976); J. Pleines and S. Mahajan, Am. J. Phys. 45,
868 (1977); and Prob. 3.55 below.

23 In the general case (unequal radii and unequal charges) there is almost always such a critical
separation distance. For more on this problem see J. A. Soules, Am. J. Phys. 58, 1195 (1990);
J. Lekner, Proc. R. Soc. A 468, 2829 (2012) and Am. J. Phys. 84, 474 (2016).


Problem 3.44. Two long straight wires, carrying opposite uniform line charges ±λ, are
situated on either side of a long conducting cylinder (Fig. 3.39). The cylinder (which carries no 
net charge) has radius R, and the wires are a distance a from the axis. Find the potential. 



Fig. 3.39 


\[
\left[\text{Answer: } V(s,\phi) = \frac{\lambda}{4\pi\epsilon_0}\ln\left\{\frac{(s^2+a^2+2sa\cos\phi)\,[(sa/R)^2+R^2-2sa\cos\phi]}{(s^2+a^2-2sa\cos\phi)\,[(sa/R)^2+R^2+2sa\cos\phi]}\right\}\right]
\]


Problem 3.45. Buckminsterfullerene is a molecule of 60 carbon atoms arranged like the
stitching on a soccer ball. It may be approximated as a conducting spherical shell of radius
R = 3.5 Å. A nearby electron would be attracted, according to Prob. 3.10, so it is not surprising
that the ion C₆₀⁻ exists. (Imagine that the electron, on average, smears itself out uniformly
over the surface.) But how about a second electron? At large distances it would be repelled by
the ion, obviously, but at a certain distance r (from the center), the net force is zero, and closer
than this it would be attracted. So an electron with enough energy to get in that close should
bind.


(a) Find r, in Å. [You'll have to do it numerically.]

(b) How much energy (in electron volts) would it take to push an electron in (from infinity) to
the point r?
[Incidentally, the C₆₀²⁻ ion has been observed.]²⁴


Problem 3.46. You can use the superposition principle to combine solutions obtained by 
separation of variables. For example, in Prob. 3.18 you found the potential inside a cubical box, 
if five faces are grounded and the sixth is at a constant potential V₀; by a six-fold superposition
of the result, you could obtain the potential inside a cube with the faces maintained at specified
constant voltages V₁, V₂, ..., V₆. In this way, using Ex. 3.4 and Prob. 3.17, find the potential
inside a rectangular pipe with two facing sides (x = ±b) at potential V₀, a third (y = a) at
V₁, and the last (at y = 0) grounded.


24 Richard Mawhorter suggested this problem.


Problem 3.47. A conducting sphere of radius a, at potential V₀, is surrounded by a thin
concentric spherical shell of radius b, over which someone has glued a surface charge


σ(θ) = k cos θ,

where k is a constant and θ is the usual spherical coordinate.


(a) Find the potential in each region: (i) r > b and (ii) a < r < b.

(b) Find the induced surface charge σᵢ(θ) on the conductor.

(c) What is the total charge of this system? Check that your answer is consistent with the
behavior of V at large r.


\[
\left[\text{Answer: } V(r,\theta) =
\begin{cases}
aV_0/r + (b^3 - a^3)\,k\cos\theta/3r^2\epsilon_0, & r \ge b,\\[2pt]
aV_0/r + (r^3 - a^3)\,k\cos\theta/3r^2\epsilon_0, & a \le r \le b.
\end{cases}\right]
\]


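Problem 3.48. A charge +Q is distributed uniformly along the z-axis from z = −a to z =
+a. Show that the electric potential at a point r is given by

\[
V(r,\theta) = \frac{Q}{4\pi\epsilon_0}\frac{1}{r}\left[1 + \frac{1}{3}\left(\frac{a}{r}\right)^2 P_2(\cos\theta) + \frac{1}{5}\left(\frac{a}{r}\right)^4 P_4(\cos\theta) + \cdots\right],
\]

for r > a.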


Problem 3.49. A long cylindrical shell of radius R carries a uniform surface charge σ₀ on the
upper half and an opposite charge −σ₀ on the lower half (Fig. 3.40). Find the electric potential
inside and outside the cylinder.


Fig. 3.40 


Problem 3.50. A thin insulating rod, running from z = −a to z = +a, carries the line
charges indicated below. In each case, find the leading term in the multipole expansion of the
potential: (a) λ = k cos(πz/2a), (b) λ = k sin(πz/a), (c) λ = k cos(πz/a), where k
is a constant.


Problem 3.51. Show that the average field inside a sphere of radius R, due to all the charge 
within the sphere, is 

\[
\mathbf{E}_{\text{ave}} = -\frac{1}{4\pi\epsilon_0}\frac{\mathbf{p}}{R^3}, \tag{3.105}
\]

where p is the total dipole moment. There are several ways to prove this delightfully simple
result. Here's one method:²⁵


(a) Show that the average field due to a single charge q at point r inside the sphere is the same
as the field at r due to a uniformly charged sphere with ρ = −q/((4/3)πR³), namely

\[
\frac{1}{4\pi\epsilon_0}\frac{1}{\left(\frac{4}{3}\pi R^3\right)}\int \frac{q}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}\,d\tau',
\]

where 𝓇 is the vector from r to dτ′.
(b) The latter can be found from Gauss’s law (see Prob. 2.13). Express the answer in terms of the
dipole moment of q.
(c) Use the superposition principle to generalize to an arbitrary charge distribution.
(d) While you're at it, show that the average field over the volume of a sphere, due to all the
charges outside, is the same as the field they produce at the center.


Problem 3.52. 


(a) Using Eq. 3.103, calculate the average electric field of a dipole, over a spherical volume of
radius R, centered at the origin. Do the angular integrals first. [Note: You must express r̂
and θ̂ in terms of x̂, ŷ, and ẑ (see back cover) before integrating. If you don’t understand
why, reread the discussion in Section 1.4.1.] Compare your answer with the general theorem
(Eq. 3.105). The discrepancy here is related to the fact that the field of a dipole blows up at
r = 0. The angular integral is zero, but the radial integral is infinite, so we really don't
know what to make of the answer. To resolve this dilemma, let's say that Eq. 3.103 applies
outside a tiny sphere of radius ε; its contribution to E_ave is then unambiguously zero, and
the whole answer has to come from the field inside the ε-sphere.

(b) What must the field inside the ε-sphere be, in order for the general theorem (Eq. 3.105)
to hold? [Hint: Since ε is arbitrarily small, we're talking about something that is infinite
at r = 0 and whose integral over an infinitesimal volume is finite.] [Answer:
−(p/3ε₀)δ³(r).]

Evidently, the true field of a dipole is

\[
\mathbf{E}_{\text{dip}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\frac{1}{r^3}\left[3(\mathbf{p}\cdot\hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{p}\right] - \frac{1}{3\epsilon_0}\,\mathbf{p}\,\delta^3(\mathbf{r}). \tag{3.106}
\]

You may wonder how we missed the contact term (the delta function)²⁶ when we calculated
the field back in Section 3.4.4. The answer is that the differentiation leading to Eq. 3.103 is valid
except at r = 0, but we should have known (from our experience in Section 1.5.1) that the point
r = 0 would be problematic.²⁷


25 Another method exploits the result of Prob. 3.4. See B. Y.-K. Hu, Eur. J. Phys. 30, L29
(2009). See also Prob. 3.56b.

26 There are other ways of getting the delta-function term in the field of a dipole; my own
favorite is Prob. 3.53. Note that unless you are right on top of the dipole, Eq. 3.104 is
perfectly adequate.


Problem 3.53. In Ex. 3.9, we obtained the potential of a spherical shell with surface charge
σ(θ) = k cos θ. In Prob. 3.32, you found that the field is pure dipole outside; it’s uniform
inside (Eq. 3.86). Show that the limit R → 0 reproduces the delta-function term in Eq. 3.106.


Problem 3.54. 


(a) Suppose a charge distribution ρ₁(r) produces a potential V₁(r), and some other charge
distribution ρ₂(r) produces a potential V₂(r). [The two situations may have nothing in
common, for all I care; perhaps number 1 is a uniformly charged sphere and number 2 is
a parallel-plate capacitor. Please understand that ρ₁ and ρ₂ are not present at the same
time; we are talking about two different problems, one in which only ρ₁ is present, and
another in which only ρ₂ is present.] Prove Green’s reciprocity theorem:

\[
\int_{\text{all space}} \rho_1 V_2\,d\tau = \int_{\text{all space}} \rho_2 V_1\,d\tau. \tag{3.107}
\]

[Hint: Evaluate ∫ E₁ · E₂ dτ two ways, first writing E₁ = −∇V₁ and using integration
by parts to transfer the derivative to E₂, then writing E₂ = −∇V₂ and transferring the
derivative to E₁.]

(b) Suppose now that you have two separated conductors (Fig. 3.41). If you charge up conductor
a by amount Q (leaving b uncharged), the resulting potential of b is, say, V_ab. On the other
hand, if you put that same charge Q on conductor b (leaving a uncharged), the potential of
a would be V_ba. Use Green's reciprocity theorem to show that V_ab = V_ba (an astonishing
result, since we assumed nothing about the shapes or placement of the conductors).


Fig. 3.41 


Problem 3.55. Use Green’s reciprocity theorem (Prob. 3.54) to solve the following problems. 
[Hint: For distribution 1, use the actual situation; for distribution 2, remove q and specify the 
charge or potential on each conductor.] 


27 See C. P. Frahm, Am. J. Phys. 51, 826 (1983). For applications, see D. J. Griffiths, Am. J. 
Phys. 50, 698 (1982). There are other (perhaps preferable) ways of expressing the contact 
term in Eq. 3.106; see A. Gsponer, Eur. J. Phys. 28, 267 (2007); J. Franklin, Am. J. Phys. 78, 
1225 (2010); and V. Hnizdo, Eur. J. Phys. 32, 287 (2011). 

28 For interesting commentary, see B. Y.-K. Hu, Am. J. Phys. 69, 1280 (2001). 
(a) Both plates of a parallel-plate capacitor are grounded, and a point charge q is placed
between them at a distance x from plate 1. The plate separation is d. Find the induced
charge on each plate. [Answer: Q₁ = q(x/d − 1); Q₂ = −qx/d.]

(b) Two concentric spherical conducting shells (radii a and b) are grounded, and a point charge 
q is placed between them (at radius r). Find the induced charge on each sphere. 

(c) A point charge q is located a distance a from the center of a grounded conducting spherical 
shell of radius R. Find the net charge Q induced on the sphere. (Treat both cases: (i) a > R 
and (ii) a < R.) 


Problem 3.56. Green's reciprocity theorem (Prob. 3.54) has an interesting twin:

\[
\int_{\text{all space}} \rho_1\mathbf{E}_2\,d\tau = -\int_{\text{all space}} \rho_2\mathbf{E}_1\,d\tau, \tag{3.108}
\]

where E₁ is the field that would be produced by ρ₁ alone, and E₂ is the field of ρ₂ alone. This
theorem has no name, but it amounts to a statement of Newton’s third law for electrostatics:
it says that the net force of configuration 1 on configuration 2 (if both are present) is equal and
opposite to the net force of 2 on 1.


(a) Prove it, starting with Coulomb's law. (As we shall discover in Chapter 8, the third law does 
not hold for time-dependent configurations, where Coulomb's law is invalid; your task is to 
prove that it does hold in electrostatics.) 

(b) Suppose configuration 1 is an arbitrary charge distribution within a sphere of radius R, 
and configuration 2 is a uniform charge distribution filling that same spherical volume. Use 
Eq. 3.108 to show that the average electric field of configuration 1 over the spherical volume 
is given by Eq. 3.105. 


Problem 3.57. 


(a) Show that the quadrupole term in the multipole expansion can be written

\[
V_{\text{quad}}(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\frac{1}{r^3}\sum_{i,j=1}^{3}\hat{r}_i\hat{r}_j Q_{ij}
\]

(in the notation of Eq. 1.31), where

\[
Q_{ij} \equiv \frac{1}{2}\int \left[3r'_i r'_j - (r')^2\delta_{ij}\right]\rho(\mathbf{r}')\,d\tau'.
\]

Here,

\[
\delta_{ij} = \begin{cases} 1 & \text{if } i = j,\\ 0 & \text{if } i \neq j, \end{cases}
\]

is the Kronecker delta, and Q_ij is the quadrupole moment of the charge distribution.
Notice the hierarchy:

\[
V_{\text{mon}} = \frac{1}{4\pi\epsilon_0}\frac{Q}{r}; \qquad
V_{\text{dip}} = \frac{1}{4\pi\epsilon_0}\frac{\sum \hat{r}_i p_i}{r^2}; \qquad
V_{\text{quad}} = \frac{1}{4\pi\epsilon_0}\frac{\sum \hat{r}_i\hat{r}_j Q_{ij}}{r^3}; \qquad \ldots
\]

The monopole moment (Q) is a scalar, the dipole moment (p) is a vector, the quadrupole
moment (Q_ij) is a second-rank tensor, and so on.

(b) Find all nine components of Q_ij for the configuration in Fig. 3.30 (assume the square has
side a and lies in the xy plane, centered at the origin).

(c) Show that the quadrupole moment is independent of origin if the monopole and dipole
moments both vanish. (This works all the way up the hierarchy: the lowest nonzero
multipole moment is always independent of origin.)

(d) How would you define the octopole moment? Express the octopole term in the multipole
expansion in terms of the octopole moment.


29 Formulas for the average field and average potential over spherical surfaces and volumes
(Section 3.1.4, Probs. 3.4, 3.39, 3.51) can be obtained very efficiently using Eqs. 3.107 and
3.108. See V. Kovacs and L. Palfalvi, Eur. J. Phys. 42, 045202 (2021).


Problem 3.58. In Ex. 3.8 we determined the electric field outside a spherical conductor (radius 
R) placed in a uniform external field E₀. Solve the problem now using the method of images,
and check that your answer agrees with Eq. 3.76. [Hint: Use Ex. 3.2, but put another charge, −q,
diametrically opposite q. Let a → ∞, with (1/4πε₀)(2q/a²) = −E₀ held constant.]


! Problem 3.59. For the infinite rectangular pipe in Ex. 3.4, suppose the potential on the bottom
(y = 0) and the two sides (x = ±b) is zero, but the potential on the top (y = a) is a nonzero
constant V₀. Find the potential inside the pipe. [Note: This is a rotated version of Prob. 3.17b,
but set it up as in Ex. 3.4, using sinusoidal functions in y and hyperbolics in x. It is an unusual
case in which k = 0 must be included. Begin by finding the general solution to Eq. 3.26 when
k = 0.]

[Answer:

\[
V_0\left(\frac{y}{a} + \frac{2}{\pi}\sum_{n=1}^{\infty}\frac{(-1)^n}{n}\,\frac{\cosh(n\pi x/a)}{\cosh(n\pi b/a)}\,\sin(n\pi y/a)\right).
\]

Alternatively, using sinusoidal functions of x and hyperbolics in y,

\[
-\frac{2V_0}{b}\sum_{n=1}^{\infty}\frac{(-1)^n\sinh(\alpha_n y)}{\alpha_n\sinh(\alpha_n a)}\cos(\alpha_n x), \quad\text{where } \alpha_n \equiv (2n-1)\pi/2b.]
\]


! Problem 3.60.


(a) A long metal pipe of square cross section (side length a) is grounded on three sides, while
the fourth (which is insulated from the rest) is maintained at constant potential V₀. Find
the net charge per unit length on the side opposite to V₀. [Hint: Use your answer to
Prob. 3.17 or Prob. 3.59.]

(b) A long metal pipe of circular cross section (radius R) is divided (lengthwise) into four equal
sections, three of them grounded and the fourth maintained at constant potential V₀. Find
the net charge per unit length on the section opposite to V₀. [Answer to both (a) and (b):
λ = −(ε₀V₀/π) ln 2.]³¹


30 For further discussion, see S. Hassani, Am. J. Phys. 59, 470 (1991).


Problem 3.61. An ideal electric dipole is situated at the origin, and points in the z direction, 
as in Fig. 3.36. An electric charge is released from rest at a point in the xy plane. Show that 
it swings back and forth in a semicircular arc, as though it were a pendulum supported at the 
origin.³²


Problem 3.62. A stationary electric dipole p = p ẑ is situated at the origin. A positive point
charge q (mass m) executes circular motion (radius s) at constant speed in the field of the dipole.
Characterize the plane of the orbit. Find the speed, angular momentum, and total energy of the
charge.³³ [Answer: L = √(qpm/(3√3 πε₀)).]


Problem 3.63. Find the charge density σ(θ) on the surface of a sphere (radius R) that
produces the same electric field, for points exterior to the sphere, as a charge q at the point
a < R on the z-axis. [Answer: (q/4πR)(R² − a²)(R² + a² − 2Ra cos θ)^(−3/2).]


Problem 3.64. Consider a perfect dipole at the origin, pointing in the y direction. Using the 
method of Prob. 2.65, plot the field line that passes through the point (2, 0). (You might as well 
set p/(4πε₀) → 1.) What do you suppose is the line integral ∮ E · dl around the (closed)
loop? [Beware: This is a trick question. Explain your answer.] Draw a diagram with 25 field lines.
[Notice that StreamPoints -> 25 chooses “representative” field lines, but their density
is not proportional to the strength of the field, as you can see from where they cross the x-axis;
they should get farther and farther apart.]


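31 These are special cases of the Thompson-Lampard theorem; see J. D. Jackson, Am. J. Phys.
67, 107 (1999).

32 This charming result is due to R. S. Jones, Am. J. Phys. 63, 1042 (1995).

33 G. P. Sastry, V. Srinivas, and A. V. Madhav, Eur. J. Phys. 17, 275 (1996).


Problem 3.65. [The method of relaxation] In two dimensions,

\[
\frac{\partial V}{\partial x} \approx \frac{V(x+\epsilon/2,\,y) - V(x-\epsilon/2,\,y)}{\epsilon}, \qquad
\frac{\partial V}{\partial y} \approx \frac{V(x,\,y+\epsilon/2) - V(x,\,y-\epsilon/2)}{\epsilon}
\]

(in the limit ε → 0, of course), so

\[
\frac{\partial^2 V}{\partial x^2} = \frac{\partial}{\partial x}\left(\frac{\partial V}{\partial x}\right) \approx \frac{V(x+\epsilon,\,y) - 2V(x,\,y) + V(x-\epsilon,\,y)}{\epsilon^2}
\]

(and similarly for ∂²V/∂y²). Laplace's equation says

\[
\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} \approx \frac{1}{\epsilon^2}\bigl[V(x+\epsilon,\,y) + V(x-\epsilon,\,y) + V(x,\,y+\epsilon) + V(x,\,y-\epsilon) - 4V(x,\,y)\bigr] = 0,
\]

and hence

\[
V(x,y) = \tfrac{1}{4}\bigl[V(x+\epsilon,\,y) + V(x-\epsilon,\,y) + V(x,\,y+\epsilon) + V(x,\,y-\epsilon)\bigr]. \tag{3.109}
\]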


On a checkerboard grid, with squares of side ε, V is the average of its values at the four nearest
neighbors (that’s for infinitesimal ε, but it's approximately right for small finite ε).

Suppose we want to solve Laplace's equation in a rectangular region, with specified values
on its boundaries. We'll cover the region with an ε-grid, and start by assigning a value V(i, j)
to each point (i, j); on the edges this will be the specified boundary value; at interior points it
can be anything (I'll set them all to zero). Now we go through and reassign to each interior point
the average of its four neighbors, per Eq. 3.109. Then we repeat, using those updated values, and
iterate the process until we find that the results are no longer changing significantly.

Here's a bare-bones implementation in Mathematica (there are many ways to improve on
it, increasing its speed, accuracy, and versatility, and the same strategy can be used in three
dimensions and in other mathematical contexts):


For[k = 1, k <= n, k = k + 1,            (* n sweeps over the grid *)
  For[i = 2, i <= r - 1, i = i + 1,      (* interior rows only *)
    For[j = 2, j <= c - 1, j = j + 1,    (* interior columns only *)
      NewV[[i, j]] = (V[[i + 1, j]] + V[[i - 1, j]] + V[[i, j + 1]] + V[[i, j - 1]])/4.]];
  V = NewV];                             (* adopt the updated values *)
ListPlot3D[V]



It takes as input three integers: n (the number of iterations), r (the number of rows in the
grid), and c (the number of columns), as well as the list of initial values V. (Mathematica will
also want to know your initial auxiliary list, NewV, but since that will immediately be changed
it doesn’t matter what you put in; might as well set NewV = V.) If your grid is fairly small,
you can literally type in the initial V by hand, for example:


To see this displayed as a grid, type MatrixForm[V] . Here the boundary values are 
constant (4) on the left and right edges, and 0 on the top and bottom — except in the corners, 
where they have been preempted; these values do not change in the course of the calculation. 


(a) Run that example, and compare the result with Fig. 3.21. 

(b) Modify the initial V so as to mimic Fig. 3.18. 

(c) On a 20 × 20 grid, set the perimeter to 0, but assign the four central points a potential of
4. That central potential must stay fixed, and you'll need to figure out a way to arrange this.
One possibility is to insert an “If ... then” clause:


If[(i == 10 || i == 11) && (j == 10 || j == 11), NewV[[i, j]] = 4, ...]


(In words, “if i = 10 or i = 11 and j = 10 or j = 11, then set NewV[[i, j]]
to 4; otherwise ...”) Use n = 500.
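
For readers working in another language, the relaxation procedure of Prob. 3.65 can be sketched in plain Python as follows. This is an illustrative translation under stated assumptions, not the book's code: the names make_grid and relax, the optional fixed argument, and the 7 × 7 sample grid are all hypothetical choices, and Python indices start at 0 rather than 1.

```python
def make_grid(rows, cols, side_value=4.0):
    """Initial V: side_value on the left and right edges, 0 everywhere else."""
    V = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        V[i][0] = side_value
        V[i][cols - 1] = side_value
    return V

def relax(V, n_iter, fixed=()):
    """Repeat Eq. 3.109 n_iter times on the interior points of V.

    V     : list of rows; the outermost entries are the boundary values.
    fixed : optional (i, j) pairs of interior points whose values are held.
    Returns a new grid; the input list is left unchanged.
    """
    rows, cols = len(V), len(V[0])
    held = set(fixed)
    V = [row[:] for row in V]
    for _ in range(n_iter):
        new_V = [row[:] for row in V]          # plays the role of NewV
        for i in range(1, rows - 1):
            for j in range(1, cols - 1):
                if (i, j) in held:
                    continue                   # keep this point as it is
                new_V[i][j] = 0.25 * (V[i + 1][j] + V[i - 1][j]
                                      + V[i][j + 1] + V[i][j - 1])
        V = new_V
    return V

grid = make_grid(7, 7)
result = relax(grid, 500)
for row in result:
    print(" ".join(f"{value:5.2f}" for value in row))
```

To hold interior points fixed, as in part (c), pass their index pairs through fixed; on a 20 × 20 grid the four central points are (9, 9), (9, 10), (10, 9), (10, 10) in Python's zero-based indexing, and their starting values should be set to 4 before calling relax.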


Electric Fields in Matter 


4.1 Polarization 


4.1.1 Dielectrics 


In this chapter, we shall study electric fields in matter. Matter, of course, 
comes in many varieties — solids, liquids, gases, metals, woods, glasses — and 
these substances do not all respond in the same way to electrostatic fields. 
Nevertheless, most everyday objects belong (at least, in good approxima- 
tion) to one of two large classes: conductors and insulators (or dielectrics). 
We have already talked about conductors; these are substances that contain 
an “unlimited” supply of charges that are free to move about through 
the material. In practice, what this ordinarily means is that many of the 
electrons (one or two per atom, in a typical metal) are not associated 
with any particular nucleus, but roam around in response to applied fields. 
In dielectrics, by contrast, all charges are attached to specific atoms or 
molecules — they’re on a tight leash, and all they can do is move a bit 
within the atom or molecule. Such microscopic displacements are not as 
dramatic as the wholesale rearrangement of charge in a conductor, but 
their cumulative effects account for the characteristic behavior of dielectric 
materials. There are actually two principal mechanisms by which electric 
fields can distort the charge distribution of a dielectric atom or molecule: 
stretching and rotating. In the next two sections I’ll discuss these processes.


4.1.2 Induced Dipoles 


What happens to a neutral atom when it is placed in an electric field E? 
Your first guess might well be: “Absolutely nothing — since the atom is 
not charged, the field has no effect on it.” But that is incorrect. Although 
the atom as a whole is electrically neutral, there is a positively charged 
core (the nucleus) and a negatively charged electron cloud surrounding it. 
These two regions of charge within the atom are influenced by the field: the 
nucleus is pushed in the direction of the field, and the electrons the opposite 
way. In principle, if the field is large enough, it can pull the atom apart 
completely, ionizing it (the substance then becomes a conductor). With less 
extreme fields, however, an equilibrium is soon established, for if the center 
of the electron cloud does not coincide with the nucleus, these positive
and negative charges attract one another, and that holds the atom together.
The two opposing forces (E pulling the electrons and nucleus apart, their
mutual attraction drawing them back together) reach a balance, leaving
the atom polarized, with plus charge shifted slightly one way, and minus
the other. The atom now has a tiny dipole moment p, which points in the
same direction as E. Typically, this induced dipole moment is approximately
proportional to the field (as long as the latter is not too strong):


p = αE. (4.1)


The constant of proportionality α is called atomic polarizability. Its value
depends on the detailed structure of the atom in question. Table 4.1 lists
some experimentally determined atomic polarizabilities.


Table 4.1. Atomic polarizabilities (α/4πε₀, in units of 10⁻³⁰ m³). Data from Handbook of
Chemistry and Physics, 91st ed., CRC Press, Boca Raton (2010).

H      He     Li    Be    C     Ne     Na    Ar    K     Cs
0.667  0.205  24.3  5.60  1.67  0.396  24.1  1.64  43.4  59.4
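
To get a feel for the sizes involved, here is a short Python sketch that turns a few Table 4.1 entries into SI polarizabilities and applies Eq. 4.1. It is only an illustration: the function names and the sample field of 10⁶ V/m are assumptions, and just a handful of table entries are included.

```python
import math

EPS0 = 8.8541878128e-12        # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19     # elementary charge, C

# A few entries of Table 4.1: alpha/(4 pi eps0), in units of 1e-30 m^3
TABLE_4_1 = {"H": 0.667, "He": 0.205, "Li": 24.3, "C": 1.67,
             "Ne": 0.396, "Ar": 1.64, "Cs": 59.4}

def polarizability(atom):
    """Atomic polarizability alpha in SI units (C^2 m / N)."""
    return 4.0 * math.pi * EPS0 * TABLE_4_1[atom] * 1.0e-30

def induced_dipole(atom, field):
    """Induced dipole moment p = alpha E (Eq. 4.1), in C m."""
    return polarizability(atom) * field

field = 1.0e6   # assumed sample field strength, V/m
for atom in ("H", "He", "Cs"):
    p = induced_dipole(atom, field)
    # p/e is the charge separation if the dipole were made of charges +e and -e
    print(atom, p, p / E_CHARGE)
```

Even in a field of a million volts per meter the equivalent separation p/e comes out many orders of magnitude smaller than an atomic radius, which is why the linear relation of Eq. 4.1 is usually adequate.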


Example 4.1. A primitive model for an atom consists of a point 
nucleus (+q) surrounded by a uniformly charged spherical cloud (—q) 
of radius a (Fig. 4.1). Calculate the atomic polarizability of such an 
atom. 


Solution. In the presence of an external field E, the nucleus will be 
shifted slightly to the right and the electron cloud to the left, as shown 
in Fig. 4.2. (Because the actual displacements involved are extremely 
small, as you'll find in Prob. 4.1, it is reasonable to assume that the 
electron cloud retains its spherical shape.) Say that equilibrium occurs 
when the nucleus is displaced a distance d from the center of the sphere. 
At that point, the external field pushing the nucleus to the right exactly 
balances the internal field pulling it to the left: E = E_e, where E_e is the
field produced by the electron cloud. Now the field at a distance d from
the center of a uniformly charged sphere is

\[
E_e = \frac{1}{4\pi\epsilon_0}\frac{qd}{a^3}
\]

(Prob. 2.13). At equilibrium, then,

\[
E = \frac{1}{4\pi\epsilon_0}\frac{qd}{a^3}, \quad\text{or}\quad p = qd = (4\pi\epsilon_0 a^3)E.
\]

The atomic polarizability is therefore

\[
\alpha = 4\pi\epsilon_0 a^3 = 3\epsilon_0 v, \tag{4.2}
\]

where v is the volume of the atom. Although this atomic model is
extremely crude, the result (Eq. 4.2) is not too bad; it’s accurate to
within a factor of 4 or so for many simple atoms.
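
As a rough check on that claim, the sketch below evaluates Eq. 4.2 for hydrogen and compares it with the tabulated value. Taking the Bohr radius for a is my assumption (the model itself does not say what a should be), and the variable names are illustrative.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, F/m

def model_polarizability(a):
    """Eq. 4.2: alpha = 4 pi eps0 a^3 for the uniform-cloud model."""
    return 4.0 * math.pi * EPS0 * a**3

a = 0.529e-10                             # assumed radius: the Bohr radius, m
volume = (4.0 / 3.0) * math.pi * a**3     # volume v of the model atom

alpha_model = model_polarizability(a)
alpha_from_volume = 3.0 * EPS0 * volume   # the second form in Eq. 4.2

# Table 4.1 lists alpha/(4 pi eps0) = 0.667e-30 m^3 for hydrogen
alpha_table = 4.0 * math.pi * EPS0 * 0.667e-30

ratio = alpha_table / alpha_model
print(alpha_model, alpha_from_volume, ratio)
```

With this choice of a the model underestimates the measured polarizability of hydrogen by a factor of between 4 and 5; a different choice of radius changes the ratio as a³, so the comparison is only as good as that assumption.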


For molecules the situation is not quite so simple, because frequently 
they polarize more readily in some directions than in others. Carbon dioxide 
(Fig. 4.3), for instance, has a polarizability of 4.5 × 10⁻⁴⁰ C²·m/N when
you apply the field along the axis of the molecule, but only 2 × 10⁻⁴⁰ for
fields perpendicular to this direction. When the field is at some angle to the
axis, you must resolve it into parallel and perpendicular components, and
multiply each by the pertinent polarizability:


p = α⊥E⊥ + α∥E∥.


In that case the induced dipole moment may not even be in the same 
direction as E. And CO₂ is relatively simple, as molecules go, since the atoms
at least arrange themselves in a straight line; for a completely asymmetrical
molecule, Eq. 4.1 is replaced by the most general linear relation between
E and p:

\[
\begin{aligned}
p_x &= \alpha_{xx}E_x + \alpha_{xy}E_y + \alpha_{xz}E_z,\\
p_y &= \alpha_{yx}E_x + \alpha_{yy}E_y + \alpha_{yz}E_z,\\
p_z &= \alpha_{zx}E_x + \alpha_{zy}E_y + \alpha_{zz}E_z.
\end{aligned} \tag{4.3}
\]

The set of nine constants α_ij constitute the polarizability tensor for the
molecule. Their values depend on the orientation of the axes you use,
though it is always possible to choose “principal” axes such that all the
off-diagonal terms (α_xy, α_zx, etc.) vanish, leaving just three nonzero
polarizabilities: α_xx, α_yy, and α_zz.
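
The fact that p need not be parallel to E can be seen in a few lines of Python. The sketch below applies Eq. 4.3 to carbon dioxide, using the two polarizabilities quoted above (4.5 × 10⁻⁴⁰ and 2 × 10⁻⁴⁰ C²·m/N) in principal axes with the molecular axis along z. The field strength, the 45° orientation, and the helper names are assumptions made for the example.

```python
import math

def induced_dipole(alpha, field):
    """p_i = sum over j of alpha_ij E_j (Eq. 4.3)."""
    return tuple(sum(alpha[i][j] * field[j] for j in range(3)) for i in range(3))

def angle_between(u, v):
    """Angle between two vectors, in degrees."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(dot / (norm_u * norm_v)))

ALPHA_PAR = 4.5e-40    # along the molecular axis, C^2 m / N
ALPHA_PERP = 2.0e-40   # perpendicular to the axis, C^2 m / N

# Principal axes: the tensor is diagonal, with the molecular axis along z.
alpha_co2 = [[ALPHA_PERP, 0.0, 0.0],
             [0.0, ALPHA_PERP, 0.0],
             [0.0, 0.0, ALPHA_PAR]]

E0 = 1.0e5                                           # assumed field magnitude, V/m
field = (E0 / math.sqrt(2), 0.0, E0 / math.sqrt(2))  # 45 degrees from the axis

p = induced_dipole(alpha_co2, field)
angle = angle_between(p, field)
print(p, angle)
```

Because the molecule polarizes more readily along its axis, p is tilted away from E toward the z axis, by roughly twenty degrees for this orientation; only when E lies along a principal axis do the two directions coincide.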
Problem 4.1. A hydrogen atom (with the Bohr radius of half an angstrom) is situated between
two metal plates 1 mm apart, which are connected to opposite terminals of a 500 V battery.
What fraction of the atomic radius does the separation distance d amount to, roughly? Estimate
the voltage you would need with this apparatus to ionize the atom. [Use the value of α in
Table 4.1. Moral: The displacements we're talking about are minute, even on an atomic scale.]


Problem 4.2. According to quantum mechanics, the electron cloud for a hydrogen atom in its
ground state has a charge density

$$\rho(r) = \frac{q}{\pi a^3}\, e^{-2r/a},$$

where q is the charge of the electron and a is the Bohr radius. Find the atomic polarizability of
such an atom. [Hint: First calculate the electric field of the electron cloud, $E_e(r)$; then expand
the exponential, assuming $r \ll a$.]¹


Problem 4.3. According to Eq. 4.1, the induced dipole moment of an atom is proportional to 
the external field. This is a “rule of thumb,” not a fundamental law, and it is easy to concoct
exceptions — in theory. Suppose, for example, the charge density of the electron cloud were 
proportional to the distance from the center, out to a radius R. To what power of E would p be 
proportional in that case? Find the condition on ρ(r) such that Eq. 4.1 will hold in the
weak-field limit.


Problem 4.4. A point charge q is situated a large distance r from a neutral atom of
polarizability α. Find the force of attraction between them.


4.1.3 Alignment of Polar Molecules


The neutral atom discussed in Section 4.1.2 had no dipole moment to 
start with; p was induced by the applied field. Some molecules have built- 
in, permanent dipole moments. In the water molecule, for example, the 
electrons tend to cluster around the oxygen atom (Fig. 4.4), and since the 
molecule is bent at 105°, this leaves a negative charge at the vertex and 
a net positive charge on the opposite side. (The dipole moment of water
is unusually large: $6.1 \times 10^{-30}\ \mathrm{C{\cdot}m}$; in fact, this is what accounts for its
effectiveness as a solvent.) What happens when such molecules (called polar 
molecules) are placed in an electric field? 

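If the field is uniform, the force on the positive end, $\mathbf{F}_+ = q\mathbf{E}$, exactly
cancels the force on the negative end, $\mathbf{F}_- = -q\mathbf{E}$ (Fig. 4.5). However, there
will be a torque:


$$\begin{aligned}
\mathbf{N} &= (\mathbf{r}_+ \times \mathbf{F}_+) + (\mathbf{r}_- \times \mathbf{F}_-) \\
&= [(\mathbf{d}/2) \times (q\mathbf{E})] + [(-\mathbf{d}/2) \times (-q\mathbf{E})] = q\mathbf{d} \times \mathbf{E}.
\end{aligned}$$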


1 For a more sophisticated approach, see W. A. Bowers, Am. J. Phys. 54, 347 (1986).

Thus a dipole p = qd in a uniform field E experiences a torque


$$\mathbf{N} = \mathbf{p} \times \mathbf{E}. \tag{4.4}$$


Notice that N is in such a direction as to line p up parallel to E; a polar 
molecule that is free to rotate will swing around until it points in the 
direction of the applied field. 
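
The torque formula can be checked numerically by adding the torques on the two ends of a physical dipole and comparing with p × E. This is only a sketch: the charge, separation vector, and field below are assumed values, and the helper names are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def torque_from_forces(q, d, E):
    """Torque about the dipole's center: (d/2) x (qE) + (-d/2) x (-qE)."""
    half = tuple(0.5 * c for c in d)        # position of +q relative to the center
    minus_half = tuple(-c for c in half)    # position of -q relative to the center
    F_plus = tuple(q * c for c in E)        # force on +q
    F_minus = tuple(-q * c for c in E)      # force on -q
    t1 = cross(half, F_plus)
    t2 = cross(minus_half, F_minus)
    return tuple(x + y for x, y in zip(t1, t2))


q = 1.6e-19                     # assumed charge, C
d = (0.0, 3.0e-11, 2.0e-11)     # assumed vector from -q to +q, m
E = (0.0, 0.0, 1.0e5)           # assumed uniform field along z, V/m

p = tuple(q * c for c in d)     # dipole moment p = q d
N_direct = torque_from_forces(q, d, E)
N_formula = cross(p, E)         # Eq. 4.4

print("sum of end torques:", N_direct)
print("p x E             :", N_formula)
```

With these numbers the torque points along +x, which rotates p (tilted toward +y) around toward the z direction of E, in line with the remark that N tends to line p up parallel to the field.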

If the field is nonuniform, so that $\mathbf{F}_+$ does not exactly balance $\mathbf{F}_-$, there
will be a net force on the dipole, in addition to the torque. Of course, E must
change rather abruptly for there to be significant variation in the space of
one molecule, so this is not ordinarily a major consideration in discussing
the behavior of dielectrics. Nevertheless, the formula for the force on a
dipole in a nonuniform field is of some interest:


$$\mathbf{F} = \mathbf{F}_+ + \mathbf{F}_- = q(\mathbf{E}_+ - \mathbf{E}_-) = q(\Delta\mathbf{E}),$$


where ΔE represents the difference between the field at the plus end and
the field at the minus end. Assuming the dipole is very short, we may use
Eq. 1.35 to approximate the small change in $E_x$:


$$\Delta E_x \equiv (\nabla E_x)\cdot\mathbf{d},$$

with corresponding formulas for $E_y$ and $E_z$. More compactly,

$$\Delta\mathbf{E} = (\mathbf{d}\cdot\nabla)\mathbf{E},$$


and therefore²


$$\mathbf{F} = (\mathbf{p}\cdot\nabla)\mathbf{E}. \tag{4.5}$$


For a “perfect” dipole of infinitesimal length, Eq. 4.4 gives the torque about
the center of the dipole even in a nonuniform field; about any other point,
$\mathbf{N} = (\mathbf{p} \times \mathbf{E}) + (\mathbf{r} \times \mathbf{F})$.
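
One way to see Eq. 4.5 at work is to compare it with the exact two-charge force q(E₊ − E₋) for a short physical dipole sitting in the field of a point charge. The sketch below does this with central finite differences for the gradient. All numerical values, the step size `h`, and the function names are assumptions made for the illustration.

```python
import math

EPS0 = 8.8541878128e-12
K = 1.0 / (4.0 * math.pi * EPS0)   # 1/(4 pi eps0), N m^2/C^2


def E_point(Q, r):
    """Field of a point charge Q at the origin, evaluated at position r."""
    dist = math.sqrt(sum(c * c for c in r))
    return tuple(K * Q * c / dist**3 for c in r)


def force_exact(q, d, center, Q):
    """q(E_plus - E_minus) for charges +q and -q located at center +/- d/2."""
    r_plus = tuple(c + 0.5 * s for c, s in zip(center, d))
    r_minus = tuple(c - 0.5 * s for c, s in zip(center, d))
    E_plus, E_minus = E_point(Q, r_plus), E_point(Q, r_minus)
    return tuple(q * (a - b) for a, b in zip(E_plus, E_minus))


def force_dipole(p, center, Q, h=1.0e-5):
    """(p . grad)E at the center, using central differences of step h."""
    F = [0.0, 0.0, 0.0]
    for axis in range(3):
        fwd = list(center)
        bwd = list(center)
        fwd[axis] += h
        bwd[axis] -= h
        E_f, E_b = E_point(Q, fwd), E_point(Q, bwd)
        for i in range(3):
            F[i] += p[axis] * (E_f[i] - E_b[i]) / (2.0 * h)
    return tuple(F)


Q = 1.0e-9                    # assumed source charge, C
q = 1.0e-12                   # assumed charge on each end of the dipole, C
d = (1.0e-6, 0.0, 2.0e-6)     # assumed separation vector, m (short compared with 1 cm)
center = (0.0, 0.0, 0.01)     # dipole center, 1 cm from the source charge

p = tuple(q * c for c in d)
F_two = force_exact(q, d, center, Q)
F_dip = force_dipole(p, center, Q)

print("q(E+ - E-)  :", F_two)
print("(p . grad)E :", F_dip)
```

The two results agree closely because d is tiny compared with the distance over which E changes; stretching d toward the centimeter scale makes the difference visible, which is the sense in which Eq. 4.5 assumes a very short dipole.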


2 In the present context, Eq. 4.5 could be written more conveniently as $\mathbf{F} = \nabla(\mathbf{p}\cdot\mathbf{E})$. However,
it is safer to stick with $(\mathbf{p}\cdot\nabla)\mathbf{E}$, because we will be applying the formula to materials in which
the dipole moment (per unit volume) is itself a function of position and this alternative
expression would imply (incorrectly) that p too is to be differentiated.

Problem 4.5. In Fig. 4.6, p₁ and p₂ are (perfect) dipoles a distance r apart. What is the torque
on p₁ due to p₂? What is the torque on p₂ due to p₁? [Comment: In each case, I want the torque
on the dipole about its own center. If it bothers you that the answers are not equal and opposite,
see Prob. 4.29.]


Fig. 4.6 Fig. 4.7 


Problem 4.6. A (perfect) dipole p is situated a distance z above an infinite grounded
conducting plane (Fig. 4.7). The dipole makes an angle θ with the perpendicular to the plane. Find the
torque on p. If the dipole is free to rotate, in what orientation will it come to rest?


Problem 4.7. Show that the energy of a permanent³ ideal dipole p, in an electric field E, is
given by


$$U = -\mathbf{p}\cdot\mathbf{E}. \tag{4.6}$$


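Problem 4.8. Show that the interaction energy of two ideal dipoles separated by a
displacement r is


$$U = \frac{1}{4\pi\epsilon_0}\frac{1}{r^3}\left[\mathbf{p}_1\cdot\mathbf{p}_2 - 3(\mathbf{p}_1\cdot\hat{\mathbf{r}})(\mathbf{p}_2\cdot\hat{\mathbf{r}})\right]. \tag{4.7}$$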


[Hint: Use Prob. 4.7 and Eq. 3.104.] 


Problem 4.9. A dipole p is a distance r from a point charge q, and oriented so that p makes
an angle θ with the vector r from q to p.


(a) What is the force on p? 
(b) What is the force on q? 


4.1.4 Polarization 


In the previous two sections, we considered the effect of an external electric 
field on an individual atom or molecule. We are now in a position to answer 
(qualitatively) the original question: What happens to a piece of dielectric 
material when it is placed in an electric field? If the substance consists of 
neutral atoms (or nonpolar molecules), the field will induce in each a tiny 


3 This means fixed in magnitude. Beware of applying Eq. 4.6 to an induced dipole (Eq. 4.1) — 
it’s not incorrect, but it is not the whole story. See Prob. 4.32. 
dipole moment, pointing in the same direction as the field.⁴ If the material
is made up of polar molecules, each permanent dipole will experience a 
torque, tending to line it up along the field direction. (Random thermal 
motions compete with this process, so the alignment is never complete, 
especially at higher temperatures, and disappears almost at once when the 
field is removed.) 

Notice that these two mechanisms produce the same basic result: a lot of 
little dipoles pointing along the direction of the field — the material becomes 
polarized. A convenient measure of this effect is 


P = dipole moment per unit volume, 


which is called the polarization. From now on we shall not worry much 
about how the polarization got there. Actually, the two mechanisms I 
described are not as clear-cut as I tried to pretend. Even in polar molecules 
there will be some polarization by displacement (though generally it is a 
lot easier to rotate a molecule than to stretch it, so the second mechanism 
dominates). It’s even possible in some materials to “freeze in” polarization, 
so that it persists after the field is removed. But let’s forget for a moment 
about the cause of the polarization; we’re going to study the field that a 
chunk of polarized material itself produces. Then, in Section 4.3, we'll put 
it all together: the original field, which was responsible for P, plus the new 
field, which is due to P. 


4.2 The Field of a Polarized Object 


4.2.1 Bound Charges 


Suppose we have a piece of polarized material — that is, an object containing 
a lot of microscopic dipoles lined up. The dipole moment per unit volume 
P is given. Question: What is the field produced by this object (not the field 
that may have caused the polarization, but the field the polarization itself 
causes)? Well, we know what the field of an individual dipole looks like, so 
why not chop the material up into infinitesimal dipoles and integrate to get 
the total? As usual, it’s easier to work with the potential. For a single dipole 
p (Eq. 3.99), 

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\frac{\mathbf{p}\cdot\hat{𝓇}}{𝓇^2}, \tag{4.8}$$


where 𝓇 is the vector from the dipole to the point at which we are evaluating
the potential (Fig. 4.8). In the present context, we have a dipole moment
p = P dτ′ in each volume element dτ′, so the total potential is


4 In asymmetric molecules, the induced dipole moment may not be parallel to the field, but if 
the molecules are randomly oriented, the perpendicular contributions will average to zero. 
Within a single crystal, the orientations are certainly not random, and we would have to treat 
this case separately. 
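
$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int_{\mathcal{V}} \frac{\mathbf{P}(\mathbf{r}')\cdot\hat{𝓇}}{𝓇^2}\,d\tau'. \tag{4.9}$$

That does it, in principle. But a little sleight of hand casts this integral
into a much more illuminating form. Observing that

$$\nabla'\left(\frac{1}{𝓇}\right) = \frac{\hat{𝓇}}{𝓇^2},$$

where (unlike Prob. 1.13) the differentiation is with respect to the source
coordinates (r′), we have

$$V = \frac{1}{4\pi\epsilon_0}\int_{\mathcal{V}} \mathbf{P}\cdot\nabla'\left(\frac{1}{𝓇}\right)d\tau'.$$

Integrating by parts, using product rule number 5 (inside the front cover),
gives

$$V = \frac{1}{4\pi\epsilon_0}\left[\int_{\mathcal{V}} \nabla'\cdot\left(\frac{\mathbf{P}}{𝓇}\right)d\tau' - \int_{\mathcal{V}} \frac{1}{𝓇}(\nabla'\cdot\mathbf{P})\,d\tau'\right],$$

or, invoking the divergence theorem,

$$V = \frac{1}{4\pi\epsilon_0}\oint_{\mathcal{S}} \frac{1}{𝓇}\,\mathbf{P}\cdot d\mathbf{a}' - \frac{1}{4\pi\epsilon_0}\int_{\mathcal{V}} \frac{1}{𝓇}(\nabla'\cdot\mathbf{P})\,d\tau'. \tag{4.10}$$

The first term looks like the potential of a surface charge,

$$\sigma_b \equiv \mathbf{P}\cdot\hat{\mathbf{n}} \tag{4.11}$$

(where $\hat{\mathbf{n}}$ is the normal unit vector), while the second term looks like the
potential of a volume charge,

$$\rho_b \equiv -\nabla\cdot\mathbf{P}. \tag{4.12}$$

With these definitions, Eq. 4.10 becomes

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\oint_{\mathcal{S}} \frac{\sigma_b}{𝓇}\,da' + \frac{1}{4\pi\epsilon_0}\int_{\mathcal{V}} \frac{\rho_b}{𝓇}\,d\tau'. \tag{4.13}$$

This means that the potential (and hence also the field) of a polarized
object is the same as that produced by a volume charge density $\rho_b = -\nabla\cdot\mathbf{P}$
plus a surface charge density $\sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}}$. Instead of integrating the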
contributions of all the infinitesimal dipoles, as in Eq. 4.9, we could first
find those bound charges, and then calculate the fields they produce, in the 
same way we calculate the field of any other volume and surface charges 
(for example, using Gauss’s law). 


Example 4.2. Find the electric field produced by a uniformly polarized 
sphere of radius R. 


Solution. We might as well choose the z-axis to coincide with the
direction of polarization (Fig. 4.9). The volume bound charge density
$\rho_b$ is zero, since P is uniform, but


$$\sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}} = P\cos\theta,$$


where θ is the usual spherical coordinate. What we want, then, is
the field produced by a charge density P cos θ plastered over the
surface of a sphere. But we already computed the potential of such
a configuration (Eqs. 3.86 and 3.87):


$$V(r,\theta) = \begin{cases} \dfrac{P}{3\epsilon_0}\, r\cos\theta, & \text{for } r \le R, \\[2ex] \dfrac{P}{3\epsilon_0}\dfrac{R^3}{r^2}\cos\theta, & \text{for } r \ge R. \end{cases}$$


Since r cos θ = z, the field inside the sphere is uniform:

$$\mathbf{E} = -\nabla V = -\frac{P}{3\epsilon_0}\hat{\mathbf{z}} = -\frac{1}{3\epsilon_0}\mathbf{P}, \quad\text{for } r < R. \tag{4.14}$$


This remarkable result will be very useful in what follows. Outside the
sphere the potential is identical to that of a perfect dipole at the origin,

$$V = \frac{1}{4\pi\epsilon_0}\frac{\mathbf{p}\cdot\hat{\mathbf{r}}}{r^2}, \quad\text{for } r \ge R, \tag{4.15}$$

whose dipole moment is, not surprisingly, equal to the total dipole
moment of the sphere:


$$\mathbf{p} = \tfrac{4}{3}\pi R^3\,\mathbf{P}. \tag{4.16}$$


The field of the uniformly polarized sphere is shown in Fig. 4.10.
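
The potential quoted in Example 4.2 can be spot-checked by integrating the bound surface charge directly. The sketch below evaluates the surface integral of Eq. 4.13 at points on the positive z axis with a simple midpoint rule and compares with the closed form. The polarization `P`, radius `R`, sample points, and grid size `n` are assumed values, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, C^2/(N m^2)


def potential_on_axis(z, P, R, n=4000):
    """V at (0, 0, z) from sigma_b = P cos(theta) spread over a sphere of radius R.

    Midpoint rule in theta; by symmetry the phi integral contributes 2 pi.
    """
    dtheta = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        sigma = P * math.cos(theta)                               # bound surface charge
        sep = math.sqrt(R * R + z * z - 2.0 * R * z * math.cos(theta))  # source-to-field distance
        ring_area = 2.0 * math.pi * R * R * math.sin(theta) * dtheta
        total += sigma * ring_area / sep
    return total / (4.0 * math.pi * EPS0)


def potential_exact(z, P, R):
    """Closed form from Example 4.2 on the positive z axis (theta = 0, r = z)."""
    if z <= R:
        return P * z / (3.0 * EPS0)
    return P * R**3 / (3.0 * EPS0 * z**2)


P = 1.0e-6   # assumed polarization, C/m^2
R = 0.1      # assumed sphere radius, m

for z in (0.05, 0.2):   # one point inside, one outside
    print(f"z = {z}: numeric {potential_on_axis(z, P, R):.6e}, "
          f"closed form {potential_exact(z, P, R):.6e}")
```

Only on-axis points are checked here, which keeps the integral one-dimensional; an off-axis check would need the full two-dimensional surface integral.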

Problem 4.10. A sphere of radius R carries a polarization

$$\mathbf{P}(\mathbf{r}) = k\mathbf{r},$$

where k is a constant and r is the vector from the center.


(a) Calculate the bound charges $\sigma_b$ and $\rho_b$.
(b) Find the field inside and outside the sphere.


Problem 4.11. A short cylinder, of radius a and length L, carries a “frozen-in” uniform
polarization P, parallel to its axis. Find the bound charge, and sketch the electric field (i) for L ≫ a,
(ii) for L ≪ a, and (iii) for L ≈ a. [This is known as a bar electret; it is the electrical analog
to a bar magnet. In practice, only very special materials — barium titanate is the most “familiar” 
example — will hold a permanent electric polarization. That’s why you can’t buy electrets at the 
toy store.] 


Problem 4.12. Calculate the potential of a uniformly polarized sphere (Ex. 4.2) directly from 
Eq. 4.9. 


4.2.2 Physical Interpretation of Bound Charges 


In the last section we found that the field of a polarized object is identical 
to the field that would be produced by a certain distribution of “bound 
charges,” $\sigma_b$ and $\rho_b$. But this conclusion emerged in the course of abstract
manipulations of the integral in Eq. 4.9, and left us with no clue as to the 
physical meaning of these bound charges. Indeed, some authors give you 
the impression that bound charges are in some sense “fictitious” — mere 
bookkeeping devices used to facilitate the calculation of fields. Nothing 
could be further from the truth: $\rho_b$ and $\sigma_b$ represent perfectly genuine
accumulations of charge. In this section I’ll explain how polarization leads
to these charge distributions. 

The basic idea is very simple: Suppose we have a long string of dipoles, 
as shown in Fig. 4.11. Along the line, the head of one effectively cancels the 
tail of its neighbor, but at the ends there are two charges left over: plus at 
the right-hand end and minus at the left. It is as if we had peeled off an 
electron at one end and carried it all the way down to the other end, though 
in fact no single electron made the whole trip — a lot of tiny displacements 
add up to one large one. We call the net charge at the ends bound charge to 
remind ourselves that it cannot be removed; in a dielectric every electron is 
attached to a specific atom or molecule. But apart from that, bound charge 
is no different from any other kind. 

To calculate the actual amount of bound charge resulting from a given 
polarization, examine a “tube” of dielectric parallel to P. The dipole 
moment of the tiny chunk shown in Fig. 4.12 is P(Ad), where A is the cross- 
sectional area of the tube and d is the length of the chunk. In terms of the 
charge (q) at the end, this same dipole moment can be written as qd. The 
bound charge that piles up at the right-hand end of the tube is therefore 


q = PA. 
If the ends have been sliced off perpendicularly, the surface charge density is


$$\sigma_b = \frac{q}{A} = P.$$


For an oblique cut (Fig. 4.13), the charge is still the same, but $A = A_{\text{end}}\cos\theta$,
so

$$\sigma_b = \frac{q}{A_{\text{end}}} = P\cos\theta = \mathbf{P}\cdot\hat{\mathbf{n}}.$$

The effect of the polarization, then, is to paint a bound charge $\sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}}$
over the surface of the material. This is exactly what we found by more
rigorous means in Section 4.2.1. But now we know where the bound charge
comes from.

If the polarization is nonuniform, we get accumulations of bound charge
within the material, as well as on the surface. A glance at Fig. 4.14 suggests
that a diverging P results in a pileup of negative charge. Indeed, the net
bound charge $\int \rho_b\,d\tau$ in a given volume is equal and opposite to the
amount that has been pushed out through the surface. The latter (by the
same reasoning we used before) is $\mathbf{P}\cdot\hat{\mathbf{n}}$ per unit area, so


$$\int_{\mathcal{V}} \rho_b\,d\tau = -\oint_{\mathcal{S}} \mathbf{P}\cdot d\mathbf{a} = -\int_{\mathcal{V}} (\nabla\cdot\mathbf{P})\,d\tau.$$


Since this is true for any volume, we have

$$\rho_b = -\nabla\cdot\mathbf{P},$$


confirming, again, the more rigorous conclusion of Section 4.2.1.


Example 4.3. There is another way of analyzing the uniformly polar- 
ized sphere (Ex. 4.2), which nicely illustrates the idea of a bound 
charge. What we have, really, is two spheres of charge: a positive 
sphere and a negative sphere. Without polarization the two are super- 
imposed, and cancel completely. But when the material is uniformly 
polarized, all the plus charges move slightly upward (the z direc- 
tion), and all the minus charges move slightly downward (Fig. 4.15). 
The two spheres no longer coincide perfectly: at the top there’s a
“cap” of leftover positive charge and at the bottom a cap of negative
charge. This “leftover” charge is precisely the bound surface
charge $\sigma_b$.

In Prob. 2.19, you calculated the field in the region of overlap
between two oppositely charged spheres; the answer was

$$\mathbf{E} = -\frac{1}{4\pi\epsilon_0}\frac{q\mathbf{d}}{R^3},$$

where q is the total charge of the positive sphere, d is the vector from
the negative center to the positive center, and R is the radius of the
sphere. We can express this in terms of the polarization of the sphere,
$\mathbf{p} = q\mathbf{d} = (\tfrac{4}{3}\pi R^3)\mathbf{P}$, as


$$\mathbf{E} = -\frac{1}{3\epsilon_0}\mathbf{P}.$$

Meanwhile, for points outside, it is as though all the charge on each
sphere were concentrated at the respective centers. We have, then, a
dipole, with potential


$$V = \frac{1}{4\pi\epsilon_0}\frac{\mathbf{p}\cdot\hat{\mathbf{r}}}{r^2}.$$


(Remember that d is some small fraction of an atomic radius; Fig. 4.15 
is grossly exaggerated.) These answers agree, of course, with the results 
of Ex. 4.2. 
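
The two-sphere picture of Example 4.3 is easy to test numerically: superpose the fields of a positive and a negative uniformly charged sphere whose centers are offset by d, and look at points inside the overlap. In this sketch the charge, radius, and (deliberately exaggerated) offset are assumed values, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12
K = 1.0 / (4.0 * math.pi * EPS0)   # 1/(4 pi eps0)


def E_uniform_sphere(q, R, center, r):
    """Field at r of a uniformly charged sphere (total charge q, radius R)."""
    s = tuple(a - b for a, b in zip(r, center))   # vector from the sphere's center to r
    dist = math.sqrt(sum(c * c for c in s))
    if dist < R:
        return tuple(K * q * c / R**3 for c in s)      # grows linearly inside
    return tuple(K * q * c / dist**3 for c in s)       # point-charge form outside


def E_overlap(q, R, d, r):
    """Total field at r from +q centered at +d/2 and -q centered at -d/2."""
    c_plus = tuple(0.5 * c for c in d)
    c_minus = tuple(-0.5 * c for c in d)
    E_p = E_uniform_sphere(q, R, c_plus, r)
    E_m = E_uniform_sphere(-q, R, c_minus, r)
    return tuple(a + b for a, b in zip(E_p, E_m))


q = 1.0e-9                 # assumed total charge of the positive sphere, C
R = 0.1                    # assumed radius, m
d = (0.0, 0.0, 1.0e-3)     # assumed offset, negative center to positive center, m

# Polarization equivalent to the pair: p = q d = (4/3) pi R^3 P
P = tuple(q * c / ((4.0 / 3.0) * math.pi * R**3) for c in d)
E_from_P = tuple(-c / (3.0 * EPS0) for c in P)     # -P / (3 eps0)

for point in ((0.0, 0.0, 0.0), (0.02, -0.03, 0.04)):
    print(point, E_overlap(q, R, d, point), E_from_P)
```

The superposed field comes out the same at both sample points and matches −P/(3ε₀), which is the uniform interior field of Ex. 4.2.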


Problem 4.13. A very long cylinder, of radius a, carries a uniform polarization P perpendicular
to its axis. Find the electric field inside the cylinder. Show that the field outside the cylinder can
be expressed in the form


$$\mathbf{E}(\mathbf{r}) = \frac{a^2}{2\epsilon_0 s^2}\left[2(\mathbf{P}\cdot\hat{\mathbf{s}})\hat{\mathbf{s}} - \mathbf{P}\right].$$


[Careful: I said “uniform,” not “radial”!]


Problem 4.14. When you polarize a neutral dielectric, the charge moves a bit, but the total 
remains zero. This fact should be reflected in the bound charges $\sigma_b$ and $\rho_b$. Prove from Eqs. 4.11
and 4.12 that the total bound charge vanishes. 


4.2.3 Microscopic and Macroscopic Fields 


I have been sloppy about the distinction between “pure” dipoles and
“physical” dipoles.⁵ In developing the theory of bound charges, I assumed
we were working with the pure kind — indeed, I started with Eq. 4.8, 
the formula for the potential of a perfect dipole. And yet, an actual 
polarized dielectric consists of physical dipoles, albeit extremely tiny ones. 
What is more, I presumed to represent discrete molecular dipoles by a 
continuous density function P. How can I justify this method? Outside 


5 This section can be skipped without loss of continuity. 




the dielectric there is no real problem: here we are far away from the 
molecules (𝓇 is many times greater than the separation distance between
plus and minus charges), so the dipole potential dominates overwhelm- 
ingly and the detailed “graininess” of the source is blurred by distance. 
Inside the dielectric, however, we can hardly pretend to be far from all 
the dipoles, and the procedure I used in Section 4.2.1 is open to serious 
challenge. 

In fact, when you stop to think about it, the electric field inside matter 
must be fantastically complicated, on the microscopic level. If you happen 
to be very near an electron, the field is gigantic, whereas a short distance 
away it may be small or point in a totally different direction. Moreover, 
an instant later, as the charges move around, the field will have altered 
entirely. This true microscopic field would be utterly impossible to calculate; 
and it wouldn't be of much interest if you could. Just as, for macroscopic 
purposes, we regard water as a continuous fluid, ignoring its molecular 
structure, so also we can ignore the microscopic bumps and wrinkles in the 
electric field inside matter, and concentrate on the macroscopic field. This 
is defined as the average field over regions large enough to contain many 
thousands of atoms (so that the uninteresting microscopic fluctuations are 
smoothed over), and yet small enough to ensure that we do not wash out 
any significant large-scale variations in the field. (In practice, this means 
we must average over regions much smaller than the dimensions of the 
object itself.) Ordinarily, the macroscopic field is what people mean when 
they speak of “the” field inside matter.⁶

But is the macroscopic field what we actually get, when we use the 
methods of Section 4.2.1? The argument is subtle, so hang on. Suppose I 
want to calculate the macroscopic field at some point r within a dielectric 
(Fig. 4.16). I know I must average the true (microscopic) field over an 
appropriate volume, so let me draw a small sphere about r, of radius, say, 
a thousand times the size of a molecule. The macroscopic field at r, then, 
consists of two parts: the average field over the sphere due to all charges 
outside, plus the average due to all charges inside: 


$$\mathbf{E} = \mathbf{E}_{\text{out}} + \mathbf{E}_{\text{in}}.$$


6 In case the notion of macroscopic fields sounds suspicious to you, let me point out that you
do exactly the same averaging whenever you speak of the density of a material.

You proved in Prob. 3.51d that the average field (over a sphere), produced
by charges outside, is equal to the field they produce at the center, so $\mathbf{E}_{\text{out}}$ is
the field at r due to the dipoles exterior to the sphere. These are far enough
away that we can safely use Eq. 4.9:


$$V_{\text{out}} = \frac{1}{4\pi\epsilon_0}\int_{\text{outside}} \frac{\mathbf{P}(\mathbf{r}')\cdot\hat{𝓇}}{𝓇^2}\,d\tau'. \tag{4.17}$$


The dipoles inside the sphere are too close to treat in this fashion. But fortunately
all we need is their average field, and that, according to Eq. 3.105, is


$$\mathbf{E}_{\text{in}} = -\frac{1}{4\pi\epsilon_0}\frac{\mathbf{p}}{R^3},$$


regardless of the details of the charge distribution within the sphere. The
only relevant quantity is the total dipole moment, $\mathbf{p} = (\tfrac{4}{3}\pi R^3)\mathbf{P}$:


$$\mathbf{E}_{\text{in}} = -\frac{1}{3\epsilon_0}\mathbf{P}. \tag{4.18}$$


Now, by assumption, the sphere is small enough that P does not vary
significantly over its volume, so the term left out of the integral in Eq. 4.17
corresponds to the field at the center of a uniformly polarized sphere, to wit,
$-(1/3\epsilon_0)\mathbf{P}$ (Eq. 4.14). But this is precisely what $\mathbf{E}_{\text{in}}$ (Eq. 4.18) puts back in!
The macroscopic field, then, is given by the potential


$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int \frac{\mathbf{P}(\mathbf{r}')\cdot\hat{𝓇}}{𝓇^2}\,d\tau', \tag{4.19}$$


where the integral runs over the entire volume of the dielectric. This 
is exactly what we used in Section 4.2.1; without realizing it, we were 
correctly calculating the averaged, macroscopic field, for points inside the 
dielectric. 

You may have to reread the last couple of paragraphs for the argument 
to sink in. Notice that it all revolves around the curious fact that the average 
field over any sphere (due to the charge inside) is the same as the field 
at the center of a uniformly polarized sphere with the same total dipole 
moment. This means that no matter how crazy the actual microscopic 
charge configuration, we can replace it by a nice smooth distribution of 
perfect dipoles, if all we want is the macroscopic (average) field. Inciden- 
tally, while the argument ostensibly relies on the spherical shape I chose to 
average over, the macroscopic field is certainly independent of the geometry 
of the averaging region, and this is reflected in the final answer, Eq. 4.19. 
Presumably one could reproduce the same argument for a cube or an 
ellipsoid or whatever — the calculation might be more difficult, but the 
conclusion would be the same. 


4.3 The Electric Displacement


4.3.1 Gauss’s Law in the Presence of Dielectrics 


In Section 4.2 we found that the effect of polarization is to produce
accumulations of (bound) charge, $\rho_b = -\nabla\cdot\mathbf{P}$ within the dielectric and
$\sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}}$ on the surface. The field due to polarization of the medium is
just the field of this bound charge. We are now ready to put it all together:
the field attributable to bound charge plus the field due to everything else
(which, for want of a better term, we call free charge, $\rho_f$). The free charge
might consist of electrons on a conductor or ions embedded in the dielectric
material or whatever; any charge, in other words, that is not associated
with P. Within the dielectric, the total charge density can be written


$$\rho = \rho_b + \rho_f, \tag{4.20}$$

and Gauss’s law reads

$$\epsilon_0\nabla\cdot\mathbf{E} = \rho = \rho_b + \rho_f = -\nabla\cdot\mathbf{P} + \rho_f,$$


where E is now the total field, not just the part generated by polarization.
It is convenient to combine the two divergence terms:


$$\nabla\cdot(\epsilon_0\mathbf{E} + \mathbf{P}) = \rho_f.$$

The expression in parentheses, designated by the letter D,


$$\mathbf{D} \equiv \epsilon_0\mathbf{E} + \mathbf{P}, \tag{4.21}$$


is known as the electric displacement. In terms of D, Gauss’s law reads


$$\nabla\cdot\mathbf{D} = \rho_f, \tag{4.22}$$


or, in integral form,

$$\oint \mathbf{D}\cdot d\mathbf{a} = Q_{f_{\text{enc}}}, \tag{4.23}$$


where $Q_{f_{\text{enc}}}$ denotes the total free charge enclosed in the volume. This is a
particularly useful way to express Gauss’s law, in the context of dielectrics, 
because it makes reference only to free charges, and free charge is the stuff we 
control. Bound charge comes along for the ride: when we put the free charge 
in place, a certain polarization automatically ensues, by the mechanisms 
of Section 4.1, and this polarization produces the bound charge. In a 
typical problem, therefore, we know $\rho_f$, but we do not (initially) know $\rho_b$;
Eq. 4.23 lets us go right to work with the information at hand. In particular, 
whenever the requisite symmetry is present, we can immediately calculate 
D by the standard Gauss’s law methods. 


Example 4.4. A long straight wire, carrying uniform line charge λ, is
surrounded by rubber insulation out to a radius a (Fig. 4.17). Find the
electric displacement.


[Fig. 4.17; label: Gaussian surface]


Solution. Drawing a cylindrical Gaussian surface, of radius s and
length L, and applying Eq. 4.23, we find


$$D(2\pi s L) = \lambda L.$$


Therefore,

$$\mathbf{D} = \frac{\lambda}{2\pi s}\hat{\mathbf{s}}. \tag{4.24}$$


Notice that this formula holds both within the insulation and outside
it. In the latter region, P = 0, so


$$\mathbf{E} = \frac{1}{\epsilon_0}\mathbf{D} = \frac{\lambda}{2\pi\epsilon_0 s}\hat{\mathbf{s}}, \quad\text{for } s > a.$$


Inside the rubber, the electric field cannot be determined, since we do
not know P.
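
Example 4.4 translates into a few lines of code, which also makes the logical point explicit: D follows from the free charge alone, while E is available only where P is known. This is a sketch with assumed numbers; the function names and the choice to raise an error inside the insulation are illustrative, not part of the text.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, C^2/(N m^2)


def displacement_magnitude(lam, s):
    """Eq. 4.24: D = lambda / (2 pi s), pointing radially outward, for any s > 0."""
    return lam / (2.0 * math.pi * s)


def field_outside(lam, s, a):
    """E = D / eps0, valid only outside the insulation (s > a), where P = 0."""
    if s <= a:
        raise ValueError("E inside the insulation requires P, which is not given")
    return displacement_magnitude(lam, s) / EPS0


lam = 2.0e-9    # assumed line charge, C/m
a = 0.005       # assumed outer radius of the insulation, m
L = 0.3         # assumed length of the Gaussian cylinder, m
s_in, s_out = 0.002, 0.02   # one radius inside the rubber, one outside

# Gauss's law for D (Eq. 4.23): flux through the curved surface equals enclosed free charge.
flux = displacement_magnitude(lam, s_out) * 2.0 * math.pi * s_out * L

print(f"D at s = {s_in} m : {displacement_magnitude(lam, s_in):.3e} C/m^2")
print(f"D at s = {s_out} m: {displacement_magnitude(lam, s_out):.3e} C/m^2")
print(f"flux of D = {flux:.3e} C, enclosed free charge = {lam * L:.3e} C")
print(f"E at s = {s_out} m: {field_outside(lam, s_out, a):.3e} V/m")
```

Calling `field_outside` with s ≤ a raises an error on purpose, mirroring the closing remark of the example.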


It may appear to you that I left out the surface bound charge $\sigma_b$ in
deriving Eq. 4.22, and in a sense that is true. We cannot apply Gauss’s law
precisely at the surface of a dielectric, for here $\rho_b$ blows up,⁷ taking the
divergence of E with it. But everywhere else the logic is sound, and in fact if
we picture the edge of the dielectric as having some finite thickness, within
which the polarization tapers off to zero (probably a more realistic model
than an abrupt cutoff anyway), then there is no surface bound charge;
$\rho_b$ varies rapidly but smoothly within this “skin,” and Gauss’s law can be
safely applied everywhere. At any rate, the integral form (Eq. 4.23) is free 
from this “defect.” 


Problem 4.15. A thick spherical shell (inner radius a, outer radius b) is made of dielectric
material with a “frozen-in” polarization


$$\mathbf{P}(\mathbf{r}) = \frac{k}{r}\,\hat{\mathbf{r}},$$


7 The polarization drops abruptly to zero outside the material, so its derivative is a delta
function (see Prob. 1.46). The surface bound charge is precisely this term; in this sense
it is actually included in $\rho_b$, but we ordinarily prefer to handle it separately as $\sigma_b$.

[Fig. 4.18]   [Fig. 4.19: (a) Sphere, (b) Needle, (c) Wafer]


where k is a constant and r is the distance from the center (Fig. 4.18). (There is no free charge in 
the problem.) Find the electric field in all three regions by two different methods: 


(a) Locate all the bound charge, and use Gauss’s law (Eq. 2.13) to calculate the field it produces. 
(b) Use Eq. 4.23 to find D, and then get E from Eq. 4.21. [Notice that the second method is 
much faster, and it avoids any explicit reference to the bound charges. ] 


Problem 4.16. Suppose the field inside a large piece of dielectric is $\mathbf{E}_0$, so that the electric
displacement is $\mathbf{D}_0 = \epsilon_0\mathbf{E}_0 + \mathbf{P}$.


(a) Now a small spherical cavity (Fig. 4.19a) is hollowed out of the material. Find the field at
the center of the cavity in terms of $\mathbf{E}_0$ and P. Also find the displacement at the center of
the cavity in terms of $\mathbf{D}_0$ and P. Assume the polarization is “frozen in,” so it doesn’t change
when the cavity is excavated.

(b) Do the same for a long needle-shaped cavity running parallel to P (Fig. 4.19b).

(c) Do the same for a thin wafer-shaped cavity perpendicular to P (Fig. 4.19c).


Assume the cavities are small enough that P, $\mathbf{E}_0$, and $\mathbf{D}_0$ are essentially uniform. [Hint:
Carving out a cavity is the same as superimposing an object of the same shape but opposite
polarization.]


4.3.2 A Deceptive Parallel 


Equation 4.22 looks just like Gauss’s law, only the total charge density p is 
replaced by the free charge density $\rho_f$, and D is substituted for $\epsilon_0\mathbf{E}$. For this
reason, you may be tempted to conclude that D is “just like” E (apart from
the factor $\epsilon_0$), except that its source is $\rho_f$ instead of $\rho$: “To solve problems
involving dielectrics, you just forget all about the bound charge — calculate 
the field as you ordinarily would, only call the answer D instead of E.” This 
reasoning is seductive, but the conclusion is false; in particular, there is no 
“Coulomb’s law” for D: 


$$\mathbf{D}(\mathbf{r}) \neq \frac{1}{4\pi}\int \frac{\hat{𝓇}}{𝓇^2}\,\rho_f(\mathbf{r}')\,d\tau'.$$


The parallel between E and D is more subtle than that. 
For the divergence alone is insufficient to determine a vector field; you
need to know the curl as well. One tends to forget this in the case of
electrostatic fields because the curl of E is always zero. But the curl of D
is not always zero:


$$\nabla\times\mathbf{D} = \epsilon_0(\nabla\times\mathbf{E}) + (\nabla\times\mathbf{P}) = \nabla\times\mathbf{P}, \tag{4.25}$$


and there is no reason, in general, to suppose that the curl of P vanishes.
Sometimes it does, as in Ex. 4.4 and Prob. 4.15, but more often it does not.
The bar electret of Prob. 4.11 is a case in point: here there is no free charge
anywhere, so if you really believe that the only source of D is $\rho_f$, you will
be forced to conclude that D = 0 everywhere, and hence that $\mathbf{E} = (-1/\epsilon_0)\mathbf{P}$
inside and E = 0 outside the electret, which is obviously wrong. (I leave it for
you to find the place where $\nabla\times\mathbf{P} \neq 0$ in this problem.) Because $\nabla\times\mathbf{D} \neq 0$,
moreover, D cannot be expressed as the gradient of a scalar; there is no
“potential” for D.

Advice: When you are asked to compute the electric displacement, first 
look for symmetry. If the problem exhibits spherical, cylindrical, or plane 
symmetry, then you can get D directly from Eq. 4.23 by the usual Gauss’s 
law methods. (Evidently, in such cases $\nabla\times\mathbf{P}$ is automatically zero, but since
symmetry alone dictates the answer, you're not really obliged to worry about 
the curl.) If the requisite symmetry is absent, you'll have to think of another 
approach, and, in particular, you must not assume that D is determined 
exclusively by the free charge. 


4.3.3 Boundary Conditions 


The electrostatic boundary conditions of Section 2.3.5 can be recast in 
terms of D. Equation 4.23 tells us the discontinuity in the component 
perpendicular to an interface: 


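$$D^{\perp}_{\text{above}} - D^{\perp}_{\text{below}} = \sigma_f, \tag{4.26}$$

while Eq. 4.25 gives the discontinuity in parallel components:


$$\mathbf{D}^{\parallel}_{\text{above}} - \mathbf{D}^{\parallel}_{\text{below}} = \mathbf{P}^{\parallel}_{\text{above}} - \mathbf{P}^{\parallel}_{\text{below}}. \tag{4.27}$$


In the presence of dielectrics, these are sometimes more useful than the
corresponding boundary conditions on E (Eqs. 2.31 and 2.32):


$$E^{\perp}_{\text{above}} - E^{\perp}_{\text{below}} = \frac{1}{\epsilon_0}\sigma, \tag{4.28}$$

and

$$\mathbf{E}^{\parallel}_{\text{above}} - \mathbf{E}^{\parallel}_{\text{below}} = \mathbf{0}. \tag{4.29}$$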


You might try applying them, for example, to Probs. 4.16 and 4.17. 
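
As a sanity check on Eqs. 4.26 to 4.29, one can test them on the uniformly polarized sphere of Ex. 4.2, where both sides of the surface are known in closed form. In this sketch "above" is taken to mean just outside the sphere, the perpendicular direction is radial, and the parallel direction is the θ direction. The exterior field is the standard field of a point dipole with p = (4/3)πR³P, which is an input assumed here rather than derived. The polarization and angle are illustrative values, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, C^2/(N m^2)


def fields_just_inside(P, theta):
    """(E_r, E_theta, P_r, P_theta) just inside r = R, using E = -P/(3 eps0) (Eq. 4.14)."""
    P_r = P * math.cos(theta)     # z-hat = cos(theta) r-hat - sin(theta) theta-hat
    P_t = -P * math.sin(theta)
    return -P_r / (3.0 * EPS0), -P_t / (3.0 * EPS0), P_r, P_t


def fields_just_outside(P, theta):
    """Same tuple just outside r = R: dipole field, and no polarization in vacuum."""
    E_r = 2.0 * P * math.cos(theta) / (3.0 * EPS0)
    E_t = P * math.sin(theta) / (3.0 * EPS0)
    return E_r, E_t, 0.0, 0.0


def jumps(P, theta):
    """Return (dD_perp, dD_par, dP_par, dE_perp, dE_par), each as outside minus inside."""
    E_r_in, E_t_in, P_r_in, P_t_in = fields_just_inside(P, theta)
    E_r_out, E_t_out, P_r_out, P_t_out = fields_just_outside(P, theta)
    dD_perp = (EPS0 * E_r_out + P_r_out) - (EPS0 * E_r_in + P_r_in)
    dD_par = (EPS0 * E_t_out + P_t_out) - (EPS0 * E_t_in + P_t_in)
    dP_par = P_t_out - P_t_in
    dE_perp = E_r_out - E_r_in
    dE_par = E_t_out - E_t_in
    return dD_perp, dD_par, dP_par, dE_perp, dE_par


P = 1.0e-6     # assumed polarization, C/m^2
theta = 0.7    # assumed polar angle of the surface point, rad

dD_perp, dD_par, dP_par, dE_perp, dE_par = jumps(P, theta)
sigma_b = P * math.cos(theta)   # the only surface charge present (there is no free charge)

print(f"jump in D_perp = {dD_perp:.3e}   (sigma_f = 0)")
print(f"jump in D_par  = {dD_par:.3e},  jump in P_par = {dP_par:.3e}")
print(f"jump in E_perp = {dE_perp:.3e},  sigma_b/eps0 = {sigma_b / EPS0:.3e}")
print(f"jump in E_par  = {dE_par:.3e}")
```

Here the total surface charge σ in Eq. 4.28 is just the bound charge σ_b = P cos θ, since the sphere carries no free charge.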

Problem 4.17. For the bar electret of Prob. 4.11, make three careful sketches: one of P, one of
E, and one of D. Assume L is about 2a. [Hint: E lines terminate on charges; D lines terminate 
on free charges.] 


4.3.4 The Crystal Ambiguity 


I began this chapter⁸ with a description of dielectrics as neutral atoms or
molecules that can be stretched or rotated by an electric field to form a
collection of microscopic aligned dipoles. But what about an ionic crystal,
such as ordinary table salt? It’s a matrix of positive sodium ions (Na⁺) and
negative chlorine ions (Cl⁻), as indicated schematically in Fig. 4.20. It’s not
a conductor (the electrons are attached to particular atoms); it behaves, in 
fact, as a kind of dielectric. But it does raise an intriguing puzzle: What is 
its polarization, P? 

You see, it’s not clear which chlorine ion should be associated with a given 
sodium ion, to make a dipole p — the one above, or the one below, or the one 
to the left, or the one to the right? If you pick the ones below (Fig. 4.21a),
then all the dipoles will be pointing upward; if you pick the ones above,
they’re all pointing downward (Fig. 4.21b); pick the ones to the left, and the
dipoles all point to the right (Fig. 4.21c); pick the ones to the right, and the
dipoles all point to the left (Fig. 4.21d). So P could point up, down, left, or
right (or for that matter in, or out), depending on how you decide to form 
the dipoles! In each case there will be some leftover charges at the edges, 




8 This section can be skipped without loss of continuity. 
which (since they are not associated with the basic dipoles) we identify as
“free” charge, $\sigma_f$. It exactly cancels the “bound” surface charge, $\sigma_b$, in each
case. But where it is located, and what sign it carries, depends (again) on 
our arbitrary choice. So not only is the polarization ambiguous, but so too 
is the distinction between free and bound charge. 

We could even arrange for zero polarization (Fig. 4.22), with no free or 
bound charge. Indeed, in some sense this is the most “natural” choice. But 
in truth all of these different arrangements are equally “correct”; they all 
represent the same physical configuration. What we have discovered is that 
(at least in the case of ionic crystals), the notions of “dipole moment per 
unit volume,” “free charge,” and “bound charge” (and therefore also the 
electric displacement D) are intrinsically ill defined. 

But it doesn’t really matter. What matters is the total charge $\rho$. In the
salt crystal (Fig. 4.20) everybody agrees that the total (macroscopic) charge 
density is zero, and the (macroscopic) electric field vanishes; they argue only 
about how to chop the charge into free and bound parts, and how to com- 
bine the ions to form dipoles. At the end of the day, it is changes in polariza- 
tion (when, for example, you turn on a field and the ions adjust slightly in 
response) that carry genuine significance. This is where the physics resides, 
not in the story we tell ourselves to help us visualize what is happening. 
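
The bookkeeping ambiguity can be made concrete with a toy model. The sketch below is only an illustration under stated assumptions: a one-dimensional chain of alternating ions (not the two-dimensional lattice of Fig. 4.21), with unit charge and unit spacing chosen arbitrarily, and with hypothetical helper names. It pairs the ions into dipoles in two different ways and shows that the pair dipoles flip sign while the total dipole moment of the chain, leftover end ions included, does not change.

```python
# Toy 1D "crystal": alternating +q and -q ions with spacing a (illustrative values only).
def chain(n_pairs, q=1.0, a=1.0):
    """Return [(position, charge), ...] for the ions + - + - ... along a line."""
    return [(i * a, q if i % 2 == 0 else -q) for i in range(2 * n_pairs)]


def dipole_moment(ions):
    """Sum of q_i * x_i; independent of the origin when the ions are neutral overall."""
    return sum(x * q for x, q in ions)


def paired_moment(ions, start):
    """Pair ion k with ion k+1 for k = start, start+2, ...

    Returns (total moment of the pair dipoles, moment of the unpaired leftover ions).
    """
    paired = set()
    p_pairs = 0.0
    for k in range(start, len(ions) - 1, 2):
        (x1, q1), (x2, q2) = ions[k], ions[k + 1]
        p_pairs += q1 * x1 + q2 * x2
        paired.update((k, k + 1))
    leftover = [ion for i, ion in enumerate(ions) if i not in paired]
    return p_pairs, dipole_moment(leftover)


ions = chain(4)
p_right, extra_right = paired_moment(ions, 0)  # each + joined to the - on its right
p_left, extra_left = paired_moment(ions, 1)    # each + joined to the - on its left
print(p_right, extra_right)   # pair dipoles point one way, no leftover ions
print(p_left, extra_left)     # pair dipoles point the other way, two leftover end ions
print(dipole_moment(ions))    # the physical total is the same in both bookkeepings
```

The two pairings disagree about the "polarization" and about which charges count as leftover, yet they describe the same set of charges, which is the point of the section.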


4.4 Linear Dielectrics 


4.4.1 Susceptibility, Permittivity, Dielectric Constant 


In Sections 4.2 and 4.3 we did not commit ourselves as to the cause of 
P; we dealt only with the effects of polarization. From the qualitative 
discussion of Section 4.1, though, we know that the polarization of a 
dielectric ordinarily results from an applied electric field, which lines up the 
atomic or molecular dipoles. For many substances, in fact, the polarization 
is proportional to the field, provided E is not too strong:⁹


\[ \mathbf{P} = \epsilon_0 \chi_e \mathbf{E}. \tag{4.30} \]


9 Notice that Eq. 4.30 removes any ambiguity in the case of crystal dielectrics (Section 4.3.4).
The constant of proportionality, $\chi_e$, is called the electric susceptibility of
the medium (a factor of $\epsilon_0$ has been extracted to make $\chi_e$ dimensionless).
The value of $\chi_e$ depends on the microscopic structure of the substance in
question (and also on external conditions such as temperature). I shall call
materials that obey Eq. 4.30 linear dielectrics.¹⁰

Note that E in Eq. 4.30 is the total field; it may be due in part to free 
charges and in part to the polarization itself. If, for instance, we put a piece 
of dielectric into an external field $\mathbf{E}_0$, we cannot compute P directly from
Eq. 4.30; the external field will polarize the material, and this polarization 
will produce its own field, which then contributes to the total field, and this 
in turn modifies the polarization, which .... Breaking out of this infinite 
regress is not always easy. You'll see some examples in a moment. The 
simplest approach is to begin with the displacement, at least in those cases 
where D can be deduced directly from the free charge distribution. 
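
To see how the regress can misbehave, here is a small sketch under an assumed geometry that is not taken from the text: an infinite slab of linear dielectric with its faces perpendicular to a uniform external field $E_0$. In that case the bound surface charge $\pm P$ contributes a field $-P/\epsilon_0$ inside the slab, so successive approximations read $E_{n+1} = E_0 - \chi_e E_n$, while the displacement route ($D = \epsilon_0 E_0$, then Eq. 4.32) gives $E = E_0/(1+\chi_e)$ directly. The function names are hypothetical.

```python
def iterate_field(e0, chi_e, steps):
    """Successive approximations E_{n+1} = E0 - chi_e * E_n for the assumed slab geometry."""
    e = e0                       # first guess: ignore the polarization entirely
    history = [e]
    for _ in range(steps):
        p_over_eps0 = chi_e * e  # P / eps0 from Eq. 4.30, using the current guess for E
        e = e0 - p_over_eps0     # external field plus the field of the bound surface charge
        history.append(e)
    return history


def exact_field(e0, chi_e):
    """Field obtained by starting from D instead: E = E0 / (1 + chi_e)."""
    return e0 / (1.0 + chi_e)


print(iterate_field(1.0, 0.5, 60)[-1], exact_field(1.0, 0.5))  # iteration settles down
print(iterate_field(1.0, 2.0, 60)[-1], exact_field(1.0, 2.0))  # iteration runs away
```

In this toy setup the naive iteration converges only for $\chi_e < 1$, whereas starting from D gives the answer in one step for any susceptibility.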

In linear media we have 


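\[ \mathbf{D} = \epsilon_0 \mathbf{E} + \mathbf{P} = \epsilon_0 \mathbf{E} + \epsilon_0 \chi_e \mathbf{E} = \epsilon_0 (1 + \chi_e)\mathbf{E}, \tag{4.31} \]
so D is also proportional to E:
\[ \mathbf{D} = \epsilon \mathbf{E}, \tag{4.32} \]


where
\[ \epsilon \equiv \epsilon_0 (1 + \chi_e). \tag{4.33} \]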


This new constant $\epsilon$ is called the permittivity of the material. (In vacuum,
where there is no matter to polarize, the susceptibility is zero, and the
permittivity is $\epsilon_0$. That’s why $\epsilon_0$ is called the permittivity of free space. I
dislike the term, for it suggests that the vacuum is just a special kind of
linear dielectric, in which the permittivity happens to have the value
$8.85 \times 10^{-12}\ \mathrm{C^2/N\,m^2}$.) If you remove a factor of $\epsilon_0$, the remaining dimensionless
quantity,

\[ \epsilon_r \equiv 1 + \chi_e = \frac{\epsilon}{\epsilon_0}, \tag{4.34} \]

is called the relative permittivity, or dielectric constant, of the material. 
Dielectric constants for some common substances are listed in Table 4.2. 
(Notice that $\epsilon_r$ is greater than 1, for all ordinary materials; this just reflects
the fact that polarization is typically in the same direction as the field.) 
Of course, the permittivity and the dielectric constant do not convey any 
information that was not already available in the susceptibility, nor is there 
anything essentially new in Eq. 4.32; the physics of linear dielectrics is all 
contained in Eq. 4.30.¹¹
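
Since the three constants carry the same information, converting between them is a one-liner each. The sketch below is illustrative only: the function names are hypothetical, and the value used for $\epsilon_0$ is the usual rounded SI figure.

```python
EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def describe(eps_r):
    """Return (susceptibility, permittivity) for a given dielectric constant."""
    chi_e = eps_r - 1.0          # Eq. 4.34 rearranged
    eps = EPS0 * (1.0 + chi_e)   # Eq. 4.33
    return chi_e, eps


def polarization_magnitude(eps_r, e_field):
    """|P| = eps0 * chi_e * |E| for a linear dielectric (Eq. 4.30)."""
    chi_e, _ = describe(eps_r)
    return EPS0 * chi_e * e_field


for name, eps_r in [("benzene", 2.28), ("water", 80.1)]:
    chi_e, eps = describe(eps_r)
    print(name, chi_e, eps)
```

The dielectric constants used in the loop are the benzene and water entries of Table 4.2.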


10 In modern optical applications especially, nonlinear materials have become increasingly 
important. For these there is a second term in the formula for P as a function of E — typically 
a cubic term. In general, Eq. 4.30 can be regarded as the first (nonzero) term in the Taylor 
expansion of P in powers of E. 

11 As long as we are engaged in this excess of unnecessary terminology and notation, I might
as well mention that formulas for D in terms of E (Eq. 4.32, in the case of linear dielectrics)
are called constitutive relations.




Table 4.2. Dielectric constants (unless otherwise specified, values given are for 1 atm, 20°C). 
Data from Handbook of Chemistry and Physics, 91st ed., CRC Press, Boca Raton (2010). 


Material               Dielectric constant    Material            Dielectric constant
Vacuum                 1                      Benzene             2.28
Helium                 1.000065               Diamond             5.7-5.9
Neon                   1.00013                Sodium chloride     5.9
Hydrogen (H2)          1.000254               Silicon             11.7
Argon                  1.000517               Methanol            33.0
Air (dry)              1.000536               Water               80.1
Nitrogen (N2)          1.000548               Ice (−30°C)         104
Water vapor (100°C)    1.00589                KTaNbO3 (0°C)       34,000


Example 4.5. A metal sphere of radius a carries a charge Q (Fig. 4.23). 
It is surrounded, out to radius b, by linear dielectric material of 
permittivity $\epsilon$. Find the potential at the center (relative to infinity).


Solution. To compute V, we need to know E; to find E, we might first 
try to locate the bound charge; we could get the bound charge from P, 
but we can’t calculate P unless we already know E (Eq. 4.30). We seem 
to be in a bind. What we do know is the free charge Q, and fortunately 
the arrangement is spherically symmetric, so let’s begin by calculating 
D, using Eq. 4.23: 

\[ \mathbf{D} = \frac{Q}{4\pi r^2}\,\hat{\mathbf{r}}, \quad \text{for all points } r > a. \]


(Inside the metal sphere, of course, E = P = D = 0.) Once we know D, 
it is a trivial matter to obtain E, using Eq. 4.32: 


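\[
\mathbf{E} =
\begin{cases}
\dfrac{Q}{4\pi\epsilon r^2}\,\hat{\mathbf{r}}, & \text{for } a < r < b,\\[8pt]
\dfrac{Q}{4\pi\epsilon_0 r^2}\,\hat{\mathbf{r}}, & \text{for } r > b.
\end{cases}
\]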


The potential at the center is therefore 


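\[
V = -\int_{\infty}^{0} \mathbf{E}\cdot d\mathbf{l}
  = -\int_{\infty}^{b} \left(\frac{Q}{4\pi\epsilon_0 r^2}\right) dr
    - \int_{b}^{a} \left(\frac{Q}{4\pi\epsilon r^2}\right) dr
    - \int_{a}^{0} (0)\, dr
  = \frac{Q}{4\pi}\left(\frac{1}{\epsilon_0 b} + \frac{1}{\epsilon a} - \frac{1}{\epsilon b}\right).
\]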
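As it turns out, it was not necessary for us to compute the polarization
or the bound charge explicitly, though this can easily be done:


\[ \mathbf{P} = \epsilon_0 \chi_e \mathbf{E} = \frac{\epsilon_0 \chi_e Q}{4\pi\epsilon r^2}\,\hat{\mathbf{r}}, \]
in the dielectric, and hence
\[ \rho_b = -\nabla\cdot\mathbf{P} = 0, \]
while
\[
\sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}} =
\begin{cases}
\dfrac{\epsilon_0 \chi_e Q}{4\pi\epsilon b^2}, & \text{at the outer surface},\\[8pt]
-\dfrac{\epsilon_0 \chi_e Q}{4\pi\epsilon a^2}, & \text{at the inner surface}.
\end{cases}
\]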


Notice that the surface bound charge at a is negative ($\hat{\mathbf{n}}$ points outward
with respect to the dielectric, which is $+\hat{\mathbf{r}}$ at b but $-\hat{\mathbf{r}}$ at a). This is
natural, since the charge on the metal sphere attracts its opposite in all
the dielectric molecules. It is this layer of negative charge that reduces
the field, within the dielectric, from $(1/4\pi\epsilon_0)(Q/r^2)\hat{\mathbf{r}}$ to $(1/4\pi\epsilon)(Q/r^2)\hat{\mathbf{r}}$. In
this respect, a dielectric is rather like an imperfect conductor: on a 
conducting shell the induced surface charge would be such as to cancel 
the field of Q completely in the region a <r < b; the dielectric does the 
best it can, but the cancelation is only partial. 
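
As a sanity check on Example 4.5, the following sketch integrates the piecewise field numerically and compares the result with the closed-form potential; it also confirms that the two bound surface charges add up to zero. The numbers passed in at the bottom are arbitrary illustrative values, the function names are hypothetical, and the substitution $u = 1/r$ is just one convenient way to handle the infinite upper limit.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def field(r, q, a, b, eps):
    """Radial E(r) from Example 4.5: zero in the metal, reduced in the dielectric."""
    if r < a:
        return 0.0
    if r < b:
        return q / (4 * math.pi * eps * r**2)
    return q / (4 * math.pi * EPS0 * r**2)


def potential_center_numeric(q, a, b, eps, n=20000):
    """V(0) = integral of E dr from a to infinity, via u = 1/r and a midpoint rule."""
    du = (1.0 / a) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        r = 1.0 / u
        total += field(r, q, a, b, eps) * r**2 * du   # dr = du / u^2 = r^2 du
    return total


def potential_center_exact(q, a, b, eps):
    """Closed form quoted in Example 4.5."""
    return q / (4 * math.pi) * (1 / (EPS0 * b) + 1 / (eps * a) - 1 / (eps * b))


def bound_surface_charge(q, a, b, eps):
    """(sigma_b at r = a, sigma_b at r = b) from the expressions in the example."""
    chi_e = eps / EPS0 - 1.0
    sigma_inner = -EPS0 * chi_e * q / (4 * math.pi * eps * a**2)
    sigma_outer = EPS0 * chi_e * q / (4 * math.pi * eps * b**2)
    return sigma_inner, sigma_outer


q, a, b, eps = 1e-9, 1.0, 2.0, 3 * EPS0
print(potential_center_numeric(q, a, b, eps), potential_center_exact(q, a, b, eps))
```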


You might suppose that linear dielectrics escape the defect in the parallel 
between E and D. Since P and D are now proportional to E, does it not 
follow that their curls, like E’s, must vanish? Unfortunately, it does not, 
for the line integral of P around a closed path that straddles the boundary 
between one type of material and another need not be zero, even though 
the integral of E around the same loop must be. The reason is that the 
proportionality factor $\epsilon_0\chi_e$ is different on the two sides. For instance, at
the interface between a polarized dielectric and the vacuum (Fig. 4.24), P
is zero on one side but not on the other. Around this loop $\oint \mathbf{P}\cdot d\mathbf{l} \neq 0$, and
hence, by Stokes’ theorem, the curl of P cannot vanish everywhere within
the loop (in fact, it is infinite at the boundary).¹²

Of course, if the space is entirely filled with a homogeneous¹³ linear
dielectric, then this objection is void; in this rather special circumstance, 


[Fig. 4.24: interface between vacuum (P = 0) and dielectric (P ≠ 0).]


12 Putting that argument in differential form, Eq. 4.30 and product rule 7 yield
$\nabla \times \mathbf{P} = -\epsilon_0 \mathbf{E} \times (\nabla \chi_e)$, so the problem arises when $\nabla \chi_e$ is not parallel to E.

13 A homogeneous medium is one whose properties (in this case the susceptibility) do not vary 
with position. 
\[ \nabla \cdot \mathbf{D} = \rho_f \quad \text{and} \quad \nabla \times \mathbf{D} = 0, \]


so D can be found from the free charge just as though the dielectric were 
not there: 


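\[ \mathbf{D} = \epsilon_0 \mathbf{E}_{\text{vac}}, \]


where $\mathbf{E}_{\text{vac}}$ is the field the same free charge distribution would produce in
the absence of any dielectric. According to Eqs. 4.32 and 4.34, therefore,


\[ \mathbf{E} = \frac{1}{\epsilon}\mathbf{D} = \frac{1}{\epsilon_r}\mathbf{E}_{\text{vac}}. \tag{4.35} \]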


Conclusion: When all space is filled with a homogeneous linear dielectric, 
the field everywhere is simply reduced by a factor of one over the dielectric 
constant. (Actually, it is not necessary for the dielectric to fill all space: in 
regions where the field is zero anyway, it can hardly matter whether the 
dielectric is present or not, since there’s no polarization in any event.) 

For example, if a free charge q is embedded in a large dielectric, the field 
it produces is 


\[ \mathbf{E} = \frac{1}{4\pi\epsilon}\frac{q}{r^2}\,\hat{\mathbf{r}} \tag{4.36} \]


(that’s $\epsilon$, not $\epsilon_0$), and the force it exerts on nearby charges is reduced
accordingly. But it’s not that there is anything wrong with Coulomb’s law; 
rather, the polarization of the medium partially “shields” the charge, by 
surrounding it with bound charge of the opposite sign (Fig. 4.25).¹⁴


Example 4.6. A parallel-plate capacitor (Fig. 4.26) is filled with insulating
material of dielectric constant $\epsilon_r$. What effect does this have on
its capacitance? 


Solution. Since the field is confined to the space between the plates, 
the dielectric will reduce E, and hence also the potential difference V, 


14 In quantum electrodynamics, the vacuum itself can be polarized, and this means that
the effective (or “renormalized”) charge of the electron, as you might measure it in the 
laboratory, is not its true (“bare”) value, and in fact depends slightly on how far away 
you are! 


by a factor $1/\epsilon_r$. Accordingly, the capacitance C = Q/V is increased by
a factor of the dielectric constant,


\[ C = \epsilon_r C_{\text{vac}}. \tag{4.37} \]


This is, in fact, a common way to beef up a capacitor. 
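
For a feel for the numbers, here is a minimal sketch that applies Eq. 4.37 to the ideal parallel-plate formula $C_{\text{vac}} = \epsilon_0 A/d$. The plate area and gap are made-up illustrative values, the function name is hypothetical, and fringing is ignored.

```python
EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def parallel_plate_capacitance(area, gap, eps_r=1.0):
    """Ideal parallel-plate capacitance, scaled by the dielectric constant (Eq. 4.37)."""
    return eps_r * EPS0 * area / gap


c_vac = parallel_plate_capacitance(area=1e-2, gap=1e-3)
c_benzene = parallel_plate_capacitance(area=1e-2, gap=1e-3, eps_r=2.28)
print(c_vac, c_benzene, c_benzene / c_vac)
```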


A crystal is generally easier to polarize in some directions than in 
others,¹⁵ and in this case Eq. 4.30 is replaced by the general linear relation


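\[
\left.
\begin{aligned}
P_x &= \epsilon_0(\chi_{e_{xx}} E_x + \chi_{e_{xy}} E_y + \chi_{e_{xz}} E_z),\\
P_y &= \epsilon_0(\chi_{e_{yx}} E_x + \chi_{e_{yy}} E_y + \chi_{e_{yz}} E_z),\\
P_z &= \epsilon_0(\chi_{e_{zx}} E_x + \chi_{e_{zy}} E_y + \chi_{e_{zz}} E_z),
\end{aligned}
\right\} \tag{4.38}
\]


just as Eq. 4.1 was superseded by Eq. 4.3 for asymmetrical molecules. The
nine coefficients, $\chi_{e_{xx}}, \chi_{e_{xy}}, \ldots$, constitute the susceptibility tensor.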
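
A short sketch may help show what Eq. 4.38 implies: in an anisotropic medium P need not be parallel to E. The tensor entries below are invented for illustration (they do not describe any real crystal), and the function names are hypothetical.

```python
EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def polarization(chi, e_field):
    """P_i = eps0 * sum_j chi[i][j] * E_j, which is Eq. 4.38 written as a matrix product."""
    return [EPS0 * sum(chi[i][j] * e_field[j] for j in range(3)) for i in range(3)]


def cross(u, v):
    """Cross product; it vanishes exactly when u and v are parallel."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


chi_anisotropic = [[1.0, 0.0, 0.0],
                   [0.0, 2.0, 0.0],
                   [0.0, 0.0, 3.0]]   # hypothetical: easier to polarize along y and z
chi_isotropic = [[2.0, 0.0, 0.0],
                 [0.0, 2.0, 0.0],
                 [0.0, 0.0, 2.0]]     # reduces to Eq. 4.30 with chi_e = 2

e_field = [1.0, 1.0, 0.0]
print(cross(polarization(chi_anisotropic, e_field), e_field))  # nonzero: P not parallel to E
print(cross(polarization(chi_isotropic, e_field), e_field))    # zero: P parallel to E
```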


Problem 4.18. The space between the plates of a parallel-plate capacitor (Fig. 4.27) is filled 
with two slabs of linear dielectric material. Each slab has thickness a, so the total distance 
between the plates is 2a. Slab 1 has a dielectric constant of 2, and slab 2 has a dielectric constant
of 1.5. The free charge density on the top plate is $\sigma$ and on the bottom plate $-\sigma$.


(a) Find the electric displacement D in each slab. 

(b) Find the electric field E in each slab. 

(c) Find the polarization P in each slab. 

(d) Find the potential difference between the plates. 
(e) Find the location and amount of all bound charge. 


15 A medium is said to be isotropic if its properties (such as susceptibility) are the same 
in all directions. Thus Eq. 4.30 is the special case of Eq. 4.38 that holds for isotropic 
media. Physicists tend to be careless with their language, and unless otherwise indicated 
the term “linear dielectric” implies “isotropic linear dielectric,” and suggests “homogeneous 
isotropic linear dielectric.” But technically, “linear” just means that at any given point, and 
for E in a given direction, the components of P are proportional to E — the proportionality 
factor could vary with position and/or direction. 
Fig. 4.27
(f) Now that you know all the charge (free and bound), recalculate the field in each slab, and 
confirm your answer to (b). 


Problem 4.19. Suppose you have just enough linear dielectric material, of dielectric constant 
$\epsilon_r$, to half fill a parallel-plate capacitor (Fig. 4.28). By how much is the capacitance increased
when you distribute the material as in Fig. 4.28a? How about Fig. 4.28b? For a given potential 
difference V between the plates, find E, D, and P, in each region, and the free and bound 
charge on all surfaces, for both cases. 


(a) (b) 
Fig. 4.28 
Problem 4.20. A sphere of linear dielectric material has embedded in it a uniform free charge
density $\rho$. Find the potential at the center of the sphere (relative to infinity), if its radius is R
and the dielectric constant is $\epsilon_r$.


Problem 4.21. A certain coaxial cable consists of a copper wire, radius a, surrounded by a 
concentric copper tube of inner radius c (Fig. 4.29). The space between is partially filled (from b 
out to c) with material of dielectric constant $\epsilon_r$, as shown. Find the capacitance per unit length
of this cable. 




Fig. 4.29 
4.4.2 Boundary Value Problems with Linear Dielectrics


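In a (homogeneous isotropic) linear dielectric, the bound charge density
($\rho_b$) is proportional to the free charge density ($\rho_f$):¹⁶


\[ \rho_b = -\nabla\cdot\mathbf{P} = -\nabla\cdot\left(\epsilon_0\frac{\chi_e}{\epsilon}\mathbf{D}\right) = -\left(\frac{\chi_e}{1+\chi_e}\right)\rho_f. \tag{4.39} \]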


In particular, unless free charge is actually embedded in the material, $\rho = 0$,
and any net charge must reside at the surface. Within such a dielectric, then, 
the potential obeys Laplace’s equation, and all the machinery of Chapter 3 
carries over. It is convenient to rewrite the boundary conditions in a way 
that makes reference only to the free charge. Equation 4.26 says 


\[ \epsilon_{\text{above}} E^{\perp}_{\text{above}} - \epsilon_{\text{below}} E^{\perp}_{\text{below}} = \sigma_f, \tag{4.40} \]


or (in terms of the potential), 


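\[ \epsilon_{\text{above}} \frac{\partial V_{\text{above}}}{\partial n} - \epsilon_{\text{below}} \frac{\partial V_{\text{below}}}{\partial n} = -\sigma_f, \tag{4.41} \]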


whereas the potential itself is, of course, continuous (Eq. 2.34): 


\[ V_{\text{above}} = V_{\text{below}}. \tag{4.42} \]
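
The boundary conditions lend themselves to a tiny calculation: given the field just above an interface, Eq. 4.40 and the continuity of the tangential component (Eq. 4.29) fix the field just below. The sketch is illustrative; the function name, the sample field, and the choice of dielectric constant 2 for the lower medium are all assumptions. "Perpendicular" here means the component along the normal pointing from below to above.

```python
EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def field_below(e_perp_above, e_par_above, eps_above, eps_below, sigma_f=0.0):
    """Field just below an interface between two linear dielectrics.

    Normal component from Eq. 4.40; tangential component unchanged (Eq. 4.29).
    """
    e_perp_below = (eps_above * e_perp_above - sigma_f) / eps_below
    e_par_below = e_par_above
    return e_perp_below, e_par_below


# Vacuum above, a medium of dielectric constant 2 below, no free surface charge.
perp, par = field_below(100.0, 50.0, EPS0, 2 * EPS0)
print(perp, par)   # the normal component is halved, the tangential one is unchanged
```

With no free charge at the surface, $D^{\perp}$ is continuous, so the field lines bend toward the surface on entering the denser dielectric.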


Example 4.7. A sphere of homogeneous linear dielectric material is 
placed in an otherwise uniform electric field $\mathbf{E}_0$ (Fig. 4.30). Find the
electric field inside the sphere. 


Solution. This is reminiscent of Ex. 3.8, in which an uncharged 
conducting sphere was introduced into a uniform field. In that case, the 
field of the induced charge cancels $\mathbf{E}_0$ within the sphere; in a dielectric,
the cancelation (from the bound charge) is incomplete.

Our problem is to solve Laplace’s equation, for $V_{\text{in}}(r, \theta)$ when $r \le R$,
and $V_{\text{out}}(r, \theta)$ when $r \ge R$, subject to the boundary conditions


16 This does not apply to the surface charge ($\sigma_b$), because $\chi_e$ is not independent of position
(obviously) at the boundary.
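\[
\left.
\begin{array}{lll}
\text{(i)} & V_{\text{in}} = V_{\text{out}}, & \text{at } r = R,\\[6pt]
\text{(ii)} & \epsilon\,\dfrac{\partial V_{\text{in}}}{\partial r} = \epsilon_0\,\dfrac{\partial V_{\text{out}}}{\partial r}, & \text{at } r = R,\\[6pt]
\text{(iii)} & V_{\text{out}} \to -E_0 r\cos\theta, & \text{for } r \gg R.
\end{array}
\right\} \tag{4.43}
\]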


(The second of these follows from Eq. 4.41, since there is no free charge 
at the surface.) Inside the sphere, Eq. 3.65 says 


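\[ V_{\text{in}}(r, \theta) = \sum_{l=0}^{\infty} A_l r^l P_l(\cos\theta); \tag{4.44} \]


outside the sphere, in view of (iii), we have


\[ V_{\text{out}}(r, \theta) = -E_0 r\cos\theta + \sum_{l=0}^{\infty} \frac{B_l}{r^{l+1}} P_l(\cos\theta). \tag{4.45} \]
Boundary condition (i) requires that
\[ \sum_{l=0}^{\infty} A_l R^l P_l(\cos\theta) = -E_0 R\cos\theta + \sum_{l=0}^{\infty} \frac{B_l}{R^{l+1}} P_l(\cos\theta), \]
so¹⁷
\[
\left.
\begin{aligned}
A_l R^l &= \frac{B_l}{R^{l+1}}, \quad \text{for } l \neq 1,\\
A_1 R &= -E_0 R + \frac{B_1}{R^2}.
\end{aligned}
\right\} \tag{4.46}
\]
Meanwhile, condition (ii) yields
\[ \epsilon_r \sum_{l=0}^{\infty} l A_l R^{l-1} P_l(\cos\theta) = -E_0\cos\theta - \sum_{l=0}^{\infty} \frac{(l+1)B_l}{R^{l+2}} P_l(\cos\theta), \]
so
\[
\left.
\begin{aligned}
\epsilon_r l A_l R^{l-1} &= -\frac{(l+1)B_l}{R^{l+2}}, \quad \text{for } l \neq 1,\\
\epsilon_r A_1 &= -E_0 - \frac{2B_1}{R^3}.
\end{aligned}
\right\} \tag{4.47}
\]
It follows that
\[
\left.
\begin{aligned}
A_l &= B_l = 0, \quad \text{for } l \neq 1,\\
A_1 &= -\frac{3}{\epsilon_r + 2}E_0, \qquad B_1 = \frac{\epsilon_r - 1}{\epsilon_r + 2}R^3 E_0.
\end{aligned}
\right\} \tag{4.48}
\]
Evidently,
\[ V_{\text{in}}(r, \theta) = -\frac{3E_0}{\epsilon_r + 2}\, r\cos\theta = -\frac{3E_0}{\epsilon_r + 2}\, z, \]
and hence the field inside the sphere is (surprisingly) uniform:
\[ \mathbf{E} = \frac{3}{\epsilon_r + 2}\mathbf{E}_0. \tag{4.49} \]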


17 Remember, $P_1(\cos\theta) = \cos\theta$, and the coefficients must be equal for each l, as you could
prove by multiplying by $P_{l'}(\cos\theta)\sin\theta$, integrating from 0 to $\pi$, and invoking the
orthogonality of the Legendre polynomials (Eq. 3.68).
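
The l = 1 lines of Eqs. 4.46 and 4.47 are two linear equations in the two unknowns $A_1$ and $B_1$, so the result of Example 4.7 can be checked by solving them numerically. This is only a sketch: the helper names are hypothetical and the sample values of $\epsilon_r$, $E_0$, and R are arbitrary.

```python
def solve_2x2(m, rhs):
    """Solve a 2x2 linear system by Cramer's rule."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return ((rhs[0] * d - b * rhs[1]) / det, (a * rhs[1] - c * rhs[0]) / det)


def sphere_coefficients(eps_r, e0, radius):
    """Return (A1, B1) from the l = 1 lines of Eqs. 4.46 and 4.47.

    Row 1:      R * A1 -  B1 / R^2 = -E0 * R
    Row 2:  eps_r * A1 + 2 B1 / R^3 = -E0
    """
    m = ((radius, -1.0 / radius**2),
         (eps_r, 2.0 / radius**3))
    rhs = (-e0 * radius, -e0)
    return solve_2x2(m, rhs)


eps_r, e0, radius = 4.0, 1.0, 2.0
a1, b1 = sphere_coefficients(eps_r, e0, radius)
print(a1, -3 * e0 / (eps_r + 2))                          # compare with Eq. 4.48
print(b1, (eps_r - 1) / (eps_r + 2) * radius**3 * e0)     # compare with Eq. 4.48
print(-a1, 3 * e0 / (eps_r + 2))                          # interior field, Eq. 4.49
```

Since $V_{\text{in}} = A_1 z$, the interior field is $-A_1\hat{\mathbf{z}}$, which is why the last line compares $-A_1$ with Eq. 4.49.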
Example 4.8. Suppose the entire region below the plane z = 0 in
Fig. 4.31 is filled with uniform linear dielectric material of susceptibility
$\chi_e$. Calculate the force on a point charge q situated a distance d
above the origin. 


Solution. The surface bound charge on the xy plane is of opposite 
sign to q, so the force will be attractive. (In view of Eq. 4.39, there is 
no volume bound charge.) Let us first calculate $\sigma_b$, using Eqs. 4.11 and
4.30:¹⁸


\[ \sigma_b = \mathbf{P}\cdot\hat{\mathbf{n}} = P_z = \epsilon_0 \chi_e E_z, \]


where $E_z$ is the z-component of the total field just inside the dielectric,
at z = 0. This field is due in part to q and in part to the bound charge
itself. From Coulomb’s law, the former contribution is

\[ -\frac{1}{4\pi\epsilon_0}\frac{q}{(r^2 + d^2)}\cos\theta = -\frac{1}{4\pi\epsilon_0}\frac{qd}{(r^2 + d^2)^{3/2}}, \]
where $r = \sqrt{x^2 + y^2}$ is the distance from the origin. The z component
of the field of the bound charge, meanwhile, is $-\sigma_b/2\epsilon_0$ (see footnote
after Eq. 2.33). Thus
\[ \sigma_b = \epsilon_0 \chi_e \left[ -\frac{1}{4\pi\epsilon_0}\frac{qd}{(r^2 + d^2)^{3/2}} - \frac{\sigma_b}{2\epsilon_0} \right], \]


which we can solve for $\sigma_b$:


\[ \sigma_b = -\frac{1}{2\pi}\left(\frac{\chi_e}{\chi_e + 2}\right)\frac{qd}{(r^2 + d^2)^{3/2}}. \tag{4.50} \]
Apart from the factor $\chi_e/(\chi_e + 2)$, this is exactly the same as the induced
charge on an infinite conducting plane under similar circumstances
(Eq. 3.10).¹⁹ Evidently, the total bound charge is


\[ q_b = -\left(\frac{\chi_e}{\chi_e + 2}\right) q. \tag{4.51} \]


18 This method mimics Prob. 3.40. 
19 For some purposes a conductor can be regarded as the limiting case of a linear dielectric,
with $\chi_e \to \infty$. This is often a useful check; try applying it to Exs. 4.5, 4.6, and 4.7.
We could, of course, obtain the field of $\sigma_b$ by direct integration:


\[ \mathbf{E} = \frac{1}{4\pi\epsilon_0}\int \frac{\hat{𝓇}}{𝓇^2}\,\sigma_b\, da. \]

But, as in the case of the conducting plane, there is a nicer solution 
by the method of images. If we replace the dielectric by a single point
charge $q_b$ at the image position (0, 0, −d), we have


\[ V = \frac{1}{4\pi\epsilon_0}\left[ \frac{q}{\sqrt{x^2 + y^2 + (z - d)^2}} + \frac{q_b}{\sqrt{x^2 + y^2 + (z + d)^2}} \right] \tag{4.52} \]

in the region z > 0. Meanwhile, a charge $(q + q_b)$ at (0, 0, d) yields the
potential


\[ V = \frac{1}{4\pi\epsilon_0}\left[ \frac{q + q_b}{\sqrt{x^2 + y^2 + (z - d)^2}} \right] \tag{4.53} \]


for the region z < 0. Taken together, Eqs. 4.52 and 4.53 constitute
a function that satisfies Poisson’s equation with point charge q at
(0, 0, d), which goes to zero at infinity, which is continuous at the
boundary z = 0, and whose normal derivative exhibits the discontinuity
appropriate to a surface charge $\sigma_b$ at z = 0:

\[ -\epsilon_0\left( \left.\frac{\partial V}{\partial z}\right|_{z=0^+} - \left.\frac{\partial V}{\partial z}\right|_{z=0^-} \right) = -\frac{1}{2\pi}\left(\frac{\chi_e}{\chi_e + 2}\right)\frac{qd}{(x^2 + y^2 + d^2)^{3/2}}. \]


Accordingly, this is the correct potential for our problem. In particular,
the force on q is


\[ \mathbf{F} = \frac{1}{4\pi\epsilon_0}\frac{q q_b}{(2d)^2}\,\hat{\mathbf{z}} = -\frac{1}{4\pi\epsilon_0}\left(\frac{\chi_e}{\chi_e + 2}\right)\frac{q^2}{4d^2}\,\hat{\mathbf{z}}. \tag{4.54} \]


I do not claim to have provided a compelling motivation for
Eqs. 4.52 and 4.53; like all image solutions, this one owes its
justification to the fact that it works: it solves Poisson’s equation,
and it meets the boundary conditions. Still, discovering an image
solution is not entirely a matter of guesswork. There are at least two
“rules of the game.” (1) You must never put an image charge into the
region where you’re computing the potential. (Thus Eq. 4.52 gives the
potential for z > 0, but this image charge $q_b$ is at z = −d; when we turn
to the region z < 0 (Eq. 4.53), the image charge $(q + q_b)$ is at z = +d.)
(2) The image charges must add up to the correct total in each region.
(That’s how I knew to use $q_b$ to account for the charge in the region
z ≤ 0, and $(q + q_b)$ to cover the region z ≥ 0.)
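
Two of the claims in Example 4.8 are easy to test numerically: that the surface charge of Eq. 4.50 integrates to the total of Eq. 4.51, and that the force of Eq. 4.54 approaches the conducting-plane result as $\chi_e \to \infty$. The sketch below does both; the helper names and sample numbers are assumptions, and the substitution $r = d\tan u$ is simply a convenient way to cover the infinite plane.

```python
import math


def sigma_b(r, q, d, chi_e):
    """Bound surface charge density of Eq. 4.50 at distance r from the origin."""
    return -(1 / (2 * math.pi)) * (chi_e / (chi_e + 2)) * q * d / (r**2 + d**2) ** 1.5


def total_bound_charge(q, d, chi_e, n=20000):
    """Integrate sigma_b over rings 2*pi*r*dr, mapping 0 <= r < infinity onto
    0 <= u < pi/2 with r = d*tan(u) (midpoint rule)."""
    du = (math.pi / 2) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        r = d * math.tan(u)
        dr = d / math.cos(u) ** 2 * du
        total += sigma_b(r, q, d, chi_e) * 2 * math.pi * r * dr
    return total


def force_on_q(q, d, chi_e, eps0=8.854e-12):
    """z component of the force in Eq. 4.54 (negative means toward the dielectric)."""
    q_b = -(chi_e / (chi_e + 2)) * q
    return q * q_b / (4 * math.pi * eps0 * (2 * d) ** 2)


print(total_bound_charge(1.0, 0.5, 3.0), -3.0 / 5.0)   # compare with Eq. 4.51
print(force_on_q(1e-9, 0.01, 1e9))                     # close to the conductor value
```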


Problem 4.22. A very long cylinder of linear dielectric material is placed in an otherwise 
uniform electric field $\mathbf{E}_0$. Find the resulting field within the cylinder. (The radius is a, the
susceptibility $\chi_e$, and the axis is perpendicular to $\mathbf{E}_0$.)


Problem 4.23. Find the field inside a sphere of linear dielectric material in an otherwise 
uniform electric field $\mathbf{E}_0$ (Ex. 4.7) by the following method of successive approximations: First
pretend the field inside is just $\mathbf{E}_0$, and use Eq. 4.30 to write down the resulting polarization
$\mathbf{P}_0$. This polarization generates a field of its own, $\mathbf{E}_1$ (Ex. 4.2), which in turn modifies the
polarization by an amount $\mathbf{P}_1$, which further changes the field by an amount $\mathbf{E}_2$, and so on.
The resulting field is $\mathbf{E}_0 + \mathbf{E}_1 + \mathbf{E}_2 + \cdots$. Sum the series, and compare your answer with
Eq. 4.49.


Problem 4.24. An uncharged conducting sphere of radius a is coated with a thick insulating 
shell (dielectric constant $\epsilon_r$) out to radius b. This object is now placed in an otherwise uniform
electric field $\mathbf{E}_0$. Find the electric field in the insulator.


Problem 4.25. Suppose the region above the xy plane in Ex. 4.8 is also filled with linear 
dielectric but of a different susceptibility $\chi'_e$. Find the potential everywhere.


4.4.3 Energy in Dielectric Systems 
It takes work to charge up a capacitor (Eq. 2.55): 
\[ W = \frac{1}{2}CV^2. \]


If the capacitor is filled with linear dielectric, its capacitance exceeds the 
vacuum value by a factor of the dielectric constant, 


\[ C = \epsilon_r C_{\text{vac}}, \]


as we found in Ex. 4.6. Evidently, the work necessary to charge a dielectric- 
filled capacitor (to a given voltage) is increased by the same factor. The 
reason is pretty clear: you have to pump on more (free) charge, to achieve 
a given potential, because part of the field is canceled off by the bound 
charges. 

In Chapter 2, I derived a general formula for the energy stored in any 
electrostatic system (Eq. 2.45): 


\[ W = \frac{\epsilon_0}{2}\int E^2\, d\tau. \tag{4.55} \]


The case of the dielectric-filled capacitor suggests that this should be 


changed to
\[ W = \frac{\epsilon_0}{2}\int \epsilon_r E^2\, d\tau = \frac{1}{2}\int \mathbf{D}\cdot\mathbf{E}\, d\tau \]


in the presence of linear dielectrics. To prove it, suppose the dielectric 
material is fixed in position, and we bring in the free charge, a bit at a time. 
As $\rho_f$ is increased by an amount $\Delta\rho_f$, the polarization will change and with
it the bound charge distribution; but we’re interested only in the work done 
on the incremental free charge: 


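\[ \Delta W = \int (\Delta\rho_f)\, V\, d\tau. \tag{4.56} \]
Since $\nabla\cdot\mathbf{D} = \rho_f$, $\Delta\rho_f = \nabla\cdot(\Delta\mathbf{D})$, where $\Delta\mathbf{D}$ is the resulting change in D, so
\[ \Delta W = \int [\nabla\cdot(\Delta\mathbf{D})]\, V\, d\tau. \]


Now
\[ \nabla\cdot[(\Delta\mathbf{D})V] = [\nabla\cdot(\Delta\mathbf{D})]V + \Delta\mathbf{D}\cdot(\nabla V), \]


and hence (integrating by parts),
\[ \Delta W = \int \nabla\cdot[(\Delta\mathbf{D})V]\, d\tau + \int (\Delta\mathbf{D})\cdot\mathbf{E}\, d\tau. \]


The divergence theorem turns the first term into a surface integral, which
vanishes if we integrate over all space. Therefore, the work done is equal to


\[ \Delta W = \int (\Delta\mathbf{D})\cdot\mathbf{E}\, d\tau. \tag{4.57} \]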


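So far, this applies to any material. Now, if the medium is a linear
dielectric, then $\mathbf{D} = \epsilon\mathbf{E}$, so


\[ \tfrac{1}{2}\Delta(\mathbf{D}\cdot\mathbf{E}) = \tfrac{1}{2}\Delta(\epsilon E^2) = \epsilon(\Delta\mathbf{E})\cdot\mathbf{E} = (\Delta\mathbf{D})\cdot\mathbf{E} \]


(for infinitesimal increments). Thus


\[ \Delta W = \Delta\left( \frac{1}{2}\int \mathbf{D}\cdot\mathbf{E}\, d\tau \right). \]


The total work done, then, as we build the free charge up from zero to the
final configuration, is


\[ W = \frac{1}{2}\int \mathbf{D}\cdot\mathbf{E}\, d\tau, \tag{4.58} \]


as anticipated.²⁰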

It may puzzle you that Eq. 4.55, which we derived quite generally in 
Chapter 2, does not seem to apply in the presence of dielectrics, where it is 
replaced by Eq. 4.58. The point is not that one or the other of these equa- 
tions is wrong, but rather that they address somewhat different questions. 
The distinction is subtle, so let’s go right back to the beginning: What do 
we mean by “the energy of a system”? Answer: It is the work required to 
assemble the system. Very well — but when dielectrics are involved, there 
are two quite different ways one might construe this process: 


(1) We bring in all the charges (free and bound), one by one, with tweezers, 
and glue each one down in its proper final location. If this is what 
you mean by “assemble the system,” then Eq. 4.55 is your formula for 
the energy stored. Notice, however, that this will not include the work 


20 In case you are wondering why I did not do this more simply by the method of Section 2.4.3, 
starting with $W = \frac{1}{2}\int \rho_f V\, d\tau$, the reason is that this formula is untrue, in general. Study
the derivation of Eq. 2.42, and you will see that it applies only to the total charge. For linear 
dielectrics it happens to hold for the free charge alone, but this is scarcely obvious a priori 
and, in fact, is most easily confirmed by working backward from Eq. 4.58. 
involved in stretching and twisting the dielectric molecules (if we picture
the positive and negative charges as held together by tiny springs, it does
not include the spring energy, $\frac{1}{2}kx^2$, associated with polarizing each
molecule).²¹

(2) With the unpolarized dielectric in place, we bring in the free charges, one 
by one, allowing the dielectric to respond as it sees fit. If this is what you 
mean by “assemble the system” (and ordinarily it is, since free charge is 
what we actually push around), then Eq. 4.58 is the formula you want. 
In this case the “spring” energy is included, albeit indirectly, because 
the force you must apply to the free charge depends on the disposition 
of the bound charge; as you move the free charge, you are automatically 
stretching those “springs.” 


Example 4.9. A sphere of radius R is filled with material of dielectric 
constant $\epsilon_r$ and uniform embedded free charge $\rho_f$. What is the energy
of this configuration? 


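Solution. From Gauss’s law (in the form of Eq. 4.23), the displacement is
\[
\mathbf{D}(r) =
\begin{cases}
\dfrac{\rho_f}{3}\,\mathbf{r} & (r < R),\\[8pt]
\dfrac{\rho_f}{3}\dfrac{R^3}{r^2}\,\hat{\mathbf{r}} & (r > R).
\end{cases}
\]
So the electric field is
\[
\mathbf{E}(r) =
\begin{cases}
\dfrac{\rho_f}{3\epsilon_0\epsilon_r}\,\mathbf{r} & (r < R),\\[8pt]
\dfrac{\rho_f}{3\epsilon_0}\dfrac{R^3}{r^2}\,\hat{\mathbf{r}} & (r > R).
\end{cases}
\]


The purely electrostatic energy (Eq. 4.55) is


\[
W_1 = \frac{\epsilon_0}{2}\left[ \left(\frac{\rho_f}{3\epsilon_0\epsilon_r}\right)^2 \int_0^R r^2\, 4\pi r^2\, dr + \left(\frac{\rho_f}{3\epsilon_0}\right)^2 R^6 \int_R^{\infty} \frac{1}{r^4}\, 4\pi r^2\, dr \right]
    = \frac{2\pi}{9\epsilon_0}\rho_f^2 R^5\left( \frac{1}{5\epsilon_r^2} + 1 \right).
\]


But the total energy (Eq. 4.58) is
\[
W_2 = \frac{1}{2}\left[ \left(\frac{\rho_f}{3}\right)\left(\frac{\rho_f}{3\epsilon_0\epsilon_r}\right) \int_0^R r^2\, 4\pi r^2\, dr + \left(\frac{\rho_f R^3}{3}\right)\left(\frac{\rho_f R^3}{3\epsilon_0}\right) \int_R^{\infty} \frac{1}{r^4}\, 4\pi r^2\, dr \right]
    = \frac{2\pi}{9\epsilon_0}\rho_f^2 R^5\left( \frac{1}{5\epsilon_r} + 1 \right).
\]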


21 The “spring” itself may be electrical in nature, but it is still not included in Eq. 4.55, if E is 
taken to be the macroscopic field. See Prob. 4.32. 
Notice that $W_1 < W_2$; that’s because $W_1$ does not include the energy
involved in stretching the molecules.


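Let’s check that $W_2$ is the work done on the free charge in assembling
the system. We start with the (uncharged, unpolarized) dielectric
sphere, and bring in the free charge in infinitesimal installments (dq),
filling out the sphere layer by layer (what I called the “method of
assembly” in Prob. 2.36). When we have reached radius r′, the electric
field is


\[
\mathbf{E}(r) =
\begin{cases}
\dfrac{\rho_f}{3\epsilon_0\epsilon_r}\,\mathbf{r} & (r < r'),\\[8pt]
\dfrac{\rho_f}{3\epsilon_0\epsilon_r}\dfrac{r'^3}{r^2}\,\hat{\mathbf{r}} & (r' < r < R),\\[8pt]
\dfrac{\rho_f}{3\epsilon_0}\dfrac{r'^3}{r^2}\,\hat{\mathbf{r}} & (r > R).
\end{cases}
\]


The work required to bring the next dq in from infinity to r′ is


\[
dW = -dq\left[ \int_{\infty}^{R} \mathbf{E}\cdot d\mathbf{l} + \int_{R}^{r'} \mathbf{E}\cdot d\mathbf{l} \right]
   = -dq\left[ \frac{\rho_f r'^3}{3\epsilon_0}\int_{\infty}^{R} \frac{1}{r^2}\, dr + \frac{\rho_f r'^3}{3\epsilon_0\epsilon_r}\int_{R}^{r'} \frac{1}{r^2}\, dr \right]
   = \frac{\rho_f r'^3}{3\epsilon_0}\left[ \frac{1}{R} + \frac{1}{\epsilon_r}\left( \frac{1}{r'} - \frac{1}{R} \right) \right] dq.
\]


This increases the radius (r′):


\[ dq = \rho_f\, 4\pi r'^2\, dr', \]


so the total work done, in going from r′ = 0 to r′ = R, is


\[
W = \frac{4\pi\rho_f^2}{3\epsilon_0}\left[ \frac{1}{R}\left( 1 - \frac{1}{\epsilon_r} \right)\int_0^R r'^5\, dr' + \frac{1}{\epsilon_r}\int_0^R r'^4\, dr' \right]
  = \frac{2\pi}{9\epsilon_0}\rho_f^2 R^5\left( \frac{1}{5\epsilon_r} + 1 \right) = W_2. \;\checkmark
\]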


Evidently, the energy “stored in the springs” is


\[ W_{\text{spring}} = W_2 - W_1 = \frac{2\pi}{45\epsilon_0\epsilon_r^2}\rho_f^2 R^5(\epsilon_r - 1). \]


It is sometimes alleged that Eq. 4.58 represents the energy even for
nonlinear dielectrics, but this is false: to proceed beyond Eq. 4.57, one must
assume linearity. In fact, for dissipative systems the whole notion of “stored
energy” loses its meaning, because the work done depends not only on the
final configuration but on how it got there. If the molecular “springs” are
allowed to have some friction, for instance, then $W_{\text{spring}}$ can be made as
large as you like, by assembling the charges in such a way that the spring is
obliged to expand and contract many times before reaching its final state.
In particular, you get nonsensical results if you try to apply Eq. 4.58 to
electrets, with frozen-in polarization (see Prob. 4.27).
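
The two energies of Example 4.9 can be checked by brute-force integration. The sketch below evaluates $(\epsilon_0/2)\int E^2\,d\tau$ and $\frac{1}{2}\int \mathbf{D}\cdot\mathbf{E}\,d\tau$ with a midpoint rule and compares them with the closed forms for $W_1$ and $W_2$. The helper names and the sample values of $\rho_f$, R, and $\epsilon_r$ are assumptions made for illustration.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space in C^2/(N m^2), rounded


def d_field(r, rho_f, radius):
    """Magnitude of D from Example 4.9."""
    return rho_f * r / 3 if r < radius else rho_f * radius**3 / (3 * r**2)


def e_field(r, rho_f, radius, eps_r):
    """Magnitude of E = D / eps, with eps = eps0 * eps_r inside and eps0 outside."""
    eps = EPS0 * eps_r if r < radius else EPS0
    return d_field(r, rho_f, radius) / eps


def energy_integral(rho_f, radius, eps_r, use_d, n=20000):
    """(eps0/2) int E^2 dtau if use_d is False (Eq. 4.55),
    (1/2) int D E dtau if use_d is True (Eq. 4.58).
    The exterior is mapped onto 0 < u < 1/R with u = 1/r."""
    def density(r):
        e = e_field(r, rho_f, radius, eps_r)
        other = d_field(r, rho_f, radius) if use_d else EPS0 * e
        return 0.5 * other * e * 4 * math.pi * r**2

    dr = radius / n
    inside = sum(density((i + 0.5) * dr) for i in range(n)) * dr
    du = (1.0 / radius) / n
    outside = sum(density(1.0 / ((i + 0.5) * du)) / ((i + 0.5) * du) ** 2
                  for i in range(n)) * du
    return inside + outside


def closed_forms(rho_f, radius, eps_r):
    """(W1, W2) as quoted in Example 4.9."""
    prefactor = 2 * math.pi * rho_f**2 * radius**5 / (9 * EPS0)
    return prefactor * (1 / (5 * eps_r**2) + 1), prefactor * (1 / (5 * eps_r) + 1)


rho_f, radius, eps_r = 1e-6, 0.1, 2.0
w1, w2 = closed_forms(rho_f, radius, eps_r)
print(energy_integral(rho_f, radius, eps_r, use_d=False), w1)
print(energy_integral(rho_f, radius, eps_r, use_d=True), w2)
print(w2 - w1)   # the energy attributed to the "springs"
```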
Problem 4.26. A spherical conductor, of radius a, carries a charge Q (Fig. 4.32). It is surrounded
by linear dielectric material of susceptibility $\chi_e$, out to radius b. Find the energy of
this configuration (Eq. 4.58). 


Fig. 4.32 


Problem 4.27. Calculate W, using both Eq. 4.55 and Eq. 4.58, for a sphere of radius R with 
frozen-in uniform polarization P (Ex. 4.2). Comment on the discrepancy. Which (if either) is the 
“true” energy of the system? 


4.4.4 Forces on Dielectrics 


Just as a conductor is attracted into an electric field (Eq. 2.51), so too is a 
dielectric — and for essentially the same reason: the bound charge tends to 
accumulate near the free charge of the opposite sign. But the calculation of 
forces on dielectrics can be surprisingly tricky. Consider, for example, the 
case of a slab of linear dielectric material, partially inserted between the 
plates of a parallel-plate capacitor (Fig. 4.33). We have always pretended 
that the field is uniform inside a parallel-plate capacitor, and zero outside. 
If this were literally true, there would be no net force on the dielectric at all, 
since the field everywhere would be perpendicular to the plates. However, 
there is in reality a fringing field around the edges, which for most purposes 
can be ignored, but in this case is responsible for the whole effect. (Indeed, 




the field could not terminate abruptly at the edge of the capacitor, for if it 
did, the line integral of E around the closed loop shown in Fig. 4.34 would 
not be zero.) It is this nonuniform fringing field that pulls the dielectric into 
the capacitor. 

Fringing fields are notoriously difficult to calculate; luckily, we can avoid 
this altogether, by the following ingenious method.²² Let W be the energy
of the system — it depends, of course, on the amount of overlap. If I pull 
the dielectric out an infinitesimal distance dx, the energy is changed by an 
amount equal to the work done: 


\[ dW = F_{\text{me}}\, dx, \tag{4.59} \]


where $F_{\text{me}}$ is the force I must exert, to counteract the electrical force F on
the dielectric: $F_{\text{me}} = -F$. Thus the electrical force on the slab is


\[ F = -\frac{dW}{dx}. \tag{4.60} \]
Now, the energy stored in the capacitor is
\[ W = \frac{1}{2}CV^2, \tag{4.61} \]
and the capacitance in this case is
\[ C = \frac{\epsilon_0 w}{d}(\epsilon_r l - \chi_e x), \tag{4.62} \]


where l is the length of the plates (Fig. 4.33). Let’s assume that the total
charge on the plates (Q = CV) is held constant, as the dielectric moves. In 
terms of Q, 


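\[ W = \frac{1}{2}\frac{Q^2}{C}, \tag{4.63} \]
so
\[ F = -\frac{dW}{dx} = \frac{1}{2}\frac{Q^2}{C^2}\frac{dC}{dx} = \frac{1}{2}V^2\frac{dC}{dx}. \tag{4.64} \]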


22 For a direct calculation from the fringing fields, see E. R. Dietz, Am. J. Phys. 72, 1499 
(2004). 
But
\[ \frac{dC}{dx} = -\frac{\epsilon_0 \chi_e w}{d}, \]
and hence
\[ F = -\frac{\epsilon_0 \chi_e w}{2d}V^2. \tag{4.65} \]


(The minus sign indicates that the force is in the negative x direction; the 
dielectric is pulled into the capacitor.) 
It is an infamous blunder to use Eq. 4.61 (with V constant), rather than 


Eq. 4.63 (with Q constant), in computing the force. One then obtains
\[ F = -\frac{1}{2}V^2\frac{dC}{dx}, \]

which is off by a sign. It is, of course, possible to maintain the capacitor at a 
fixed potential, by connecting it up to a battery. But in that case the battery 
also does work as the dielectric moves; instead of Eq. 4.59, we now have 


\[ dW = F_{\text{me}}\, dx + V\, dQ, \tag{4.66} \]


where V dQ is the work done by the battery. It follows that 


\[ F = -\frac{dW}{dx} + V\frac{dQ}{dx} = -\frac{1}{2}V^2\frac{dC}{dx} + V^2\frac{dC}{dx} = \frac{1}{2}V^2\frac{dC}{dx}, \tag{4.67} \]


the same as before (Eq. 4.64), with the correct sign. 

Please understand: The force on the dielectric cannot possibly depend 
on whether you plan to hold Q constant or V constant — it is determined 
entirely by the distribution of charge, free and bound, at the moment in 
question. It’s simpler to calculate the force assuming constant Q, because 
then you don’t have to worry about work done by the battery; but if you 
insist, it can be done correctly either way. 
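
The bookkeeping can be checked numerically. The sketch below is only an illustration, with assumed dimensions and hypothetical helper names. It computes the force three ways: at constant Q from Eq. 4.63, at constant V with the battery term of Eq. 4.67 included, and at constant V with the battery term forgotten (the blunder).

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m

def capacitance(x, chi_e=2.0, w=0.10, d=1e-3, length=0.10):
    """Eq. 4.62 with assumed plate dimensions."""
    return EPS0 * w / d * ((1.0 + chi_e) * length - chi_e * x)

def derivative(f, x, h=1e-6):
    """Central-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)

def force_constant_Q(x, V):
    Q = capacitance(x) * V                            # charge frozen at its value at x
    energy = lambda s: 0.5 * Q**2 / capacitance(s)    # Eq. 4.63
    return -derivative(energy, x)

def force_constant_V(x, V):
    energy = lambda s: 0.5 * capacitance(s) * V**2    # Eq. 4.61
    charge = lambda s: capacitance(s) * V
    return -derivative(energy, x) + V * derivative(charge, x)   # Eq. 4.67

def blunder(x, V):
    energy = lambda s: 0.5 * capacitance(s) * V**2
    return -derivative(energy, x)                     # battery work left out

x0, V0 = 0.03, 100.0
F_Q = force_constant_Q(x0, V0)
F_V = force_constant_V(x0, V0)
F_wrong = blunder(x0, V0)
print(F_Q, F_V, F_wrong)
```

The first two numbers agree; the third has the same magnitude and the opposite sign.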

Notice that we were able to determine the force without knowing anything 
about the fringing fields that are ultimately responsible for it! Of course, it’s 
built into the whole structure of electrostatics that ∇ × E = 0, and hence
that the fringing fields must be present; we’re not really getting something 
for nothing here — just cleverly exploiting the internal consistency of the 
theory. The energy stored in the fringing fields themselves (which was not 
accounted for in this derivation) stays constant, as the slab moves; what 
does change is the energy well inside the capacitor, where the field is nice 
and uniform. 


Problem 4.28. Two long coaxial cylindrical metal tubes (inner radius a, outer radius b)
stand vertically in a tank of dielectric oil (susceptibility χ_e, mass density ρ). The inner one is
maintained at potential V, and the outer one is grounded (Fig. 4.35). To what height (h) does
the oil rise, in the space between the tubes?


Fig. 4.35 


More Problems on Chapter 4 


Problem 4.29. 

(a) For the configuration in Prob. 4.5, calculate the force on p₂ due to p₁, and the force on p₁
due to p₂. Are the answers consistent with Newton’s third law?

(b) Find the total torque on p₂ with respect to the center of p₁, and compare it with the torque
on p₁ about that same point. [Hint: Combine your answer to (a) with the result of Prob. 4.5.]


Problem 4.30. An electric dipole p, pointing in the y direction, is placed midway between two 
large conducting plates, as shown in Fig. 4.36. Each plate makes a small angle θ with respect
to the x-axis, and they are maintained at potentials ±V. What is the direction of the net force
on p? (There’s nothing to calculate, here, but do explain your answer qualitatively.)


Fig. 4.36 
Problem 4.31. A point charge Q is “nailed down” on a table. Around it, at radius R, is a
frictionless circular track on which a dipole p rides, constrained always to point tangent to the
circle. Use Eq. 4.5 to show that the electric force on the dipole is

\mathbf{F} = \frac{Q}{4\pi\epsilon_0} \frac{\mathbf{p}}{R^3}.

Notice that this force is always in the “forward” direction (you can easily confirm this by drawing
a diagram showing the forces on the two ends of the dipole). Why isn’t this a perpetual motion
machine?²³


Problem 4.32. As a model for a polarizable atom, consider two point charges ±q connected
by a spring of constant k and equilibrium length zero. That is, the force on +q (with −q nailed
down at the origin) would be −kx, where x is their separation. (The “spring” here is a surrogate
for whatever holds the molecule together – it includes the electrical attraction of the other end.
If it bothers you that the force is taken to be proportional to the separation – not, say, to 1/r² –
look again at Ex. 4.1 or Probs. 4.2 and 4.3; realistic models, with the charges smeared out, lead
naturally to a Hooke’s law force, at least for small displacements.)


(a) Find the polarizability α, in terms of k and q.

(b) Now calculate the energy of an induced dipole p, in an electric field E. [Hint: The work 
required to bring in the charges and place them in position is still given by Eq. 4.6. But in 
addition to that we must stretch the spring to its appropriate length. How much work does 
that take?] 


Problem 4.33. Earnshaw’s theorem (Prob. 3.2) says that you cannot trap a charged particle in 
an electrostatic field. Question: Could you trap a neutral (but polarizable) atom in an electrostatic 
field? 


(a) Show that the force on the atom is F = (1/2)α∇(E²). Note that the dipole is drawn into a
region of stronger field.

(b) The question becomes, therefore, whether it is possible for E² to have a local maximum
(in a charge-free region). In that case the force would push the atom back to its equilibrium
position. Show that the answer is no. [Hint: Use Prob. 3.4a.]²⁴


Problem 4.34. A dielectric cube of side a, centered at the origin, carries a “frozen-in” polarization
P = kr, where k is a constant. Find all the bound charges, and check that they add up
to zero. 


Problem 4.35. The space between the plates of a parallel-plate capacitor is filled with dielec- 
tric material whose dielectric constant varies linearly from 1 at the bottom plate (x = 0) to 
2 at the top plate (x = d). The capacitor is connected to a battery of voltage V. Find all the 
bound charge, and check that the total is zero. 


23 This charming paradox was suggested by K. Brownstein. 
24 Interestingly, it can be done with oscillating fields. See Prob. 9.44. 
Problem 4.36. A point charge q is embedded at the center of a sphere of linear dielectric
material (with susceptibility χ_e and radius R). Find the electric field, the polarization, and the
bound charge densities, ρ_b and σ_b. What is the total bound charge on the surface? Where is
the compensating negative bound charge located?


Problem 4.37. At the interface between one linear dielectric and another, the electric field
lines bend (see Fig. 4.37). Show that

\tan\theta_2 / \tan\theta_1 = \epsilon_2 / \epsilon_1, (4.68)


assuming there is no free charge at the boundary. [Comment: Eq. 4.68 is reminiscent of Snell’s
law in optics. Would a convex “lens” of dielectric material tend to “focus,” or “defocus,” the electric
field?]


Fig. 4.37 


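Problem 4.38. A point dipole p is embedded at the center of a sphere of linear dielectric
material (with radius R and dielectric constant ε_r). Find the electric potential inside and outside
the sphere.

[Answer:

\frac{p\cos\theta}{4\pi\epsilon r^2} \left(1 + 2\frac{r^3}{R^3}\frac{\epsilon_r - 1}{\epsilon_r + 2}\right), \quad (r \le R);

\frac{p\cos\theta}{4\pi\epsilon_0 r^2} \left(\frac{3}{\epsilon_r + 2}\right), \quad (r \ge R).]
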
Problem 4.39. Prove the following uniqueness theorem: A volume V contains a specified free 
charge distribution, and various pieces of linear dielectric material, with the susceptibility of 
each one given. If the potential is specified on the boundaries S of V (V = 0 at infinity
would be suitable) then the potential throughout V is uniquely determined. [Hint: Integrate
∇ · (V₃D₃) over V.]


Fig. 4.38 


Problem 4.40. A conducting sphere at potential V₀ is half embedded in linear dielectric
material of susceptibility χ_e, which occupies the region z < 0 (Fig. 4.38). Claim: The potential
everywhere is exactly the same as it would have been in the absence of the dielectric! Check this
claim, as follows:

(a) Write down the formula for the proposed potential V(r), in terms of V₀, R, and r. Use it
to determine the field, the polarization, the bound charge, and the free charge distribution 
on the sphere. 

(b) Show that the resulting charge configuration would indeed produce the potential V(r). 

(c) Appeal to the uniqueness theorem in Prob. 4.39 to complete the argument. 

(d) Could you solve the configurations in Fig. 4.39 with the same potential? If not, explain why. 


(a) (b) 
Fig. 4.39 


Problem 4.41. According to Eq. 4.5, the force on a single dipole is (p · ∇)E, so the net force
on a dielectric object is

\mathbf{F} = \int (\mathbf{P} \cdot \nabla) \mathbf{E}_{\text{ext}} \, d\tau. (4.69)


[Here E_ext is the field of everything except the dielectric. You might assume that it wouldn’t
matter if you used the total field; after all, the dielectric can’t exert a force on itself. However,
because the field of the dielectric is discontinuous at the location of any bound surface charge,
the derivative introduces a spurious delta function, and it is safer to stick with E_ext.] Use Eq. 4.69
to determine the force on a tiny sphere, of radius R, composed of linear dielectric material
of susceptibility χ_e, which is situated a distance s from a fine wire carrying a uniform line
charge λ.


Problem 4.42. In a linear dielectric, the polarization is proportional to the field: P = ε₀χ_e E.
If the material consists of atoms (or nonpolar molecules), the induced dipole moment of each
one is likewise proportional to the field p = αE. Question: What is the relation between the
atomic polarizability α and the susceptibility χ_e?
Since P (the dipole moment per unit volume) is p (the dipole moment per atom) times N
(the number of atoms per unit volume), P = Np = NαE, one’s first inclination is to say that

\chi_e = \frac{N\alpha}{\epsilon_0}. (4.70)

And in fact this is not far off, if the density is low. But closer inspection reveals a subtle problem,
for the field E in Eq. 4.30 is the total macroscopic field in the medium, whereas the field in
Eq. 4.1 is due to everything except the particular atom under consideration (polarizability was
defined for an isolated atom subject to a specified external field); call this field E_else. Imagine
that the space allotted to each atom is a sphere of radius R, and show that

\mathbf{E} = \left(1 - \frac{N\alpha}{3\epsilon_0}\right) \mathbf{E}_{\text{else}}. (4.71)

Use this to conclude that

\chi_e = \frac{N\alpha/\epsilon_0}{1 - N\alpha/3\epsilon_0},

or

\alpha = \frac{3\epsilon_0}{N} \left(\frac{\epsilon_r - 1}{\epsilon_r + 2}\right). (4.72)


Equation 4.72 is known as the Clausius–Mossotti formula, or, in its application to optics, the
Lorentz–Lorenz equation.
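
To get a feel for the size of the correction, the two relations stated in this problem (Eqs. 4.70 and 4.72) can simply be evaluated. The sketch below is only an illustration: the function names are hypothetical, and the number density and polarizability are assumed round values of atomic scale, not entries from Table 4.1.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m

def chi_naive(N, alpha):
    """Eq. 4.70: chi_e = N alpha / eps0."""
    return N * alpha / EPS0

def chi_clausius_mossotti(N, alpha):
    """Susceptibility with the local-field correction that leads to Eq. 4.72."""
    x = N * alpha / EPS0
    return x / (1.0 - x / 3.0)

def alpha_from_eps_r(N, eps_r):
    """Eq. 4.72: alpha = (3 eps0 / N)(eps_r - 1)/(eps_r + 2)."""
    return 3.0 * EPS0 / N * (eps_r - 1.0) / (eps_r + 2.0)

N = 2.7e25                              # atoms per m^3 (assumed, gas-like density)
alpha = 4 * math.pi * EPS0 * 1e-30      # C^2 m / N (assumed, atomic scale)
chi_low = chi_naive(N, alpha)
chi_cm = chi_clausius_mossotti(N, alpha)
alpha_back = alpha_from_eps_r(N, 1.0 + chi_cm)   # round trip through Eq. 4.72
print(chi_low, chi_cm, alpha_back / alpha)
```

At this density the two susceptibilities differ only in the fourth significant figure, consistent with the remark that Eq. 4.70 is not far off when the density is low.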


Problem 4.43. Check the Clausius–Mossotti relation (Eq. 4.72) for the gases listed in Table 4.1.
(Dielectric constants are given in Table 4.2.) (The densities here are so small that Eqs. 4.70 and
4.72 are indistinguishable. For experimental data that confirm the Clausius–Mossotti correction
term see, for instance, the first edition of Purcell’s Electricity and Magnetism, Problem 9.28.)²⁵


Problem 4.44. The Clausius–Mossotti equation (Prob. 4.42) tells you how to calculate the
susceptibility of a nonpolar substance, in terms of the atomic polarizability α. The Langevin
equation tells you how to calculate the susceptibility of a polar substance, in terms of the
permanent molecular dipole moment p. Here’s how it goes:


(a) The energy of a dipole in an external field E is u = −p · E = −pE cos θ (Eq. 4.6), where
θ is the usual polar angle, if we orient the z-axis along E. Statistical mechanics says that
for a material in equilibrium at absolute temperature T, the probability of a given molecule
having energy u is proportional to the Boltzmann factor,

\exp(-u/kT).

The average energy of the dipoles is therefore

\langle u \rangle = \frac{\int u\, e^{-u/kT}\, d\Omega}{\int e^{-u/kT}\, d\Omega},

where dΩ = sin θ dθ dφ, and the integration is over all orientations (θ: 0 → π; φ: 0 → 2π).
Use this to show that the polarization of a substance containing N
molecules per unit volume is

P = Np[\coth(pE/kT) - (kT/pE)]. (4.73)

That’s the Langevin formula. Sketch P/Np as a function of pE/kT.

(b) Notice that for large fields/low temperatures, virtually all the molecules are lined up, and
the material is nonlinear. Ordinarily, however, kT is much greater than pE. Show that
in this régime the material is linear, and calculate its susceptibility, in terms of N, p, T,
and k. Compute the susceptibility of water at 20°C, and compare the experimental value
in Table 4.2. (The dipole moment of water is 6.1 × 10⁻³⁰ C·m.) This is rather far off,
because we have again neglected the distinction between E and E_else. The agreement is
better in low-density gases, for which the difference between E and E_else is negligible.
Try it for water vapor at 100°C and 1 atm.


25 E. M. Purcell, Electricity and Magnetism (Berkeley Physics Course, Vol. 2), McGraw-Hill,
New York (1963).


Magnetostatics 


5.1 The Lorentz Force Law 


5.1.1 Magnetic Fields 


Remember the basic problem of classical electrodynamics: we have a 
collection of charges q1, q2, q3,... (the “source” charges), and we want to 
calculate the force they exert on some other charge Q (the “test” charge — 
Fig. 2.1). According to the principle of superposition, it is sufficient to find 
the force of a single source charge — the total is then the vector sum of 
all the individual forces. Up to now, we have confined our attention to the 
simplest case, electrostatics, in which the source charge is at rest (though the 
test charge need not be). The time has come to consider the forces between 
charges in motion. 

To give you some sense of what is in store, imagine that I set up the fol- 
lowing demonstration: Two wires hang from the ceiling, a few centimeters 
apart; when I turn on a current, so that it passes up one wire and back down 
the other, the wires jump apart — they repel one another (Fig. 5.1a). How 
do we explain this? You might suppose that the battery (or whatever drives 
the current) is actually charging up the wire, and that the force is simply due 
to the electrical repulsion of like charges. But this is incorrect. I could hold 
up a test charge near these wires, and there would be no force on it,¹ for
the wires are in fact electrically neutral. (It’s true that electrons are flowing
down the line – that’s what a current is – but there are just as many stationary
plus charges as moving minus charges on any given segment.) Moreover, if
I hook up my demonstration so as to make the current flow up both wires 
(Fig. 5.1b), they are found to attract! 

Whatever force accounts for the attraction of parallel currents and the 
repulsion of antiparallel ones is not electrostatic in nature. It is our first 
encounter with a magnetic force. Whereas a stationary charge produces 
only an electric field E in the space around it, a moving charge generates, 
in addition, a magnetic field B. In fact, magnetic fields are a lot easier to 
detect, in practice — all you need is a Boy Scout compass. How these devices 
work is irrelevant at the moment; it is enough to know that the needle points 
in the direction of the local magnetic field. Ordinarily, this means north, in 
response to the Earth’s magnetic field, but in the laboratory, where typical 


1 This is not precisely true, as we shall see in Prob. 7.45.
Fig. 5.1 (a) Currents in opposite directions repel. (b) Currents in the same direction attract.

fields may be hundreds of times stronger than that, the compass indicates 
the direction of whatever magnetic field is present. 

Now, if you hold up a tiny compass in the vicinity of a current-carrying 
wire, you quickly discover a very peculiar thing: the field does not point 
toward the wire, nor away from it, but rather it circles around the wire. In 
fact, if you grab the wire with your right hand — thumb in the direction of 
the current — your fingers curl around in the direction of the magnetic field 
(Fig. 5.2). How can such a field lead to a force of attraction on a nearby 
parallel current? At the second wire, the magnetic field points into the page 
(Fig. 5.3), the current is upward, and yet the resulting force is to the left! It’s 
going to take a strange law to account for these directions. 
5.1.2 Magnetic Forces


In fact, this combination of directions is just right for a cross product: the 
magnetic force on a charge Q, moving with velocity v in a magnetic field 


B, is²

\mathbf{F}_{\text{mag}} = Q(\mathbf{v} \times \mathbf{B}). (5.1)


This is known as the Lorentz force law.³ In the presence of both electric and
magnetic fields, the net force on Q would be

\mathbf{F} = Q[\mathbf{E} + (\mathbf{v} \times \mathbf{B})]. (5.2)


I do not pretend to have derived Eq. 5.1, of course; it is a fundamental 
axiom of the theory, whose justification is to be found in experiments such 
as the one I described in Section 5.1.1. 

Our main job from now on is to calculate the magnetic field B (and for 
that matter the electric field E as well; the rules are more complicated when 
the source charges are in motion). But before we proceed, it is worthwhile 
to take a closer look at the Lorentz force law itself; it is a peculiar law, and 
it leads to some truly bizarre particle trajectories. 


Example 5.1. (Cyclotron motion) The archetypical motion of a 
charged particle in a magnetic field is circular, with the magnetic force 
providing the centripetal acceleration. In Fig. 5.4, a uniform magnetic 
field points into the page; if the charge Q moves counterclockwise, 
with speed v, around a circle of radius R, the magnetic force points 
inward, and has a fixed magnitude QvB — just right to sustain uniform 
circular motion: 


QvB = m\frac{v^2}{R}, \quad \text{or} \quad p = QBR, (5.3)


where m is the particle’s mass and p = mv is its momentum. 
Equation 5.3 is known as the cyclotron formula because it describes the 
motion of a particle in a cyclotron — the first of the modern particle 
accelerators. It also suggests a simple experimental technique for 
finding the momentum of a charged particle: send it through a region 
of known magnetic field, and measure the radius of its trajectory. 
This is in fact the standard means for determining the momenta of 
elementary particles. 

I assumed that the charge moves in a plane perpendicular to B. If it
starts out with some additional speed v_∥ parallel to B, this component
of the motion is unaffected by the magnetic field, and the particle
moves in a helix (Fig. 5.5). The radius is still given by Eq. 5.3, but
the velocity in question is now the component perpendicular to B: call
it v_⊥.
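
The cyclotron formula is easy to test by brute force. What follows is a minimal sketch, not a prescribed method: it integrates the magnetic force law (Eq. 5.1) with a hand-written fourth-order Runge-Kutta step and measures the radius of the resulting helix. The choice of a proton, the speeds, the field strength, the orientation of B along z, and all function names are assumptions made for illustration.

```python
import math

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def deriv(state, q_over_m, B):
    """Time derivative of (x, y, z, vx, vy, vz) under F = Q v x B."""
    x, y, z, vx, vy, vz = state
    ax, ay, az = (q_over_m * c for c in cross((vx, vy, vz), B))
    return (vx, vy, vz, ax, ay, az)

def rk4_step(state, dt, q_over_m, B):
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = deriv(state, q_over_m, B)
    k2 = deriv(shifted(state, k1, dt / 2), q_over_m, B)
    k3 = deriv(shifted(state, k2, dt / 2), q_over_m, B)
    k4 = deriv(shifted(state, k3, dt), q_over_m, B)
    return tuple(s + dt / 6 * (a + 2*b + 2*c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def helix_orbit(Q, m, v_perp, v_par, B_mag, steps=2000):
    """Integrate one cyclotron period; return (radius, final speed, final z)."""
    B = (0.0, 0.0, B_mag)
    period = 2 * math.pi * m / (abs(Q) * B_mag)
    dt = period / steps
    state = (0.0, 0.0, 0.0, v_perp, 0.0, v_par)
    xs = [state[0]]
    for _ in range(steps):
        state = rk4_step(state, dt, Q / m, B)
        xs.append(state[0])
    speed = math.sqrt(state[3]**2 + state[4]**2 + state[5]**2)
    return (max(xs) - min(xs)) / 2, speed, state[2]

Q, m = 1.602176634e-19, 1.67262192e-27     # proton charge (C) and mass (kg)
v_perp, v_par, B_mag = 1.0e6, 2.0e5, 0.5   # assumed speeds (m/s) and field (T)
R_num, speed, z_end = helix_orbit(Q, m, v_perp, v_par, B_mag)
R_formula = m * v_perp / (Q * B_mag)       # Eq. 5.3 with the perpendicular speed
print(R_num, R_formula, speed, z_end)
```

The measured radius matches Eq. 5.3 evaluated with the perpendicular speed alone, the speed is unchanged after a full turn, and the particle has simply drifted along B at its parallel speed.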


2 Since F and v are vectors, B is actually a pseudovector. 
3 Actually, it is due to Oliver Heaviside. 
Example 5.2. (Cycloid motion) A more exotic trajectory occurs if we
include a uniform electric field, at right angles to the magnetic one. 
Suppose, for instance, that B points in the x direction, and E in the 
z direction, as shown in Fig. 5.6. A positive charge is released from the 
origin; what path will it follow? 


Solution. Let’s think it through qualitatively, first. Initially, the par- 
ticle is at rest, so the magnetic force is zero, and the electric field 
accelerates the charge in the z direction. As it picks up speed, a 
magnetic force develops which, according to Eq. 5.1, pulls the charge 
around to the right. The faster it goes, the stronger F_mag becomes;
eventually, it curves the particle back around toward the y-axis. At this 
point the charge is moving against the electrical force, so it begins to 
slow down — the magnetic force then decreases, and the electrical force 
takes over, bringing the particle to rest at point a, in Fig. 5.6. There the 
entire process commences anew, carrying the particle over to point b, 
and so on. 

Now let’s do it quantitatively. There being no force in the x direction, 
the position of the particle at any time t can be described by the vector
r(t) = (0, y(t), z(t)); the velocity is therefore

\mathbf{v} = (0, \dot{y}, \dot{z}),

where the dots denote time derivatives. Thus

\mathbf{v} \times \mathbf{B} = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ 0 & \dot{y} & \dot{z} \\ B & 0 & 0 \end{vmatrix} = B\dot{z}\,\hat{\mathbf{y}} - B\dot{y}\,\hat{\mathbf{z}},

and hence, applying Newton’s second law,

\mathbf{F} = Q(\mathbf{E} + \mathbf{v} \times \mathbf{B}) = Q(E\,\hat{\mathbf{z}} + B\dot{z}\,\hat{\mathbf{y}} - B\dot{y}\,\hat{\mathbf{z}}) = m\mathbf{a} = m(\ddot{y}\,\hat{\mathbf{y}} + \ddot{z}\,\hat{\mathbf{z}}).

Or, treating the y and z components separately,

QB\dot{z} = m\ddot{y}, \qquad QE - QB\dot{y} = m\ddot{z}.

For convenience, let

\omega \equiv \frac{QB}{m}. (5.4)

(This is the cyclotron frequency, at which the particle would revolve in
the absence of any electric field.) Then the equations of motion take
the form

\ddot{y} = \omega\dot{z}, \qquad \ddot{z} = \omega\left(\frac{E}{B} - \dot{y}\right). (5.5)

Their general solution⁴ is

y(t) = C_1\cos\omega t + C_2\sin\omega t + (E/B)t + C_3,
z(t) = C_2\cos\omega t - C_1\sin\omega t + C_4. (5.6)

But the particle started from rest (ẏ(0) = ż(0) = 0), at the origin
(y(0) = z(0) = 0); these four conditions determine the constants C₁,
C₂, C₃, and C₄:

y(t) = \frac{E}{\omega B}(\omega t - \sin\omega t), \qquad z(t) = \frac{E}{\omega B}(1 - \cos\omega t). (5.7)

In this form, the answer is not terribly enlightening,⁵ but if we let

R \equiv \frac{E}{\omega B}, (5.8)

and eliminate the sines and cosines by exploiting the trigonometric
identity sin² ωt + cos² ωt = 1, we find that

(y - R\omega t)^2 + (z - R)^2 = R^2. (5.9)

This is the formula for a circle, of radius R, whose center (0, Rωt, R)
travels in the y direction at a constant speed

u = \omega R = \frac{E}{B}. (5.10)


The particle moves as though it were a spot on the rim of a wheel 
rolling along the y-axis. The curve generated in this way is called a 
cycloid. Notice that the overall motion is not in the direction of E, as 
you might expect, but perpendicular to it. 
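
For readers who would like to see the cycloid emerge from the equations of motion themselves, here is a minimal sketch under stated assumptions: units in which E = B = Q = m = 1, a hand-written Runge-Kutta integrator, and hypothetical function names. It integrates Eq. 5.5 from rest and compares the result with the closed form of Eq. 5.7 and the rolling-circle relation of Eq. 5.9.

```python
import math

def cycloid(t, E, B, Q, m):
    """Eq. 5.7: position (y, z) of a charge released from rest at the origin."""
    omega = Q * B / m            # Eq. 5.4
    R = E / (omega * B)          # Eq. 5.8
    return R * (omega * t - math.sin(omega * t)), R * (1 - math.cos(omega * t))

def integrate(t_end, E, B, Q, m, steps=20000):
    """RK4 integration of Eq. 5.5: y'' = omega z', z'' = omega (E/B - y')."""
    omega = Q * B / m
    def f(s):
        y, z, vy, vz = s
        return (vy, vz, omega * vz, omega * (E / B - vy))
    s, dt = (0.0, 0.0, 0.0, 0.0), t_end / steps
    for _ in range(steps):
        k1 = f(s)
        k2 = f(tuple(a + dt / 2 * b for a, b in zip(s, k1)))
        k3 = f(tuple(a + dt / 2 * b for a, b in zip(s, k2)))
        k4 = f(tuple(a + dt * b for a, b in zip(s, k3)))
        s = tuple(a + dt / 6 * (p + 2*q + 2*r + w)
                  for a, p, q, r, w in zip(s, k1, k2, k3, k4))
    return s[0], s[1]

E, B, Q, m, t = 1.0, 1.0, 1.0, 1.0, 7.0    # assumed units, so omega = R = 1
y_exact, z_exact = cycloid(t, E, B, Q, m)
y_num, z_num = integrate(t, E, B, Q, m)
R, omega = E * m / (Q * B * B), Q * B / m
circle = (y_exact - R * omega * t)**2 + (z_exact - R)**2   # left side of Eq. 5.9
print(y_num - y_exact, z_num - z_exact, circle - R**2)
```

All three printed differences are zero to within rounding and integration error.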


4 As coupled differential equations, they are easily solved by differentiating the first and using
the second to eliminate z̈.
5 Notice that v² = ẏ² + ż² = [(2E/B) sin(ωt/2)]², so this solution (which was based on the
nonrelativistic form of Newton’s second law) is valid only when 2E ≪ cB, where c is the
speed of light.
One implication of the Lorentz force law (Eq. 5.1) deserves special
attention:


magnetic forces do no work.


For if Q moves an amount dl = v dt, the work done is

dW_{\text{mag}} = \mathbf{F}_{\text{mag}} \cdot d\mathbf{l} = Q(\mathbf{v} \times \mathbf{B}) \cdot \mathbf{v}\,dt = 0. (5.11)


This follows because (v × B) is perpendicular to v, so (v × B) · v = 0. Magnetic
forces may alter the direction in which a particle moves, but they cannot 
speed it up or slow it down. The fact that magnetic forces do no work is an 
elementary and direct consequence of the Lorentz force law, but there are 
many situations in which it appears so manifestly false that one’s confidence 
is bound to waver. When a magnetic crane lifts the carcass of a junked car, 
for instance, something is obviously doing work, and it seems perverse to 
deny that the magnetic force is responsible. Well, perverse or not, deny it we 
must, and it can be a very subtle matter to figure out who does deserve the 
credit in such circumstances. We’ll see a cute example in the next section, 
but the full story will have to await Chapter 8. 


Problem 5.1. A particle of charge q enters a region of uniform magnetic field B (pointing into 
the page). The field deflects the particle a distance d above the original line of flight, as shown 
in Fig. 5.7. Is the charge positive or negative? In terms of a, d, B, and q, find the momentum of 
the particle. 


Fig. 5.7 


Problem 5.2. Find and sketch the trajectory of the particle in Ex. 5.2, if it starts at the origin 
with velocity 


(a) v(0) = (E/B)ŷ;
(b) v(0) = (E/2B)ŷ;
(c) v(0) = (E/B)(ŷ + ẑ).


Problem 5.3. In 1897, J. J. Thomson “discovered” the electron by measuring the charge-to- 
mass ratio of “cathode rays” (actually, streams of electrons, with charge q and mass m) as
follows: 


(a) First he passed the beam through uniform crossed electric and magnetic fields E and B 
(mutually perpendicular, and both of them perpendicular to the beam), and adjusted the 
electric field until he got zero deflection. What, then, was the speed of the particles (in terms
of E and B)?

(b) Then he turned off the electric field, and measured the radius of curvature, R, of the beam,
as deflected by the magnetic field alone. In terms of E, B, and R, what is the charge-to-
mass ratio (q/m) of the particles?


5.1.3 Currents


The current in a wire is the charge per unit time passing a given point. By 
definition, negative charges moving to the left count the same as positive 
ones to the right. This conveniently reflects the physical fact that almost all 
phenomena involving moving charges depend on the product of charge and 
velocity — if you reverse the signs of q and v, you get the same answer, so 
it doesn’t really matter which you have. (The Lorentz force law is a case in 
point; the Hall effect (Prob. 5.44) is a notorious exception.) In practice, it 
is ordinarily the negatively charged electrons that do the moving — in the 
direction opposite to the electric current. To avoid the petty complications 
this entails, I shall often pretend it’s the positive charges that move, as in 
fact everyone assumed they did for a century or so after Benjamin Franklin 
established his unfortunate convention. Current is measured in coulombs 
per second, or amperes (A): 


1A = 1C/s. (5.12) 
A line charge λ traveling down a wire at speed v (Fig. 5.8) constitutes a
current

I = \lambda v, (5.13)

because a segment of length vΔt, carrying charge λvΔt, passes point P in a
time interval Δt. Current is actually a vector:

\mathbf{I} = \lambda\mathbf{v}. (5.14)


Because the path of the flow is dictated by the shape of the wire, one doesn’t 
ordinarily bother to display the direction of I explicitly, but when it comes 
to surface and volume currents we cannot afford to be so casual, and for 
the sake of notational consistency it is a good idea to acknowledge the 




6 If we called the electron plus and the proton minus, the problem would never arise. In the
context of Franklin’s experiments with cat’s fur and glass rods, the choice was completely 
arbitrary. 
vectorial character of currents right from the start.⁷ A neutral wire, of
course, contains as many stationary positive charges as mobile negative
ones. The former do not contribute to the current – the charge density λ in
Eq. 5.13 refers only to the moving charges. In the unusual situation where
both types move, I = λ₊v₊ + λ₋v₋.⁸

The magnetic force on a segment of current-carrying wire is 


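\mathbf{F}_{\text{mag}} = \int (\mathbf{v} \times \mathbf{B})\,dq = \int (\mathbf{v} \times \mathbf{B})\lambda\,dl = \int (\mathbf{I} \times \mathbf{B})\,dl. (5.15)


Inasmuch as I and dl both point in the same direction, we can just as well
write this as

\mathbf{F}_{\text{mag}} = \int I (d\mathbf{l} \times \mathbf{B}). (5.16)


Typically, the current is constant (in magnitude) along the wire, and in that
case I comes outside the integral:

\mathbf{F}_{\text{mag}} = I \int (d\mathbf{l} \times \mathbf{B}). (5.17)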


Example 5.3. A rectangular loop of wire, supporting a mass m, hangs 
vertically with one end in a uniform magnetic field B, which points 
into the page in the shaded region of Fig. 5.9. For what current I,
in the loop, would the magnetic force upward exactly balance the 
gravitational force downward? 


Solution. First of all, the current must circulate clockwise, in order 
for (I x B) in the horizontal segment to point upward. The force is 


F_{\text{mag}} = IBa,


where a is the width of the loop. (The magnetic forces on the two 
vertical segments cancel.) For Fmag to balance the weight (mg), we must 
therefore have 


7 If it bothers you to treat current as a vector, please go to my website and read the note “Is
Current a Vector?”: www.reed.edu/physics/faculty/griffiths/VectorCurrent.pdf.

8 In principle there could even be multiple species of charge (some positive and some
negative), each with its own density and velocity; in that case I = Σ_i λ_i v_i. But ordinarily
the simple form (5.14) will suffice.
I = \frac{mg}{Ba}. (5.18)

The weight just hangs there, suspended in midair!
What happens if we now increase the current? Then the upward
magnetic force exceeds the downward force of gravity, and the loop
rises, lifting the weight. Somebody’s doing work, and it sure looks as
though the magnetic force is responsible. Indeed, one is tempted to
write

W_{\text{mag}} = F_{\text{mag}} h = IBah, (5.19)


where h is the distance the loop rises. But we know that magnetic forces
never do work. What’s going on here?

Well, when the loop starts to rise, the charges in the wire are no 
longer moving horizontally — their velocity now acquires an upward 
component u, the speed of the loop (Fig. 5.10), in addition to the 
horizontal component w associated with the current (I = λw). The
magnetic force, which is always perpendicular to the velocity, no longer
points straight up, but tilts back. It is perpendicular to the net displacement
of the charge (which is in the direction of v), and therefore it does
no work on q. It does have a vertical component (qwB); the net vertical
force on all the charge (λa) in the upper segment of the loop is

F_{\text{vert}} = \lambda a w B = IBa (5.20)


(as before); but now it also has a horizontal component (quB), which 
opposes the flow of current. Whoever is in charge of maintaining that 
current, therefore, must now push those charges along, against the 
backward component of the magnetic force. 

The total horizontal force on the top segment is 


F_{\text{horiz}} = \lambda a u B. (5.21)


In a time dt, the charges move a (horizontal) distance w dt, so the work 
done by this agency (presumably a battery or a generator) is 


W_{\text{battery}} = \lambda a B \int u w \, dt = IBah,


which is precisely what we naïvely attributed to the magnetic force in 
Eq. 5.19. Was work done in this process? Absolutely! Who did it? 




The battery! What, then, was the role of the magnetic force? Well, it
redirected the horizontal force of the battery into the vertical motion
of the loop and the weight.⁹
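
The accounting in this example can be spelled out numerically. The sketch below is an illustration only: the values of λ, w, a, and B, the rising speed u(t), and the function names are all assumptions. It adds up, step by step, the work done by the battery against the horizontal force of Eq. 5.21 and the net work done by the magnetic force (vertical part minus horizontal part).

```python
def balancing_current(m, B, a, g=9.81):
    """Eq. 5.18: current for which the magnetic force IBa equals the weight mg."""
    return m * g / (B * a)

def lift(lam, w, a, B, u_of_t, t_end, steps=10000):
    """Accumulate battery work, net magnetic work, and height as the loop rises."""
    dt = t_end / steps
    battery_work = magnetic_work = height = 0.0
    for i in range(steps):
        u = u_of_t((i + 0.5) * dt)           # upward speed of the loop (midpoint rule)
        F_vert = lam * a * w * B             # Eq. 5.20, upward
        F_horiz = lam * a * u * B            # Eq. 5.21, opposing the current
        battery_work += F_horiz * w * dt     # battery pushes the charge a distance w dt
        magnetic_work += (F_vert * u - F_horiz * w) * dt
        height += u * dt
    return battery_work, magnetic_work, height

lam, w, a, B = 2.0, 0.5, 0.1, 0.8            # assumed SI values; I = lam * w = 1 A
W_batt, W_mag, h = lift(lam, w, a, B, u_of_t=lambda t: 0.2 * t, t_end=1.0)
I = lam * w
print(W_batt, I * B * a * h, W_mag)
```

The battery's work comes out equal to IBah, while the net magnetic work vanishes at every step, however the loop happens to accelerate.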

It may help to consider a mechanical analogy. Imagine you’re 
sliding a trunk up a frictionless ramp, by pushing on it horizontally 
with a mop (Fig. 5.11). The normal force (N) does no work, because 
it is perpendicular to the displacement. But it does have a vertical 
component (which in fact is what lifts the trunk), and a (backward) 
horizontal component (which you have to overcome by pushing on the 
mop). Who is doing the work here? You are, obviously — and yet your 
force (which is purely horizontal) is not (at least, not directly) what lifts 
the box. The normal force plays the same passive (but crucial) role as 
the magnetic force in Ex. 5.3: while doing no work itself, it redirects 
the efforts of the active agent (you, or the battery, as the case may be), 
from horizontal to vertical. 


When charge flows over a surface, we describe it by the surface current 
density, K, defined as follows: Consider a “ribbon” of infinitesimal width 
dl_⊥, running parallel to the flow (Fig. 5.12). If the current in this ribbon is
dI, the surface current density is

\mathbf{K} \equiv \frac{d\mathbf{I}}{dl_\perp}. (5.22)


9 If you like, the vertical component of F_mag does work lifting the weight, but the horizontal
component does equal negative work opposing the current. However you look at it, the net
work done by the magnetic force is zero.


In words, K is the current per unit width. In particular, if the (mobile) surface
charge density is σ and its velocity is v, then

\mathbf{K} = \sigma\mathbf{v}. (5.23)

In general, K will vary from point to point over the surface, reflecting
variations in σ and/or v. The magnetic force on the surface current is

\mathbf{F}_{\text{mag}} = \int (\mathbf{v} \times \mathbf{B})\sigma\,da = \int (\mathbf{K} \times \mathbf{B})\,da. (5.24)


Caveat: Just as E suffers a discontinuity at a surface charge, so B is 
discontinuous at a surface current. In Eq. 5.24, you must be careful to use 
the average field, just as we did in Section 2.5.3. 

When the flow of charge is distributed throughout a three-dimensional 
region, we describe it by the volume current density, J, defined as follows: 
Consider a “tube” of infinitesimal cross section da_⊥, running parallel to the
flow (Fig. 5.13). If the current in this tube is dI, the volume current density is

\mathbf{J} \equiv \frac{d\mathbf{I}}{da_\perp}. (5.25)


In words, J is the current per unit area. If the (mobile) volume charge density
is ρ and the velocity is v, then

\mathbf{J} = \rho\mathbf{v}. (5.26)


The magnetic force on a volume current is therefore

\mathbf{F}_{\text{mag}} = \int (\mathbf{v} \times \mathbf{B})\rho\,d\tau = \int (\mathbf{J} \times \mathbf{B})\,d\tau. (5.27)


Example 5.4. 


(a) A current I is uniformly distributed over a wire of circular cross
section, with radius a (Fig. 5.14). Find the volume current density
J.

Solution. The area (perpendicular to the flow) is πa², so

J = \frac{I}{\pi a^2}.

This was trivial because the current density was uniform.



(b) Suppose the current density in the wire is proportional to the 
distance from the axis, 


J=ks 
(for some constant k). Find the total current in the wire. 


Solution. Because J varies with s, we must integrate Eq. 5.25. The
current through the shaded patch (Fig. 5.15) is J da_⊥, and da_⊥ =
s ds dφ. So

$$I = \int (ks)(s\,ds\,d\phi) = 2\pi k\int_0^a s^2\,ds = \frac{2\pi k a^3}{3}.$$
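As a quick numerical sanity check on part (b), the sketch below adds up J da_⊥ over thin rings and compares the sum with 2πka³/3. The function name `total_current`, the number of rings, and the sample values of k and a are illustrative assumptions, not part of the example.

```python
import math

def total_current(k, a, n=10_000):
    """Midpoint-rule estimate of I = integral of J da_perp for J = k*s on a disk of radius a."""
    ds = a / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * ds                          # radius at the middle of ring i
        total += (k * s) * (2 * math.pi * s * ds)   # J(s) times the ring area 2*pi*s*ds
    return total

k, a = 3.0, 0.02                                    # illustrative values (assumed)
numeric = total_current(k, a)
exact = 2 * math.pi * k * a**3 / 3                  # closed form from part (b)
print(numeric, exact)
```

The two printed numbers should agree to many digits; the small residual difference comes from the finite ring width.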


According to Eq. 5.25, the total current crossing a surface S can be
written as

$$I = \int_S J\,da_\perp = \int_S \mathbf{J}\cdot d\mathbf{a}.$$ (5.28)

(The dot product serves neatly to pick out the perpendicular component of
da.) In particular, the charge per unit time leaving a volume V is

$$\oint_S \mathbf{J}\cdot d\mathbf{a} = \int_V (\nabla\cdot\mathbf{J})\,d\tau.$$

Because charge is conserved, whatever flows out through the surface must
come at the expense of what remains inside:

$$\int_V (\nabla\cdot\mathbf{J})\,d\tau = -\frac{d}{dt}\int_V \rho\,d\tau = -\int_V \left(\frac{\partial\rho}{\partial t}\right)d\tau.$$

(The minus sign reflects the fact that an outward flow decreases the charge
left in V.) Since this applies to any volume, we conclude that

$$\nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t}.$$ (5.29)


This is the precise mathematical statement of local charge conservation; it 
is called the continuity equation. 
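
A discrete analogue may make Eq. 5.29 more concrete. In the sketch below, charge lives in cells along a line and current flows across the faces between them; each cell's density changes only by the net current leaving it, so the total charge stays fixed as long as no current crosses the two outer ends. The one-dimensional setting, the function name `continuity_step`, and the sample numbers are all assumptions made for illustration.

```python
def continuity_step(rho, J_faces, dx, dt):
    """Advance cell densities by one step of d(rho)/dt = -dJ/dx.

    rho holds N cell values; J_faces holds N+1 face values, where J_faces[i]
    is the current density flowing into cell i from the left.
    """
    return [r - dt * (J_faces[i + 1] - J_faces[i]) / dx
            for i, r in enumerate(rho)]

rho = [0.0, 2.0, 5.0, 1.0]              # charge density in four cells (assumed)
J_faces = [0.0, 0.3, -0.4, 0.2, 0.0]    # no current through the two outer ends
dx, dt = 0.1, 0.01
new_rho = continuity_step(rho, J_faces, dx, dt)
print(sum(rho) * dx, sum(new_rho) * dx)  # total charge before and after the step
```

Individual cells gain or lose charge, but the two totals agree up to rounding, which is the local conservation statement in discrete form.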

For future reference, let me summarize the “dictionary” we have devel- 
oped for translating equations into the forms appropriate to point, line, 
surface, and volume currents: 


$$\sum_{i=1}^{n} (\;)\,q_i\mathbf{v}_i \;\sim\; \int_{\text{line}} (\;)\,\mathbf{I}\,dl \;\sim\; \int_{\text{surface}} (\;)\,\mathbf{K}\,da \;\sim\; \int_{\text{volume}} (\;)\,\mathbf{J}\,d\tau.$$ (5.30)

This correspondence, which is analogous to q ~ λ dl ~ σ da ~ ρ dτ for the
various charge distributions, generates Eqs. 5.15, 5.24, and 5.27 from the
original Lorentz force law (5.1).


Problem 5.4. Suppose that the magnetic field in some region has the form

$$\mathbf{B} = kz\,\hat{\mathbf{x}}$$

(where k is a constant). Find the force on a square loop (side length a), lying in the yz plane and
centered at the origin, if it carries a current I, flowing counterclockwise, when you look down
the x-axis.


Problem 5.5. A current I flows down a wire of radius a.


(a) If it is uniformly distributed over the surface, what is the surface current density K?
(b) If it is distributed in such a way that the volume current density is inversely proportional to
the distance from the axis, what is J(s)?


Problem 5.6. 


(a) A phonograph record carries a uniform density of “static electricity” σ. If it rotates at angular
velocity ω, what is the surface current density K at a distance r from the center?

(b) A uniformly charged solid sphere, of radius R and total charge Q, is centered at the origin
and spinning at a constant angular velocity ω about the z-axis. Find the current density J
at any point (r, θ, φ) within the sphere.


Problem 5.7. For a configuration of charges and currents confined within a volume V, 
show that 


$$\int_V \mathbf{J}\,d\tau = \frac{d\mathbf{p}}{dt},$$ (5.31)

where p is the total dipole moment. [Hint: Evaluate $\int_V \nabla\cdot(x\mathbf{J})\,d\tau$.]


5.2 The Biot-Savart Law 


5.2.1 Steady Currents 


Stationary charges produce electric fields that are constant in time; hence
the term electrostatics.¹⁰ Steady currents produce magnetic fields that are
constant in time; the theory of steady currents is called magnetostatics.


10 Actually, it is not necessary that the charges be stationary, but only that the charge density
at each point be constant. For example, the sphere in Prob. 5.6b produces an electrostatic
field (1/4πε₀)(Q/r²) r̂, even though it is rotating, because ρ does not depend on t.

Stationary charges ⇒ constant electric fields: electrostatics.
Steady currents ⇒ constant magnetic fields: magnetostatics.


By steady current I mean a continuous flow that has been going on forever, 
without change and without charge piling up anywhere. (Some people 
call them “stationary currents”; to my ear, that’s an oxymoron.) Formally, 
electro/magnetostatics is the régime 


$$\frac{\partial\rho}{\partial t} = 0, \qquad \frac{\partial\mathbf{J}}{\partial t} = 0,$$ (5.32)
at all places and all times. Of course, there’s no such thing in practice as a 
truly steady current, any more than there is a truly stationary charge. In this 
sense, both electrostatics and magnetostatics describe artificial worlds that 
exist only in textbooks. However, they represent suitable approximations as 
long as the actual fluctuations are remote, or gradual — in fact, for most 
purposes magnetostatics applies very well to household currents, which 
alternate 120 times a second! 

Notice, however, that a moving point charge cannot possibly constitute a 
steady current. If it’s here one instant, it’s gone the next. This may seem like 
a minor thing to you, but it’s a major headache for me. I developed each 
topic in electrostatics by starting out with the simple case of a point charge 
at rest; then I generalized to an arbitrary charge distribution by invoking the 
superposition principle. This approach is not open to us in magnetostatics 
because a moving point charge does not produce a static field in the first 
place. We are forced to deal with extended current distributions right from 
the start, and, as a result, the arguments are bound to be more cumbersome. 

When a steady current flows in a wire, its magnitude I must be the
same all along the line; otherwise, charge would be piling up somewhere,
and it wouldn’t be a steady current. More generally, since ∂ρ/∂t = 0 in
magnetostatics, the continuity equation (5.29) becomes

$$\nabla\cdot\mathbf{J} = 0.$$ (5.33)


5.2.2 The Magnetic Field of a Steady Current 


The magnetic field of a steady line current is given by the Biot—Savart law: 


$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int \frac{\mathbf{I}\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,dl' = \frac{\mu_0}{4\pi}\,I\int \frac{d\mathbf{l}'\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}.$$ (5.34)

The integration is along the current path, in the direction of the flow; dl′ is
an element of length along the wire, and $\boldsymbol{\mathscr{r}}$, as always, is the vector from the
source to the point r (Fig. 5.16). The constant μ₀ is called the permeability
of free space:¹¹

$$\mu_0 = 4\pi\times 10^{-7}\ \text{N/A}^2.$$ (5.35)

These units are such that B itself comes out in newtons per ampere-meter
(as required by the Lorentz force law), or teslas (T):¹²

$$1\ \text{T} = 1\ \text{N/(A·m)}.$$ (5.36)

As the starting point for magnetostatics, the Biot-Savart law plays a
role analogous to Coulomb’s law in electrostatics. Compare Eq. 5.34 with
Eq. 2.6; the $1/\mathscr{r}^2$ dependence is common to both. Like Coulomb’s, the
Biot-Savart law is empirical: it comes from experiment. The cross product,
which might at first seem strange, is just right to account for the observed
directions in Fig. 5.2.


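Example 5.5. Find the magnetic field a distance s from a long straight
wire carrying a steady current I (Fig. 5.17).


Solution. In the diagram, $(d\mathbf{l}'\times\hat{\boldsymbol{\mathscr{r}}})$ points out of the page, and has the
magnitude

$$dl'\sin\alpha = dl'\cos\theta.$$

Also, l′ = s tan θ, so

$$dl' = \frac{s}{\cos^2\theta}\,d\theta,$$

and $s = \mathscr{r}\cos\theta$, so

$$\frac{1}{\mathscr{r}^2} = \frac{\cos^2\theta}{s^2}.$$

Thus

$$B = \frac{\mu_0 I}{4\pi}\int_{\theta_1}^{\theta_2}\left(\frac{\cos^2\theta}{s^2}\right)\left(\frac{s}{\cos^2\theta}\right)\cos\theta\,d\theta = \frac{\mu_0 I}{4\pi s}\int_{\theta_1}^{\theta_2}\cos\theta\,d\theta = \frac{\mu_0 I}{4\pi s}(\sin\theta_2 - \sin\theta_1).$$ (5.37)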


11 This is an exact number, not an empirical constant. It serves (via Eq. 5.40) to define the 
ampere, and the ampere in turn defines the coulomb. 

12 For some reason, in this one case the cgs unit (the gauss) is more commonly used than the
SI unit: 1 tesla = 10⁴ gauss. The Earth’s magnetic field is about half a gauss; a fairly strong
laboratory magnetic field is, say, 1 T (10,000 gauss).


Equation 5.37 gives the field of any straight segment of wire, in
terms of the initial and final angles θ₁ and θ₂ (Fig. 5.18). Of course,
a finite segment by itself could never support a steady current (where
would the charge go when it got to the end?), but it might be a piece of
some closed circuit, and Eq. 5.37 would then represent its contribution
to the total field. In the case of an infinite wire, θ₁ = −π/2 and
θ₂ = π/2, so we obtain

$$B = \frac{\mu_0 I}{2\pi s}.$$ (5.38)

Notice that the field is inversely proportional to the distance from the
wire, just like the electric field of an infinite line charge. In the region
below the wire, B points into the page, and in general, it “circles around”
the wire, in accordance with the right-hand rule (Fig. 5.2):

$$\mathbf{B} = \frac{\mu_0 I}{2\pi s}\,\hat{\boldsymbol{\phi}}.$$ (5.39)


As an application, let’s find the force of attraction between two long,
parallel wires a distance d apart, carrying currents I₁ and I₂ (Fig. 5.19).
The field at (2) due to (1) is

$$B = \frac{\mu_0 I_1}{2\pi d},$$

and it points into the page. The Lorentz force law (in the form
appropriate to line currents, Eq. 5.17) predicts a force directed toward
(1), of magnitude

$$F = I_2\left(\frac{\mu_0 I_1}{2\pi d}\right)\int dl.$$

The total force, not surprisingly, is infinite, but the force per unit
length is

$$f = \frac{\mu_0}{2\pi}\frac{I_1 I_2}{d}.$$ (5.40)


If the currents are antiparallel (one up, one down), the force is repulsive 
— consistent again with the qualitative observations in Section 5.1.1. 
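
The results of Example 5.5 lend themselves to a numerical cross-check. The sketch below sums the Biot-Savart contributions along a finite straight segment and compares the total with Eq. 5.37, then evaluates the force per unit length of Eq. 5.40. The function names, the step count, and the sample currents and distances are assumptions chosen for illustration; the wire is taken to lie along a line with the field point a perpendicular distance s away, and l measures position along the wire from the foot of that perpendicular.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2, the value quoted in Eq. 5.35

def segment_field_numeric(I, s, l1, l2, n=20_000):
    """Sum Biot-Savart contributions from a straight wire running from l1 to l2."""
    dl = (l2 - l1) / n
    total = 0.0
    for i in range(n):
        l = l1 + (i + 0.5) * dl
        r2 = s * s + l * l               # squared distance from element to field point
        cos_theta = s / math.sqrt(r2)    # from s = r cos(theta)
        total += dl * cos_theta / r2
    return MU0 * I / (4 * math.pi) * total

def segment_field_formula(I, s, l1, l2):
    """Eq. 5.37, with the end angles obtained from l = s tan(theta)."""
    theta1, theta2 = math.atan2(l1, s), math.atan2(l2, s)
    return MU0 * I / (4 * math.pi * s) * (math.sin(theta2) - math.sin(theta1))

def force_per_length(I1, I2, d):
    """Eq. 5.40: force per unit length between two long parallel wires."""
    return MU0 * I1 * I2 / (2 * math.pi * d)

print(segment_field_numeric(2.0, 0.1, -0.3, 0.5))
print(segment_field_formula(2.0, 0.1, -0.3, 0.5))
print(force_per_length(1.0, 1.0, 1.0))   # two 1 A currents, 1 m apart
```

Making the segment very long in both directions drives the formula toward the infinite-wire value of Eq. 5.38, and the last line gives about 2 × 10⁻⁷ N/m for two 1 A currents a meter apart.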


Example 5.6. Find the magnetic field a distance z above the center of
a circular loop of radius R, which carries a steady current I (Fig. 5.20).


Solution. The field dB attributable to the segment dl′ points as shown.
As we integrate dl′ around the loop, dB sweeps out a cone. The
horizontal components cancel, and the vertical components combine,
to give

$$B(z) = \frac{\mu_0}{4\pi}\,I\int \frac{dl'}{\mathscr{r}^2}\cos\theta.$$

(Notice that dl′ and $\boldsymbol{\mathscr{r}}$ are perpendicular, in this case; the factor of cos θ
projects out the vertical component.) Now, cos θ and $\mathscr{r}^2$ are constants,
and ∫ dl′ is simply the circumference, 2πR, so

$$B(z) = \frac{\mu_0 I}{4\pi}\left(\frac{\cos\theta}{\mathscr{r}^2}\right)2\pi R = \frac{\mu_0 I}{2}\frac{R^2}{(R^2+z^2)^{3/2}}.$$ (5.41)
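
The cancellation of the horizontal components and the closed form of Eq. 5.41 can both be checked by brute force. The sketch below chops the loop into short elements, adds up the vector Biot-Savart contributions at a point on the axis, and compares the z component with Eq. 5.41. The names `loop_field` and `axis_formula`, the element count, and the sample numbers are assumptions for illustration; the loop is taken to lie in the xy plane with the current running counterclockwise.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2 (Eq. 5.35)

def loop_field(I, R, p, n=2000):
    """Biot-Savart sum (Eq. 5.34) for a circular loop in the xy plane, at point p."""
    bx = by = bz = 0.0
    dphi = 2 * math.pi / n
    for i in range(n):
        phi = (i + 0.5) * dphi
        dlx, dly = -R * math.sin(phi) * dphi, R * math.cos(phi) * dphi
        rx, ry, rz = p[0] - R * math.cos(phi), p[1] - R * math.sin(phi), p[2]
        r3 = (rx * rx + ry * ry + rz * rz) ** 1.5
        bx += dly * rz / r3                  # components of dl' x r (dl' has no z part)
        by += -dlx * rz / r3
        bz += (dlx * ry - dly * rx) / r3
    c = MU0 * I / (4 * math.pi)
    return (c * bx, c * by, c * bz)

def axis_formula(I, R, z):
    """Eq. 5.41: field on the axis of the loop."""
    return MU0 * I / 2 * R**2 / (R**2 + z**2) ** 1.5

I, R, z = 1.5, 0.05, 0.03                    # illustrative values (assumed)
Bx, By, Bz = loop_field(I, R, (0.0, 0.0, z))
print(Bx, By, Bz, axis_formula(I, R, z))
```

The x and y components come out at rounding-error level, while the z component matches Eq. 5.41. Because `loop_field` accepts any field point, it can also be used off the axis, where no simple closed form is available.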


For surface and volume currents, the Biot-Savart law becomes

$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int \frac{\mathbf{K}(\mathbf{r}')\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,da' \quad\text{and}\quad \mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int \frac{\mathbf{J}(\mathbf{r}')\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,d\tau',$$ (5.42)

respectively. You might be tempted to write down the corresponding
formula for a moving point charge, using the “dictionary” (Eq. 5.30):

$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\frac{q\,\mathbf{v}\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2},$$ (5.43)

but this is simply wrong.¹³ As I mentioned earlier, a point charge does not
constitute a steady current, and the Biot-Savart law, which only holds for
steady currents, does not correctly determine its field.

The superposition principle applies to magnetic fields just as it does to 
electric fields: if you have a collection of source currents, the net field is the 
(vector) sum of the fields due to each of them taken separately. 


Problem 5.8. 


(a) Find the magnetic field at the center of a square loop, which carries a steady current I. Let
R be the distance from center to side (Fig. 5.21).

(b) Find the field at the center of a regular n-sided polygon, carrying a steady current I. Again,
let R be the distance from the center to any side.

(c) Check that your formula reduces to the field at the center of a circular loop, in the limit
n → ∞.


Problem 5.9. Find the magnetic field at point P for each of the steady current configurations 
shown in Fig. 5.22. 


Fig. 5.21 Fig. 5.22


Problem 5.10. 


(a) Find the force on a square loop placed as shown in Fig. 5.23a, near an infinite straight wire.
Both the loop and the wire carry a steady current I.
(b) Find the force on the triangular loop in Fig. 5.23b. 


Fig. 5.23 


13 I say this loud and clear to emphasize the point of principle; actually, Eq. 5.43 is
approximately right for nonrelativistic charges (v ≪ c), under conditions where retardation can be
neglected (see Ex. 10.4).

Problem 5.11. Find the magnetic field at point P on the axis of a tightly wound solenoid
(helical coil) consisting of n turns per unit length wrapped around a cylindrical tube of radius a
and carrying current I (Fig. 5.24). Express your answer in terms of θ₁ and θ₂ (it’s tidiest that
way). Consider the turns to be essentially circular, and use the result of Ex. 5.6. What is the field
on the axis of an infinite solenoid (infinite in both directions)?




Problem 5.12. Use the result of Ex. 5.6 to calculate the magnetic field at the center of a 
uniformly charged spherical shell, of radius R and total charge Q, spinning at constant angular 
velocity w. 




Fig. 5.24 


Problem 5.13. Suppose you have two infinite straight-line charges λ, a distance d apart,
moving along at a constant speed v (Fig. 5.25). How great would v have to be in order for the
magnetic attraction to balance the electrical repulsion? Work out the actual number. Is this a
reasonable sort of speed?¹⁴


5.3 The Divergence and Curl of B 


5.3.1 Straight-Line Currents 


The magnetic field of an infinite straight wire is shown in Fig. 5.26 (the 
current is coming out of the page). At a glance, it is clear that this field 
has a nonzero curl (something you'll never see in an electrostatic field); let’s 
calculate it. 

According to Eq. 5.38, the integral of B around a circular path of radius
s, centered at the wire, is

$$\oint \mathbf{B}\cdot d\mathbf{l} = \oint \frac{\mu_0 I}{2\pi s}\,dl = \frac{\mu_0 I}{2\pi s}\oint dl = \mu_0 I.$$


14 If you’ve studied special relativity, you may be tempted to look for complexities in this
problem that are not really there: λ and v are both measured in the laboratory frame, and
this is ordinary electrostatics.


Notice that the answer is independent of s; that’s because B decreases at
the same rate as the circumference increases. In fact, it doesn’t have to be a
circle; any old loop that encloses the wire would give the same answer. For
if we use cylindrical coordinates (s, φ, z), with the current flowing along the
z-axis, B = (μ₀I/2πs) φ̂ and dl = ds ŝ + s dφ φ̂ + dz ẑ, so

$$\oint \mathbf{B}\cdot d\mathbf{l} = \frac{\mu_0 I}{2\pi}\oint \frac{1}{s}\,s\,d\phi = \frac{\mu_0 I}{2\pi}\int_0^{2\pi} d\phi = \mu_0 I.$$

This assumes the loop encircles the wire exactly once; if it went around
twice, then φ would run from 0 to 4π, and if it didn’t enclose the wire at all,
then φ would go from φ₁ to φ₂ and back again, with ∫ dφ = 0 (Fig. 5.27).
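
The claim that the loop need not be centered on the wire, and that a loop missing the wire gives zero, is easy to test numerically. The sketch below integrates B · dl around a circle whose center is displaced from the wire, using the infinite-wire field of Eq. 5.39 written in Cartesian components. The function name `circulation`, the step count, and the particular loop positions are assumptions for illustration; the wire is taken to run along the z-axis.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2 (Eq. 5.35)

def circulation(I, cx, cy, radius, n=20_000):
    """Line integral of B around a circle centered at (cx, cy), wire along the z-axis."""
    dt = 2 * math.pi / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        x = cx + radius * math.cos(t)
        y = cy + radius * math.sin(t)
        dx = -radius * math.sin(t) * dt          # components of dl along the circle
        dy = radius * math.cos(t) * dt
        s2 = x * x + y * y
        Bx = -MU0 * I * y / (2 * math.pi * s2)   # (mu0 I / 2 pi s) phi_hat, phi_hat = (-y, x)/s
        By = MU0 * I * x / (2 * math.pi * s2)
        total += Bx * dx + By * dy
    return total

I = 3.0                                          # illustrative current (assumed)
print(circulation(I, 0.3, 0.0, 1.0) / (MU0 * I))   # off-center loop enclosing the wire
print(circulation(I, 3.0, 0.0, 1.0) / (MU0 * I))   # loop that misses the wire
```

The first ratio comes out equal to 1 and the second equal to 0, to within rounding, in line with the argument above.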

Now suppose we have a bundle of straight wires. Each wire that passes
through our loop contributes μ₀I, and those outside contribute nothing
(Fig. 5.28). The line integral will then be

$$\oint \mathbf{B}\cdot d\mathbf{l} = \mu_0 I_{\text{enc}},$$ (5.44)

where I_enc stands for the total current enclosed by the integration path. If
the flow of charge is represented by a volume current density J, the enclosed
current is

$$I_{\text{enc}} = \int \mathbf{J}\cdot d\mathbf{a},$$ (5.45)

with the integral taken over any surface bounded by the loop. Applying
Stokes’ theorem to Eq. 5.44, then,

$$\int (\nabla\times\mathbf{B})\cdot d\mathbf{a} = \mu_0\int \mathbf{J}\cdot d\mathbf{a},$$

and hence (since this must hold for any loop)

$$\nabla\times\mathbf{B} = \mu_0\mathbf{J}.$$ (5.46)


With minimal labor, we have actually obtained the general formula for 
the curl of B. But our derivation is seriously flawed by the restriction to 
infinite straight-line currents (and combinations thereof). Most current 
configurations cannot be constructed out of infinite straight wires, and we 
have no right to assume that Eq. 5.46 applies to them. So the next section 
is devoted to a formal derivation of the divergence and curl of B, starting 
from the Biot-Savart law itself. 


5.3.2 The Divergence and Curl of B 


The Biot-Savart law for the general case of a volume current reads

$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int \frac{\mathbf{J}(\mathbf{r}')\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,d\tau'.$$ (5.47)

This formula gives the magnetic field at a point r = (x, y, z) in terms of an
integral over the current distribution J(x′, y′, z′) (Fig. 5.29). It is best to be
absolutely explicit at this stage:

B is a function of (x, y, z),
J is a function of (x′, y′, z′),
$\boldsymbol{\mathscr{r}}$ = (x − x′) x̂ + (y − y′) ŷ + (z − z′) ẑ,
dτ′ = dx′ dy′ dz′.

The integration is over the primed coordinates; the divergence and the curl
of B are with respect to the unprimed coordinates.

Applying the divergence to Eq. 5.47, we obtain

$$\nabla\cdot\mathbf{B} = \frac{\mu_0}{4\pi}\int \nabla\cdot\left(\mathbf{J}\times\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right)d\tau'.$$ (5.48)


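Invoking product rule number 6,

$$\nabla\cdot\left(\mathbf{J}\times\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) = \frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\cdot(\nabla\times\mathbf{J}) - \mathbf{J}\cdot\left(\nabla\times\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right).$$ (5.49)

But ∇ × J = 0, because J doesn’t depend on the unprimed variables, while
$\nabla\times(\hat{\boldsymbol{\mathscr{r}}}/\mathscr{r}^2) = 0$ (Prob. 1.63b), so

$$\nabla\cdot\mathbf{B} = 0.$$ (5.50)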


Evidently, the divergence of the magnetic field is zero. 
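Applying the curl to Eq. 5.47, we obtain

$$\nabla\times\mathbf{B} = \frac{\mu_0}{4\pi}\int \nabla\times\left(\mathbf{J}\times\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right)d\tau'.$$ (5.51)

Again, our strategy is to expand the integrand, using the appropriate
product rule, in this case number 8:

$$\nabla\times\left(\mathbf{J}\times\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) = \mathbf{J}\left(\nabla\cdot\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) - (\mathbf{J}\cdot\nabla)\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}.$$ (5.52)

(I have dropped terms involving derivatives of J, because J does not depend
on x, y, z.) The second term on the right integrates to zero, as we’ll see in
the next paragraph. The first term involves the divergence we were at pains
to calculate in Chapter 1 (Eq. 1.100):

$$\nabla\cdot\left(\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right) = 4\pi\,\delta^3(\boldsymbol{\mathscr{r}}).$$ (5.53)

Thus

$$\nabla\times\mathbf{B} = \frac{\mu_0}{4\pi}\int \mathbf{J}(\mathbf{r}')\,4\pi\,\delta^3(\mathbf{r}-\mathbf{r}')\,d\tau' = \mu_0\mathbf{J}(\mathbf{r}),$$

which confirms that Eq. 5.46 is not restricted to straight-line currents, but
holds quite generally in magnetostatics.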

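To complete the argument, however, we must check that the second term
in Eq. 5.52 integrates to zero. Because the derivative acts only on $\hat{\boldsymbol{\mathscr{r}}}/\mathscr{r}^2$, we
can switch from ∇ to ∇′ at the cost of a minus sign:¹⁵

$$-(\mathbf{J}\cdot\nabla)\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2} = (\mathbf{J}\cdot\nabla')\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}.$$ (5.54)

The x component, in particular, is

$$(\mathbf{J}\cdot\nabla')\left(\frac{x-x'}{\mathscr{r}^3}\right) = \nabla'\cdot\left[\frac{(x-x')}{\mathscr{r}^3}\mathbf{J}\right] - \left(\frac{x-x'}{\mathscr{r}^3}\right)(\nabla'\cdot\mathbf{J})$$

(using product rule 5). Now, for steady currents the divergence of J is zero
(Eq. 5.33), so

$$\left[-(\mathbf{J}\cdot\nabla)\frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\right]_x = \nabla'\cdot\left[\frac{(x-x')}{\mathscr{r}^3}\mathbf{J}\right],$$

and therefore this contribution to the integral (Eq. 5.51) can be written

$$\int_V \nabla'\cdot\left[\frac{(x-x')}{\mathscr{r}^3}\mathbf{J}\right]d\tau' = \oint_S \frac{(x-x')}{\mathscr{r}^3}\,\mathbf{J}\cdot d\mathbf{a}'.$$ (5.55)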


(The reason for switching from ∇ to ∇′ was to permit this use of the
divergence theorem.) But what region are we integrating over? Well, it’s the
volume that appears in the Biot-Savart law (Eq. 5.47): large enough, that
is, to include all the current. You can make it bigger than that, if you like;
J = 0 out there anyway, so it will add nothing to the integral. The essential
point is that on the boundary the current is zero (all current is safely inside)
and hence the surface integral (Eq. 5.55) vanishes.¹⁶


15 The point here is that $\boldsymbol{\mathscr{r}}$ depends only on the difference between the coordinates; note that
(∂/∂x) f(x − x′) = −(∂/∂x′) f(x − x′).
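
Both results of this section can be spot-checked on a concrete current distribution. The sketch below evaluates the Biot-Savart field of a circular loop at an arbitrary point away from the wire (where J = 0) and estimates the divergence and curl there by central differences; both should vanish up to discretization error. The choice of a circular loop, the names `loop_field` and `div_and_curl`, the step size h, and the test point are assumptions made for illustration.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2 (Eq. 5.35)

def loop_field(p, I=1.0, R=1.0, n=2000):
    """Biot-Savart sum for a circular loop in the xy plane, evaluated at point p."""
    bx = by = bz = 0.0
    dphi = 2 * math.pi / n
    for i in range(n):
        phi = (i + 0.5) * dphi
        dlx, dly = -R * math.sin(phi) * dphi, R * math.cos(phi) * dphi
        rx, ry, rz = p[0] - R * math.cos(phi), p[1] - R * math.sin(phi), p[2]
        r3 = (rx * rx + ry * ry + rz * rz) ** 1.5
        bx += dly * rz / r3                  # components of dl' x r (dl' has no z part)
        by += -dlx * rz / r3
        bz += (dlx * ry - dly * rx) / r3
    c = MU0 * I / (4 * math.pi)
    return (c * bx, c * by, c * bz)

def div_and_curl(field, p, h=1e-4):
    """Central-difference estimates of div B and curl B at point p."""
    def d(i, j):                             # partial derivative dB_i / dx_j
        hi, lo = list(p), list(p)
        hi[j] += h
        lo[j] -= h
        return (field(hi)[i] - field(lo)[i]) / (2 * h)
    div = d(0, 0) + d(1, 1) + d(2, 2)
    curl = (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))
    return div, curl

point = (0.3, 0.2, 0.4)                      # a location where J = 0 (assumed)
B = loop_field(point)
div, curl = div_and_curl(loop_field, point)
print(B, div, curl)
```

With R = 1, the divergence and each curl component should be smaller than |B| by many orders of magnitude. A point on the wire itself would not work here, since the line current makes the field singular there.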


5.3.3 Ampère’s Law


The equation for the curl of B,

$$\nabla\times\mathbf{B} = \mu_0\mathbf{J},$$ (5.56)

is called Ampère’s law (in differential form). It can be converted to integral
form by the usual device of applying one of the fundamental theorems, in
this case Stokes’ theorem:

$$\int (\nabla\times\mathbf{B})\cdot d\mathbf{a} = \oint \mathbf{B}\cdot d\mathbf{l} = \mu_0\int \mathbf{J}\cdot d\mathbf{a}.$$

Now, ∫ J · da is the total current passing through the surface (Fig. 5.30),
which we call I_enc (the current enclosed by the Ampèrian loop). Thus

$$\oint \mathbf{B}\cdot d\mathbf{l} = \mu_0 I_{\text{enc}}.$$ (5.57)


This is the integral version of Ampère’s law; it generalizes Eq. 5.44 to
arbitrary steady currents. Notice that Eq. 5.57 inherits the sign ambiguity of
Stokes’ theorem (Section 1.3.5): Which way around the loop am I supposed
to go? And which direction through the surface corresponds to a “positive”
current? The resolution, as always, is the right-hand rule: if the fingers of
your right hand indicate the direction of integration around the boundary,
then your thumb defines the direction of a positive current.

Just as the Biot-Savart law plays a role in magnetostatics that Coulomb’s
law assumed in electrostatics, so Ampère’s plays the part of Gauss’s:

electrostatics: Coulomb ⇒ Gauss,
magnetostatics: Biot-Savart ⇒ Ampère.


16 If J itself extends to infinity (as in the case of an infinite straight wire), the surface integral
is still typically zero, though the analysis calls for greater care.
In particular, for currents with appropriate symmetry, Ampère’s law in
integral form offers a lovely and extraordinarily efficient way of calculating
the magnetic field.


Example 5.7. Find the magnetic field a distance s from a long straight
wire (Fig. 5.31), carrying a steady current I (the same problem we
solved in Ex. 5.5, using the Biot-Savart law).


Solution. We know the direction of B is “circumferential,” circling
around the wire as indicated by the right-hand rule. By symmetry, the
magnitude of B is constant around an Ampèrian loop of radius s,
centered on the wire. So Ampère’s law gives

$$\oint \mathbf{B}\cdot d\mathbf{l} = B\oint dl = B\,2\pi s = \mu_0 I_{\text{enc}} = \mu_0 I,$$

or

$$B = \frac{\mu_0 I}{2\pi s}.$$

This is the same answer we got before (Eq. 5.38), but it was obtained
this time with far less effort.


Example 5.8. Find the magnetic field of an infinite uniform surface
current K = K x̂, flowing over the xy plane (Fig. 5.32).


Solution. First of all, what is the direction of B? Could it have any 
x component? No: A glance at the Biot-Savart law (Eq. 5.42) reveals 
that B is perpendicular to K. Could it have a z component? No again. 
You could confirm this by noting that any vertical contribution from a 
filament at +y is canceled by the corresponding filament at —y. But 
there is a nicer argument: Suppose the field pointed away from the 
plane. By reversing the direction of the current, I could make it point 
toward the plane (in the Biot—Savart law, changing the sign of the 
current switches the sign of the field). But the z component of B cannot 
possibly depend on the direction of the current in the xy plane. (Think 
about it!) So B can only have a y component, and a quick check with 
your right hand should convince you that it points to the left above the 
plane and to the right below it. 


With this in mind, we draw a rectangular Ampèrian loop as shown
in Fig. 5.32, parallel to the yz plane and extending an equal distance
above and below the surface. Applying Ampère’s law,

$$\oint \mathbf{B}\cdot d\mathbf{l} = 2Bl = \mu_0 I_{\text{enc}} = \mu_0 K l$$

(one Bl comes from the top segment and the other from the bottom),
so B = (μ₀/2)K, or, more precisely,

$$\mathbf{B} = \begin{cases} +(\mu_0/2)K\,\hat{\mathbf{y}} & \text{for } z < 0,\\ -(\mu_0/2)K\,\hat{\mathbf{y}} & \text{for } z > 0.\end{cases}$$ (5.58)


Notice that the field is independent of the distance from the plane, just 
like the electric field of a uniform surface charge (Ex. 2.5). 
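
The filament picture used in the symmetry argument also gives a direct numerical route to Eq. 5.58. The sketch below treats the sheet as a row of long straight wires, each carrying current K dy′ in the x direction, and adds up the y components of their fields (Eq. 5.39) at a point on the z-axis. The function name `sheet_field_y`, the finite half-width standing in for an infinite sheet, and the sample numbers are assumptions for illustration, so the agreement is only approximate.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2 (Eq. 5.35)

def sheet_field_y(K, z, half_width=1000.0, n=100_000):
    """y component of B at (0, 0, z) from filaments between y' = -half_width and +half_width."""
    dy = 2 * half_width / n
    total = 0.0
    for i in range(n):
        yp = -half_width + (i + 0.5) * dy    # position y' of this filament
        dI = K * dy                          # current carried by the filament
        d2 = yp * yp + z * z                 # squared distance from filament to field point
        # Infinite-wire field, direction x_hat cross (displacement); keep the y component.
        total += -MU0 * dI * z / (2 * math.pi * d2)
    return total

K = 5.0                                      # illustrative surface current density (assumed)
for z in (0.5, 2.0, -0.5):
    print(z, sheet_field_y(K, z) / (MU0 * K / 2))
```

The printed ratios come out close to −1 above the sheet and +1 below it, nearly independent of the height. The small shortfall comes from cutting the sheet off at a finite width, and it shrinks as `half_width` grows.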


Example 5.9. Find the magnetic field of a very long solenoid, consisting
of n closely wound turns per unit length on a cylinder of radius R,
each carrying a steady current I (Fig. 5.33). [The point of making the
windings so close is that one can then pretend each turn is circular. If
this troubles you (after all, there is a net current I in the direction of
the solenoid’s axis, no matter how tight the winding), picture instead
a sheet of aluminum foil wrapped around the cylinder, carrying the
equivalent uniform surface current K = nI (Fig. 5.34). Or make a
double winding, going up to one end and then (always in the same
sense) going back down again, thereby eliminating the net longitudinal
current. But, in truth, this is all unnecessary fastidiousness, for the
field inside a solenoid is huge (relatively speaking), and the field of the
longitudinal current is at most a tiny refinement.]


Solution. First of all, what is the direction of B? Could it have a
radial component? No. For suppose B_s were positive; if we reversed
the direction of the current, B_s would then be negative. But switching
I is physically equivalent to turning the solenoid upside down, and that
certainly should not alter the radial field. How about a “circumferential”
(azimuthal) component? No. For B_φ would be constant around
an Ampèrian loop concentric with the solenoid (Fig. 5.35), and hence

$$\oint \mathbf{B}\cdot d\mathbf{l} = B_\phi(2\pi s) = \mu_0 I_{\text{enc}} = 0,$$

since the loop encloses no current.


So the magnetic field of an infinite, closely wound solenoid runs
parallel to the axis. From the right-hand rule, we expect that it points
upward inside the solenoid and downward outside. Moreover, it
certainly approaches zero as you go very far away. With this in mind, let’s
apply Ampère’s law to the two rectangular loops in Fig. 5.36. Loop 1
lies entirely outside the solenoid, with its sides at distances a and b from
the axis:

$$\oint \mathbf{B}\cdot d\mathbf{l} = [B(a) - B(b)]L = \mu_0 I_{\text{enc}} = 0,$$

so

$$B(a) = B(b).$$

Evidently, the field outside does not depend on the distance from the
axis. But we agreed that it goes to zero for large s. It must therefore be
zero everywhere! (This astonishing result can also be obtained directly
from the Biot-Savart law, of course, but it’s much more difficult. See
Prob. 5.49.)

As for loop 2, which is half inside and half outside, Ampère’s law
gives

$$\oint \mathbf{B}\cdot d\mathbf{l} = BL = \mu_0 I_{\text{enc}} = \mu_0 n I L,$$

where B is the field inside the solenoid. (The right-hand side of the
loop contributes nothing, since B = 0 out there.) Conclusion:

$$\mathbf{B} = \begin{cases} \mu_0 n I\,\hat{\mathbf{z}} & \text{inside the solenoid},\\ 0 & \text{outside the solenoid}.\end{cases}$$ (5.59)


Notice that the field inside is uniform—it doesn’t depend on the distance 
from the axis. In this sense the solenoid is to magnetostatics what 
the parallel-plate capacitor is to electrostatics: a simple device for 
producing strong uniform fields. 
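
One way to see Eq. 5.59 emerge without Ampère’s law is to stack up circular loops and add their on-axis fields from Eq. 5.41. The sketch below does this for a long but finite solenoid and compares the result with μ₀nI at two points on the axis. The function name `solenoid_axis_field`, the finite length standing in for an infinite solenoid, and the sample numbers are assumptions for illustration, and the check covers only points on the axis.

```python
import math

MU0 = 4 * math.pi * 1e-7   # N/A^2 (Eq. 5.35)

def solenoid_axis_field(n_turns, I, R, half_length, z=0.0, steps=100_000):
    """Add up Eq. 5.41 for rings of current n*I*dz' stacked from -half_length to +half_length."""
    dz = 2 * half_length / steps
    total = 0.0
    for i in range(steps):
        zp = -half_length + (i + 0.5) * dz   # position z' of this ring
        ring_current = n_turns * I * dz      # n turns per unit length, each carrying I
        dist2 = R * R + (z - zp) ** 2
        total += MU0 * ring_current / 2 * R * R / dist2 ** 1.5
    return total

n_turns, I, R, half_length = 1000.0, 2.0, 0.01, 5.0   # illustrative values (assumed)
ideal = MU0 * n_turns * I                             # Eq. 5.59, inside the solenoid
print(solenoid_axis_field(n_turns, I, R, half_length, z=0.0) / ideal)
print(solenoid_axis_field(n_turns, I, R, half_length, z=2.0) / ideal)
```

Both ratios come out very close to 1, which is consistent with a uniform interior field. Near the ends of a finite coil the ratio drops, which is the end effect the infinite-solenoid idealization leaves out.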


Like Gauss’s law, Ampère’s law is always true (for steady currents), but
it is not always useful. Only when the symmetry of the problem enables
you to pull B outside the integral ∮ B · dl can you calculate the magnetic
field from Ampère’s law. When it does work, it’s by far the fastest method;
when it doesn’t, you have to fall back on the Biot-Savart law. The current
configurations that can be handled by Ampère’s law are:


(1) infinite straight lines (prototype: Ex. 5.7), 
(2) infinite planes (prototype: Ex. 5.8), 

(3) infinite solenoids (prototype: Ex. 5.9), 
(4) toroids (prototype: Ex. 5.10). 


The last of these is a surprising and elegant application of Ampére’s 
law. As in Exs. 5.8 and 5.9, the hard part is figuring out the direction 
of the field (which we will now have done, once and for all, for each of 
the four geometries); the actual application of Ampére’s law takes only 
one line. 


Example 5.10. A toroidal coil consists of a circular ring, or “donut,” 
around which a long wire is wrapped (Fig. 5.37). The winding is 
uniform and tight enough so that each turn can be considered a 
plane closed loop. The cross-sectional shape of the coil is immaterial. 
I made it rectangular in Fig. 5.37 for the sake of simplicity, but 
it could just as well be circular or even some weird asymmetrical 
form, as in Fig. 5.38, as long as the shape remains the same 
all the way around the ring. In that case, the magnetic field of 
the toroid is circumferential at all points, both inside and outside 
the coil. 


Proof. According to the Biot-Savart law, the field at r due to
the current element at r′ is

$$d\mathbf{B} = \frac{\mu_0}{4\pi}\frac{\mathbf{I}\times\boldsymbol{\mathscr{r}}}{\mathscr{r}^3}\,dl'.$$

We may as well put r in the xz plane (Fig. 5.38), so its Cartesian
components are (x, 0, z), while the source coordinates are

$$\mathbf{r}' = (s'\cos\phi',\; s'\sin\phi',\; z').$$

Then

$$\boldsymbol{\mathscr{r}} = (x - s'\cos\phi',\; -s'\sin\phi',\; z - z').$$

Since the current has no φ component, I = I_s ŝ + I_z ẑ, or (in
Cartesian coordinates)

$$\mathbf{I} = (I_s\cos\phi',\; I_s\sin\phi',\; I_z).$$

Accordingly,

$$\mathbf{I}\times\boldsymbol{\mathscr{r}} = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ I_s\cos\phi' & I_s\sin\phi' & I_z \\ (x - s'\cos\phi') & (-s'\sin\phi') & (z - z') \end{vmatrix}$$

$$= \left[\sin\phi'\left(I_s(z - z') + s' I_z\right)\right]\hat{\mathbf{x}} + \left[I_z(x - s'\cos\phi') - I_s\cos\phi'(z - z')\right]\hat{\mathbf{y}} + \left[-I_s x\sin\phi'\right]\hat{\mathbf{z}}.$$

But there is a symmetrically located current element at r″, with
the same s′, the same $\mathscr{r}$, the same dl′, the same I_s, and the same
I_z, but negative φ′ (Fig. 5.38). Because sin φ′ changes sign, the x̂
and ẑ contributions from r′ and r″ cancel, leaving only a ŷ term.
Thus the field at r is in the ŷ direction, and in general the field
points in the φ̂ direction. □


Now that we know the field is circumferential, determining its
magnitude is ridiculously easy. Just apply Ampère’s law to a circle of
radius s about the axis of the toroid:

$$B\,2\pi s = \mu_0 I_{\text{enc}},$$

and hence

$$\mathbf{B}(\mathbf{r}) = \begin{cases} \dfrac{\mu_0 N I}{2\pi s}\,\hat{\boldsymbol{\phi}} & \text{for points inside the coil},\\[1ex] 0 & \text{for points outside the coil},\end{cases}$$ (5.60)


where N is the total number of turns. 
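
The pairing argument in the proof of Example 5.10 can be spot-checked with a few lines of arithmetic. The sketch below builds the cross product of I with the separation vector for a current element at azimuth φ′ and for its mirror partner at −φ′, then adds the two; since both elements sit at the same distance from the field point, the common 1/𝓇³ factor is left out. The function names and every numerical value are assumptions chosen for illustration.

```python
import math

def cross(a, b):
    """Cartesian cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def element_term(x, z, s_p, phi_p, z_p, I_s, I_z):
    """I cross (separation vector) for a source at (s', phi', z') and field point (x, 0, z)."""
    I_vec = (I_s * math.cos(phi_p), I_s * math.sin(phi_p), I_z)
    sep = (x - s_p * math.cos(phi_p), -s_p * math.sin(phi_p), z - z_p)
    return cross(I_vec, sep)

# Field point and source parameters (all values assumed for the check).
x, z = 1.5, 0.2
s_p, phi_p, z_p = 1.0, 0.7, -0.1
I_s, I_z = 0.4, 1.3

plus = element_term(x, z, s_p, +phi_p, z_p, I_s, I_z)
minus = element_term(x, z, s_p, -phi_p, z_p, I_s, I_z)
pair = tuple(a + b for a, b in zip(plus, minus))
print(pair)   # x and z entries cancel; only the y entry survives
```

Only the middle entry of `pair` is appreciably different from zero, matching the conclusion that the field at a point in the xz plane is along ŷ.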
Problem 5.14. A steady current I flows down a long cylindrical wire of radius a (Fig. 5.39).
Find the magnetic field, both inside and outside the wire if:


(a) the current is uniformly distributed over the surface of the wire; 
(b) the current is distributed in such a way that J is proportional to s, the distance from the 
axis. 


Problem 5.15. A thick slab extending from z = −a to z = +a (and infinite in the x and
y directions) carries a uniform volume current J = J x̂ (Fig. 5.40). Find the magnetic field, as
a function of z, both inside and outside the slab.


Fig. 5.39 Fig. 5.40


Problem 5.16. Consider the current sheet in Ex. 5.8. A charged particle moving in the xz plane
above the current sheet (where the field is uniform) executes cyclotron motion. But what if
its trajectory crosses the xy plane? Describe the resulting motion, and calculate the average
(“drift”) velocity.¹⁷


Problem 5.17. Two long coaxial solenoids each carry current I, but in opposite directions, as
shown in Fig. 5.41. The inner solenoid (radius a) has n₁ turns per unit length, and the outer one
(radius b) has n₂. Find B in each of the following three regions: (i) inside the inner solenoid,
(ii) between them, and (iii) outside both.


Fig. 5.41 Fig. 5.42


Problem 5.18. A large parallel-plate capacitor with uniform surface charge σ on the upper
plate and −σ on the lower plate is moving with a constant speed v, as shown in Fig. 5.42.


(a) Find the magnetic field between the plates and also above and below them.
(b) Find the magnetic force per unit area on the upper plate, including its direction.
(c) At what speed v would the magnetic force balance the electrical force?¹⁸


17 For commentary see H. Essén and A. B. Nordmark, Eur. Phys. J. D 70, 198 (2016). 
18 See footnote to Prob. 5.13. 
Problem 5.19. Show that the magnetic field of an infinite solenoid runs parallel to the axis,
regardless of the cross-sectional shape of the coil, as long as that shape is constant along the 
length of the solenoid. What is the magnitude of the field, inside and outside such a coil? Show 
that the toroid field (Eq. 5.60) reduces to the solenoid field, when the (large) radius of the donut 
is so much greater than the small radius that a segment can be considered essentially straight. 


Problem 5.20. In calculating the current enclosed by an Ampèrian loop, one must, in general,
evaluate an integral of the form

$$I_{\text{enc}} = \int_S \mathbf{J}\cdot d\mathbf{a}.$$


Trouble is, there are infinitely many surfaces that share the same boundary line. Which one are 
we supposed to use? 


5.3.4 Comparison of Magnetostatics and Electrostatics 


The divergence and curl of the electrostatic field are

$$\nabla\cdot\mathbf{E} = \rho/\epsilon_0 \quad \text{(Gauss’s law)},$$
$$\nabla\times\mathbf{E} = 0 \quad \text{(no name)}.$$

These are Maxwell’s equations for electrostatics. Together with the
boundary condition E → 0 far from all charges,¹⁹ Maxwell’s equations determine
the field, if the source charge density ρ is given; they contain essentially the
same information as Coulomb’s law plus the principle of superposition.
The divergence and curl of the magnetostatic field are

$$\nabla\cdot\mathbf{B} = 0 \quad \text{(no name)},$$
$$\nabla\times\mathbf{B} = \mu_0\mathbf{J} \quad \text{(Ampère’s law)}.$$

These are Maxwell’s equations for magnetostatics. Again, together with
the boundary condition B → 0 far from all currents (or symmetry
considerations), Maxwell’s equations determine the magnetic field; they are
equivalent to the Biot-Savart law (plus superposition). Maxwell’s equations
and the force law

$$\mathbf{F} = Q(\mathbf{E} + \mathbf{v}\times\mathbf{B})$$


constitute the most elegant formulation of electrostatics and magnetostatics. 

Intrinsically, electric forces are enormously stronger than magnetic ones. 
That’s not something obvious from the structure of the theory; it has to 
do with the sizes of the constants ε₀ and μ₀, and the relatively modest
particle speeds typically involved. In general, it is only when both the source 
charges and the test charge are moving at velocities comparable to the speed 


19 In those artificial problems where the charge (or current) extends to infinity — infinite 
planes, for example — symmetry considerations can sometimes take the place of boundary 
conditions. 
of light that the magnetic force approaches the electric force in strength.
(Problems 5.13 and 5.18 illustrate this rule.) How is it, then, that we notice 
magnetic effects at all? The answer is that both in the production of a 
magnetic field (Biot-Savart) and in its detection (Lorentz), it is the current 
that matters, and we can compensate for a smallish velocity by pouring 
prodigious amounts of charge down the wire. Ordinarily, this charge would 
simultaneously generate so large an electric force as to swamp the magnetic 
one. But if we arrange to keep the wire neutral, by embedding in it an equal 
quantity of opposite charge at rest, the electric field cancels out, leaving the 
magnetic force to stand alone. It sounds very elaborate, but of course this 
is precisely what happens in an ordinary current-carrying wire (Prob. 5.21). 
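To get a feel for the numbers, here is a minimal sketch in Python (the language, function name, and sample speeds are illustrative choices, not part of the text). It assumes the standard result for two parallel line charges of the same density, a fixed distance apart, both moving along their length at speed v: the electric repulsion per unit length is $\lambda^2/(2\pi\epsilon_0 d)$ and the magnetic attraction is $\mu_0\lambda^2 v^2/(2\pi d)$, so their ratio is $\mu_0\epsilon_0 v^2 = (v/c)^2$.

```python
import math

MU_0 = 4e-7 * math.pi            # permeability of free space, N/A^2 (classical value)
EPS_0 = 8.8541878128e-12         # permittivity of free space, C^2/(N m^2)
C = 1.0 / math.sqrt(MU_0 * EPS_0)  # speed of light implied by the two constants


def magnetic_to_electric_ratio(v):
    """F_mag / F_elec for two parallel line charges moving together at speed v."""
    return MU_0 * EPS_0 * v ** 2   # the same thing as (v / c)^2


# Sample speeds chosen only for illustration.
for label, v in [("slow drift, 1e-4 m/s", 1e-4),
                 ("jet airliner, 250 m/s", 250.0),
                 ("one tenth of c", 0.1 * C)]:
    print(f"{label:24s} ratio = {magnetic_to_electric_ratio(v):.3e}")
```

The ratio only reaches 1 when v equals c, which is the sense in which magnetic forces are intrinsically weak at ordinary speeds.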

The electric field diverges away from a (positive) charge; the magnetic 
field curls around a current (Fig. 5.43). There are no stationary point sources 
for B (no magnetic analogs to electric charge). This is the physical content
of the statement $\nabla \cdot \mathbf{B} = 0$. Coulomb and others believed that magnetism is
produced by magnetic charges (magnetic monopoles, as we would now call 
them), and in some older books you will still find references to a magnetic 
version of Coulomb’s law, giving the force of attraction or repulsion 
between “north” and “south” poles. It was Ampère who first speculated that 
all magnetic effects are attributable to electric charges in motion (currents). 
As far as we know, Ampère was right, though people still search for 
magnetic monopoles (they are obviously pretty rare, or somebody would 
have found one by now²⁰), and some recent elementary particle theories
actually require their existence. In classical electrodynamics, however, B is 
divergenceless, and there are no magnetic monopoles. It takes (moving) 
electric charges to produce a magnetic field, and it takes a moving electric 
charge to “feel” a magnetic field. 

In electrostatics we found it illuminating to connect up the field vectors 
to form field lines; the tangent to the field line points in the direction of the 


20 An apparent detection (B. Cabrera, Phys. Rev. Lett. 48, 1378 (1982)) has never been 
reproduced — and not for want of trying. For a delightful brief history of ideas about 
magnetism, read Chap. 1 in D. C. Mattis, The Theory of Magnetism, Harper & Row, New 
York (1965). 


field, and the density of field lines (the number per unit cross-sectional area) 
is proportional to the magnitude of the field. The same device helps us to 
visualize magnetic fields; thus the fields in Fig. 5.43 become the field lines
in Fig. 5.44. Electric field lines originate on positive charges and terminate 
on negative ones (either that, or they extend out to infinity). Magnetic field 
lines do not begin or end anywhere. In the examples we have encountered 
they form closed loops (or extend to infinity). The direction of the field 
line still tells you the direction of the field ... but is the density of magnetic 
field lines proportional to |B|? Yes it is — or rather, we can always choose to 
make it so. 


Proof. Imagine a flux tube: a bundle of field lines, sliced off
perpendicularly at the two ends (Fig. 5.45). The tube is skinny enough
that the field is essentially constant over each end. The surface integral
of B over this tube is

\[ \oint_{\mathcal{S}} \mathbf{B} \cdot d\mathbf{a} = B_2 a_2 - B_1 a_1 = \int_{\mathcal{V}} (\nabla \cdot \mathbf{B})\, d\tau = 0. \]

Thus the flux of B is constant along a given flux tube. If at the “start”
(end 1) we have drawn a number of field lines proportional to the flux
($n = \kappa B_1 a_1$, or $B_1 = n/(\kappa a_1)$), then B is proportional to the number of
field lines per unit area all along the tube. Doing the same for every
flux tube (with the same proportionality factor $\kappa$), the density of field
lines represents the strength of the field everywhere. □




But there’s a dirty little secret about magnetic field lines that you’re not 
supposed to talk about in public. In many simple problems (the infinite wire, 
the circular current ring, the solenoid, the toroidal coil, ...) the field lines 
form closed loops. Indeed, you will often read that magnetic fields always 
form closed loops. Unfortunately, this is the exception, not the rule, and it 
depends in each case on a special symmetry of the source. Consider, for 


instance, a circular ring carrying a steady current $I$: the field lines form
closed oval-shaped loops that circle the wire (Fig. 5.46a). 

But suppose we now introduce a long straight wire along the z-axis, with 
an adjustable current $I'$. This will generate an azimuthal field (Fig. 5.46b)
that, combined with the field of the ring, yields a pattern of helical field 
lines that wind around the ring like a slinky (Fig. 5.47). Except for certain 
very special values of the currents, the “helix” will not close on itself 
after circling the ring. Indeed, it will wind around and around the ring, 
weaving a donut-shaped sheath — all from a single field line (see Prob. 5.68). 
So much for the notion that the density of field lines represents the 
magnitude of the field! And this is not an unusual case; it’s the typical 
behavior of magnetic field lines.²¹ What’s gone wrong, and can it be
fixed? 
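The ring-plus-wire configuration can be explored numerically. The following is a rough sketch, not a definitive implementation: it approximates the ring by a 200-sided polygon, sums the Biot-Savart contributions directly, adds the azimuthal field of the straight wire, and follows a field line with a fixed-step Runge-Kutta integrator. The ring radius (1), the currents, the starting point, the step size, and all function names are assumptions made for illustration.

```python
import math

MU0_OVER_4PI = 1e-7  # mu_0 / (4 pi) in SI units


def ring_field(p, I_ring, R=1.0, n_seg=200):
    """Biot-Savart field at p from a ring of radius R in the xy plane (polygon approximation)."""
    bx = by = bz = 0.0
    dphi = 2.0 * math.pi / n_seg
    for k in range(n_seg):
        phi = (k + 0.5) * dphi
        sx, sy = R * math.cos(phi), R * math.sin(phi)   # segment midpoint
        dlx, dly = -sy * dphi, sx * dphi                # segment vector dl'
        rx, ry, rz = p[0] - sx, p[1] - sy, p[2]         # from source to field point
        r3 = (rx * rx + ry * ry + rz * rz) ** 1.5
        bx += dly * rz / r3                             # components of dl' x r / r^3
        by += -dlx * rz / r3
        bz += (dlx * ry - dly * rx) / r3
    f = MU0_OVER_4PI * I_ring
    return (f * bx, f * by, f * bz)


def wire_field(p, I_wire):
    """Field of an infinite wire on the z-axis: magnitude mu_0 I / (2 pi s), azimuthal."""
    s2 = p[0] ** 2 + p[1] ** 2          # must be nonzero: keep p off the z-axis
    f = 2.0 * MU0_OVER_4PI * I_wire / s2
    return (-f * p[1], f * p[0], 0.0)


def direction(p, I_ring, I_wire):
    """Unit vector along the total field at p."""
    b = [u + w for u, w in zip(ring_field(p, I_ring), wire_field(p, I_wire))]
    norm = math.sqrt(sum(c * c for c in b))
    return [c / norm for c in b]


def trace(p0, I_ring, I_wire, step=0.01, n_steps=300):
    """Integrate dr/ds = B/|B| from p0 with fixed-step fourth-order Runge-Kutta."""
    pts = [list(p0)]
    p = list(p0)
    for _ in range(n_steps):
        k1 = direction(p, I_ring, I_wire)
        k2 = direction([p[i] + 0.5 * step * k1[i] for i in range(3)], I_ring, I_wire)
        k3 = direction([p[i] + 0.5 * step * k2[i] for i in range(3)], I_ring, I_wire)
        k4 = direction([p[i] + step * k3[i] for i in range(3)], I_ring, I_wire)
        p = [p[i] + step * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0 for i in range(3)]
        pts.append(p)
    return pts


start = (0.5, 0.0, 0.0)
for I_wire in (0.0, 1.0):
    end = trace(start, I_ring=1.0, I_wire=I_wire)[-1]
    azimuth = math.degrees(math.atan2(end[1], end[0]))
    print(f"I_wire = {I_wire}: azimuth after 3 length units = {azimuth:.1f} degrees")
```

With the wire current switched off the traced line stays in the plane y = 0; with it switched on the line drifts in azimuth as it circles the ring, which is the winding behavior described above. Whether it ever closes depends on the ratio of the two currents, and a finite-step integration cannot settle that question by itself.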

Let’s go back to the flux tube in Fig. 5.45. We start with a bunch of
points on the “entering” patch $a_1$, construct the field lines that issue from
them, and track those lines out to $a_2$. These are the field lines inside the
flux tube... but if we continue to follow one of those lines out past $a_2$,
it wanders around (outside the tube) and eventually, perhaps, finds itself
back at $a_1$. If it happens to hit one of our starting points, great: it won’t
change the total number. But what if it lands between the dots? When it
enters the flux tube we will suddenly have too many field lines to represent
the strength of the field. What is to be done? Either we abandon the idea
that the density of field lines represents the magnitude of the field (in which
case we sacrifice the main point of the field-line picture) or else we must
stop the errant line at the gate, and identify its continuation as one of
the lines we already drew. As long as just as many field lines exit $a_1$ as
entered it, that won’t violate $\nabla \cdot \mathbf{B} = 0$... but they are discontinuous at the


21 See J. Slepian, Am. J. Phys. 19, 87 (1951); M. Lieberherr, Am. J. Phys. 78, 1117 (2010); L. 
Zilberti, IEEE Magn. Lett. 8, 1306005 (2017). See also Prob. 5.67. 
boundary: they lose their identity, and hop over to an existing field line.
(The field line in Fig. 5.47 makes one circuit of the loop, and then jumps 
over to rejoin its tail.) 

This complication vitiates the utility of field lines for any but the very 
simplest examples (which is why we don’t like to talk about it). I guess 
the moral of the story is not to take magnetic²² field lines too literally.
They work within finite neighborhoods, but not globally. Faraday, who 
introduced the concept, thought of field lines as real physical entities — 
invisible wires (or spider webs) that exist out there in space (to this day 
that is how most electrical engineers think of them). If you really want 
to sustain this picture, you’re going to have to live with magnetic field 
lines that are discontinuous at places that are entirely arbitrary. I prefer 
to say that field lines are not physically real; they can sometimes be useful 
as visual aids, but don’t trust them with serious money. At any rate, in 
Maxwell’s formulation, field lines play no role: the “real” physical quantities 
are the fields (vectors at each point), and field lines are nothing but
cartoons. 


Problem 5.21. 


(a) Find the density $\rho$ of mobile charges in a piece of copper, assuming each atom contributes
one free electron. [Look up the necessary physical constants.] 

(b) Calculate the average electron velocity in a copper wire 1 mm in diameter, carrying a current 
of 1 A. [Note: This is literally a snail’s pace. How, then, can you carry on a long distance 
telephone conversation?] 

(c) What is the force of attraction between two such wires, 1 cm apart? 

(d) If you could somehow remove the stationary positive charges, what would the electrical 
repulsion force be? How many times greater than the magnetic force is it? 


Problem 5.22. Is Ampère's law consistent with the general rule (Eq. 1.46) that divergence-of- 
curl is always zero? Show that Ampère’s law cannot be valid, in general, outside magnetostatics. 
Is there any such “defect” in the other three Maxwell equations? 


Problem 5.23. Suppose there did exist magnetic monopoles. How would you modify 
Maxwell's equations and the force law to accommodate them? If you think there are several 
plausible options, list them, and suggest how you might decide experimentally which one 
is right. 


Problem 5.24. Why doesn’t the “dirty little secret” apply to electrostatics? 


22 Electrostatic field lines do not suffer from this defect. See Prob. 5.24. 
5.4 Magnetic Vector Potential

5.4.1 The Vector Potential 


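Just as $\nabla \times \mathbf{E} = 0$ permitted us to introduce a scalar potential (V) in
electrostatics,

\[ \mathbf{E} = -\nabla V, \]

so $\nabla \cdot \mathbf{B} = 0$ invites the introduction of a vector potential A in magnetostatics:

\[ \mathbf{B} = \nabla \times \mathbf{A}. \tag{5.61} \]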


The former is authorized by Theorem 1 (of Section 1.6.2), the latter by
Theorem 2 (the proof of Theorem 2 is developed in Prob. 5.33). The
potential formulation automatically takes care of $\nabla \cdot \mathbf{B} = 0$ (since the
divergence of a curl is always zero); there remains Ampère’s law:

\[ \nabla \times \mathbf{B} = \nabla \times (\nabla \times \mathbf{A}) = \nabla(\nabla \cdot \mathbf{A}) - \nabla^2 \mathbf{A} = \mu_0 \mathbf{J}. \tag{5.62} \]


Now, the electric potential had a built-in ambiguity: you can add to V 
any function whose gradient is zero (which is to say, any constant), without 
altering the physical quantity E. Likewise, you can add to A any function 
whose curl vanishes (which is to say, the gradient of any scalar), with no 
effect on B. We can exploit this freedom to eliminate the divergence of A: 


\[ \nabla \cdot \mathbf{A} = 0. \tag{5.63} \]


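To prove that this is always possible, suppose that our original potential,
$\mathbf{A}_0$, is not divergenceless. If we add to it the gradient of $\lambda$ ($\mathbf{A} = \mathbf{A}_0 + \nabla\lambda$),
the new divergence is

\[ \nabla \cdot \mathbf{A} = \nabla \cdot \mathbf{A}_0 + \nabla^2 \lambda. \]

We can accommodate Eq. 5.63, then, if a function $\lambda$ can be found that
satisfies

\[ \nabla^2 \lambda = -\nabla \cdot \mathbf{A}_0. \]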


But this is mathematically identical to Poisson’s equation (2.24),

\[ \nabla^2 V = -\frac{\rho}{\epsilon_0}, \]

with $\nabla \cdot \mathbf{A}_0$ in place of $\rho/\epsilon_0$ as the “source.” And we know how to solve
Poisson’s equation; that’s what electrostatics is all about (“given the charge
distribution, find the potential”). In particular, if $\rho$ goes to zero at infinity,
the solution is Eq. 2.29:

\[ V = \frac{1}{4\pi\epsilon_0} \int \frac{\rho}{|\mathbf{r} - \mathbf{r}'|}\, d\tau', \]

and by the same token, if $\nabla \cdot \mathbf{A}_0$ goes to zero at infinity, then

\[ \lambda = \frac{1}{4\pi} \int \frac{\nabla \cdot \mathbf{A}_0}{|\mathbf{r} - \mathbf{r}'|}\, d\tau'. \]


If $\nabla \cdot \mathbf{A}_0$ does not go to zero at infinity, we’ll have to use other means
to discover the appropriate $\lambda$, just as we get the electric potential by
other means when the charge distribution extends to infinity. But the
essential point remains: it is always possible to make the vector potential
divergenceless. To put it the other way around: the definition $\mathbf{B} = \nabla \times \mathbf{A}$
specifies the curl of A, but it doesn’t say anything about the divergence; we
are at liberty to pick that as we see fit, and zero is ordinarily the simplest
choice.
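With this condition on A, Ampère’s law (Eq. 5.62) becomes

\[ \nabla^2 \mathbf{A} = -\mu_0 \mathbf{J}. \tag{5.64} \]

This again is nothing but Poisson’s equation, or rather, it is three Poisson
equations, one for each Cartesian²³ component. Assuming J goes to zero
at infinity, we can read off the solution:

\[ \mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int \frac{\mathbf{J}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\, d\tau'. \tag{5.65} \]

For line and surface currents,

\[ \mathbf{A} = \frac{\mu_0}{4\pi} \int \frac{\mathbf{I}}{|\mathbf{r} - \mathbf{r}'|}\, dl' = \frac{\mu_0 I}{4\pi} \int \frac{1}{|\mathbf{r} - \mathbf{r}'|}\, d\mathbf{l}'; \qquad \mathbf{A} = \frac{\mu_0}{4\pi} \int \frac{\mathbf{K}}{|\mathbf{r} - \mathbf{r}'|}\, da'. \tag{5.66} \]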


(If the current does not go to zero at infinity, we have to find other ways to 
get A; some of these are explored in Ex. 5.12 and in the problems at the end 
of the section.) 

It must be said that A is not as useful as V. For one thing, it’s still a vector,
and although Eqs. 5.65 and 5.66 are somewhat easier to work with than the
Biot-Savart law, you still have to fuss with components. It would be nice if
we could get away with a scalar potential

\[ \mathbf{B} = -\nabla U, \tag{5.67} \]

but this is incompatible with Ampère’s law, since the curl of a gradient
is always zero. (A magnetostatic scalar potential can be used, if you stick 
scrupulously to simply connected, current-free regions, but as a theoretical 
tool, it is of limited interest. See Prob. 5.31.) Moreover, since magnetic 
forces do no work, A does not admit a simple physical interpretation in 
terms of potential energy per unit charge. (In some contexts it can be 


23 In Cartesian coordinates, $\nabla^2 \mathbf{A} = (\nabla^2 A_x)\hat{\mathbf{x}} + (\nabla^2 A_y)\hat{\mathbf{y}} + (\nabla^2 A_z)\hat{\mathbf{z}}$, so Eq. 5.64 reduces to
$\nabla^2 A_x = -\mu_0 J_x$, $\nabla^2 A_y = -\mu_0 J_y$, and $\nabla^2 A_z = -\mu_0 J_z$. In curvilinear coordinates the unit
vectors themselves are functions of position, and must be differentiated, so it is not the
case, for example, that $\nabla^2 A_r = -\mu_0 J_r$. Remember that even if you plan to evaluate integrals
such as 5.65 using curvilinear coordinates, you must first express J in terms of its Cartesian
components (see Section 1.4.1).
interpreted as momentum per unit charge.²⁴) Nevertheless, the vector potential
has substantial theoretical importance, as we shall see in Chapter 10.


Example 5.11. A spherical shell of radius R, carrying a uniform surface
charge $\sigma$, is set spinning at angular velocity $\boldsymbol{\omega}$. Find the vector
potential it produces at point r (Fig. 5.48).


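Solution. It might seem natural to set the polar axis along $\boldsymbol{\omega}$, but in
fact the integration is easier if we let r lie on the z-axis, so that $\boldsymbol{\omega}$ is
tilted at an angle $\psi$. We might as well orient the x-axis so that $\boldsymbol{\omega}$ lies
in the xz plane, as shown in Fig. 5.49. According to Eq. 5.66,

\[ \mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int \frac{\mathbf{K}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\, da', \]

where $\mathbf{K} = \sigma\mathbf{v}$, $|\mathbf{r} - \mathbf{r}'| = \sqrt{R^2 + r^2 - 2Rr\cos\theta'}$, and $da' = R^2 \sin\theta'\, d\theta'\, d\phi'$.
Now the velocity of a point $\mathbf{r}'$ in a rotating rigid body is given by $\boldsymbol{\omega} \times \mathbf{r}'$;
in this case,

\[ \mathbf{v} = \boldsymbol{\omega} \times \mathbf{r}' = \begin{vmatrix} \hat{\mathbf{x}} & \hat{\mathbf{y}} & \hat{\mathbf{z}} \\ \omega\sin\psi & 0 & \omega\cos\psi \\ R\sin\theta'\cos\phi' & R\sin\theta'\sin\phi' & R\cos\theta' \end{vmatrix} \]
\[ = R\omega \left[ -(\cos\psi \sin\theta' \sin\phi')\,\hat{\mathbf{x}} + (\cos\psi \sin\theta' \cos\phi' - \sin\psi \cos\theta')\,\hat{\mathbf{y}} + (\sin\psi \sin\theta' \sin\phi')\,\hat{\mathbf{z}} \right]. \]

Notice that each of these terms, save one, involves either $\sin\phi'$ or
$\cos\phi'$. Since

\[ \int_0^{2\pi} \sin\phi'\, d\phi' = \int_0^{2\pi} \cos\phi'\, d\phi' = 0, \]

such terms contribute nothing. There remains

\[ \mathbf{A}(\mathbf{r}) = -\frac{\mu_0 R^3 \sigma \omega \sin\psi}{2} \left( \int_0^{\pi} \frac{\cos\theta' \sin\theta'}{\sqrt{R^2 + r^2 - 2Rr\cos\theta'}}\, d\theta' \right) \hat{\mathbf{y}}. \]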


24 M. D. Semon and J. R. Taylor, Am. J. Phys. 64, 1361 (1996). 
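Letting $u = \cos\theta'$, the integral becomes

\[ \int_{-1}^{+1} \frac{u}{\sqrt{R^2 + r^2 - 2Rru}}\, du = -\left. \frac{(R^2 + r^2 + Rru)}{3R^2 r^2} \sqrt{R^2 + r^2 - 2Rru}\, \right|_{-1}^{+1} \]
\[ = -\frac{1}{3R^2 r^2} \left[ (R^2 + r^2 + Rr)\,|R - r| - (R^2 + r^2 - Rr)(R + r) \right]. \]

If the point r lies inside the sphere, then R > r, and this expression
reduces to $(2r/3R^2)$; if r lies outside the sphere, so that R < r, it reduces
to $(2R/3r^2)$. Noting that $(\boldsymbol{\omega} \times \mathbf{r}) = -\omega r \sin\psi\, \hat{\mathbf{y}}$, we have, finally,

\[ \mathbf{A}(\mathbf{r}) = \begin{cases} \dfrac{\mu_0 R \sigma}{3} (\boldsymbol{\omega} \times \mathbf{r}) & \text{for points inside the sphere,} \\[2ex] \dfrac{\mu_0 R^4 \sigma}{3 r^3} (\boldsymbol{\omega} \times \mathbf{r}) & \text{for points outside the sphere.} \end{cases} \tag{5.68} \]

Having evaluated the integral, I revert to the “natural” coordinates
of Fig. 5.48, in which $\boldsymbol{\omega}$ coincides with the z-axis and the point r is at
$(r, \theta, \phi)$:

\[ \mathbf{A}(r, \theta, \phi) = \begin{cases} \dfrac{\mu_0 R \omega \sigma}{3}\, r \sin\theta\, \hat{\boldsymbol{\phi}} & (r \le R), \\[2ex] \dfrac{\mu_0 R^4 \omega \sigma}{3}\, \dfrac{\sin\theta}{r^2}\, \hat{\boldsymbol{\phi}} & (r \ge R). \end{cases} \tag{5.69} \]

Curiously, the field inside this spherical shell is uniform:

\[ \mathbf{B} = \nabla \times \mathbf{A} = \frac{2\mu_0 R \omega \sigma}{3} (\cos\theta\, \hat{\mathbf{r}} - \sin\theta\, \hat{\boldsymbol{\theta}}) = \frac{2}{3} \mu_0 \sigma R \omega\, \hat{\mathbf{z}} = \frac{2}{3} \mu_0 \sigma R \boldsymbol{\omega}. \tag{5.70} \]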
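The closed form of the u-integral in this example is easy to spot-check numerically. The sketch below (plain Python, midpoint rule; the function names, the choice R = 1, and the sample radii are illustrative assumptions) compares a direct numerical integration with the two limiting expressions quoted in the solution, $2r/3R^2$ inside and $2R/3r^2$ outside.

```python
import math


def shell_integral(r, R, n=20000):
    """Midpoint-rule value of the integral of u / sqrt(R^2 + r^2 - 2 R r u) for u from -1 to 1."""
    h = 2.0 / n
    total = 0.0
    for k in range(n):
        u = -1.0 + (k + 0.5) * h
        total += u / math.sqrt(R * R + r * r - 2.0 * R * r * u)
    return total * h


def closed_form(r, R):
    """Values quoted in the example: 2r/(3R^2) for r < R, 2R/(3r^2) for r > R."""
    return 2.0 * r / (3.0 * R * R) if r < R else 2.0 * R / (3.0 * r * r)


# Sample field points with R = 1 (avoid r = R, where the integrand is singular at u = 1).
for r in (0.3, 0.9, 1.5, 4.0):
    print(f"r = {r}: numeric = {shell_integral(r, 1.0):.6f}, closed form = {closed_form(r, 1.0):.6f}")
```

Agreement on both sides of r = R is a quick way to confirm the absolute value $|R - r|$ was handled correctly.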


Example 5.12. Find the vector potential of an infinite solenoid with n
turns per unit length, radius R, and current I.


Solution. This time we cannot use Eq. 5.66, since the current itself
extends to infinity. But here’s a cute method that does the job. Notice
that

\[ \oint \mathbf{A} \cdot d\mathbf{l} = \int (\nabla \times \mathbf{A}) \cdot d\mathbf{a} = \int \mathbf{B} \cdot d\mathbf{a} = \Phi, \tag{5.71} \]

where $\Phi$ is the flux of B through the loop in question. This is reminiscent
of Ampère’s law in integral form (Eq. 5.57),

\[ \oint \mathbf{B} \cdot d\mathbf{l} = \mu_0 I_{\text{enc}}. \]

In fact, it’s the same equation, with $\mathbf{B} \to \mathbf{A}$ and $\mu_0 I_{\text{enc}} \to \Phi$.
If symmetry permits, we can determine A from $\Phi$ in the same way
we got B from $I_{\text{enc}}$, in Section 5.3.3. The present problem (with a
uniform longitudinal magnetic field $\mu_0 n I$ inside the solenoid and no
field outside) is analogous to the Ampère’s law problem of a fat
wire carrying a uniformly distributed current. The vector potential is
“circumferential” (it mimics the magnetic field in the analog); using a
circular “Ampèrian loop” at radius s inside the solenoid, we have

\[ \oint \mathbf{A} \cdot d\mathbf{l} = A(2\pi s) = \int \mathbf{B} \cdot d\mathbf{a} = \mu_0 n I (\pi s^2), \]

so

\[ \mathbf{A} = \frac{\mu_0 n I}{2}\, s\, \hat{\boldsymbol{\phi}}, \quad \text{for } s \le R. \tag{5.72} \]

For an Ampèrian loop outside the solenoid, the flux is

\[ \int \mathbf{B} \cdot d\mathbf{a} = \mu_0 n I (\pi R^2), \]

since the field only extends out to R. Thus

\[ \mathbf{A} = \frac{\mu_0 n I}{2}\, \frac{R^2}{s}\, \hat{\boldsymbol{\phi}}, \quad \text{for } s \ge R. \tag{5.73} \]

If you have any doubts about this answer, check it: Does $\nabla \times \mathbf{A} = \mathbf{B}$?
Does $\nabla \cdot \mathbf{A} = 0$? If so, we’re done.²⁵
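The check suggested at the end of the example can also be done numerically. Here is a small sketch, assuming the results 5.72 and 5.73 written in Cartesian components and using central finite differences; the solenoid parameters (n = 1000 turns per meter, I = 2 A, R = 5 cm), the step h, and the function names are arbitrary illustrative choices.

```python
import math

MU_0 = 4e-7 * math.pi   # permeability of free space, N/A^2


def A_solenoid(x, y, n=1000.0, I=2.0, R=0.05):
    """(A_x, A_y) of an infinite solenoid along z, from Eqs. 5.72 and 5.73; A_z = 0."""
    s = math.hypot(x, y)            # must be nonzero: keep off the axis
    mag = 0.5 * MU_0 * n * I * (s if s <= R else R * R / s)
    return (-mag * y / s, mag * x / s)   # magnitude times phi-hat = (-y/s, x/s)


def curl_z_and_div(x, y, h=1e-6):
    """Central-difference estimates of (curl A)_z and div A at (x, y)."""
    dAy_dx = (A_solenoid(x + h, y)[1] - A_solenoid(x - h, y)[1]) / (2 * h)
    dAx_dy = (A_solenoid(x, y + h)[0] - A_solenoid(x, y - h)[0]) / (2 * h)
    dAx_dx = (A_solenoid(x + h, y)[0] - A_solenoid(x - h, y)[0]) / (2 * h)
    dAy_dy = (A_solenoid(x, y + h)[1] - A_solenoid(x, y - h)[1]) / (2 * h)
    return dAy_dx - dAx_dy, dAx_dx + dAy_dy


B_inside = MU_0 * 1000.0 * 2.0      # expected uniform field mu_0 n I
for x, y in [(0.01, 0.02), (0.08, -0.03)]:   # one point inside, one outside
    curl_z, div = curl_z_and_div(x, y)
    print(f"s = {math.hypot(x, y):.4f} m: curl_z / (mu0 n I) = {curl_z / B_inside:.6f}, "
          f"div / (mu0 n I) = {div / B_inside:.2e}")
```

Inside the solenoid the curl should reproduce $\mu_0 n I$, outside it should vanish, and the divergence should vanish in both regions. Points within h of s = R are best avoided, since the finite difference would straddle the kink in A.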


Typically, the direction of A mimics the direction of the current. For 
instance, both were azimuthal in Exs. 5.11 and 5.12. Indeed, if all the 
current flows in one direction, then Eq. 5.65 suggests that A must point that 
way too. Thus the potential of a finite segment of straight wire (Prob. 5.25) 
is in the direction of the current. Of course, if the current extends to infinity 
you can’t use Eq. 5.65 in the first place (see Probs. 5.28 and 5.29). Moreover, 
you can always add an arbitrary constant vector to A — this is analogous to 
changing the reference point for V, and it won’t affect the divergence or 
curl of A, which is all that matters (in Eq. 5.65 we have chosen the constant 
so that A goes to zero at infinity). In principle you could even use a vector 
potential that is not divergenceless, in which case all bets are off. Despite 
these caveats, the essential point remains: ordinarily the direction of A will 
match the direction of the current. 


Problem 5.25. Find the magnetic vector potential of a finite segment of straight wire carrying
a current I. [Put the wire on the z-axis, from $z_1$ to $z_2$, and use Eq. 5.66.] Check that your answer
is consistent with Eq. 5.37.


Problem 5.26. 


(a) What current density would produce the vector potential $\mathbf{A} = k\,\hat{\boldsymbol{\phi}}$ (where k is a constant),
in cylindrical coordinates? 


25 Notice that A is nonzero outside the solenoid, even though B (= V x A) vanishes there. 
In classical electrodynamics only B can exert a magnetic force on a charged particle (via 
the Lorentz force law). In the quantum theory, by striking contrast, the magnetic influence
on a charge is expressed in terms of A, not B, and an electron passing nearby but outside 
the solenoid experiences a change in the phase of its wave function, which can be detected 
in the famous Aharonov-Bohm effect. (See, for instance, D. J. Griffiths and D. F. Schroeter, 
Introduction to Quantum Mechanics, 3rd ed., Cambridge University Press (2018), Sect. 4.5.) 
(b) Consider an azimuthally symmetric magnetic field; it points in the z direction, and its
magnitude is a function only of s. Check that

\[ \mathbf{A} = A(s)\,\hat{\boldsymbol{\phi}}, \quad \text{where } A(s) = \frac{1}{s} \int_0^s B(s')\, s'\, ds', \]

by calculating its divergence and curl. (This generalizes Ex. 5.12.)


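Problem 5.27. If B is uniform, show that $\mathbf{A}(\mathbf{r}) = -\frac{1}{2}(\mathbf{r} \times \mathbf{B})$ works. That is, check that
$\nabla \cdot \mathbf{A} = 0$ and $\nabla \times \mathbf{A} = \mathbf{B}$. Is this result unique, or are there other functions with the same
divergence and curl?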


Problem 5.28. 


(a) By whatever means you can think of (short of looking it up), find the vector potential a
distance s from an infinite straight wire carrying a current I. Check that $\nabla \cdot \mathbf{A} = 0$ and
$\nabla \times \mathbf{A} = \mathbf{B}$.

(b) Find the magnetic potential inside the wire, if it has radius R and the current is uniformly 
distributed. 


Problem 5.29. Find the vector potential above and below the plane surface current in Ex. 5.8. 


Problem 5.30. 


(a) Check that Eq. 5.65 is consistent with Eq. 5.63, by applying the divergence. 
(b) Check that Eq. 5.65 is consistent with Eq. 5.47, by applying the curl. 
(c) Check that Eq. 5.65 is consistent with Eq. 5.64, by applying the Laplacian. 


Problem 5.31. Suppose you want to define a magnetic scalar potential U (Eq. 5.67) in the 
vicinity of a current-carrying wire. First of all, you must stay away from the wire itself (there
$\nabla \times \mathbf{B} \neq 0$); but that’s not enough. Show, by applying Ampère’s law to a path that starts at a
and circles the wire, returning to b (Fig. 5.50), that the scalar potential cannot be single valued
(that is, $U(\mathbf{a}) \neq U(\mathbf{b})$, even if they represent the same physical point). As an example, find
the scalar potential for an infinite straight wire. (To avoid a multivalued potential, you must 
restrict yourself to simply connected regions that remain on one side or the other of every wire, 
never allowing you to go all the way around.) 


[Fig. 5.50: Ampèrian loop]


Problem 5.32. Use the results of Ex. 5.11 to find the magnetic field inside a solid sphere, of
uniform charge density $\rho$ and radius R, that is rotating at a constant angular velocity $\boldsymbol{\omega}$.
Problem 5.33.


(a) Complete the proof of Theorem 2, Section 1.6.2. That is, show that any divergenceless vector
field F can be written as the curl of a vector potential A. What you have to do is find $A_x$,
$A_y$, and $A_z$ such that (i) $\partial A_z/\partial y - \partial A_y/\partial z = F_x$, (ii) $\partial A_x/\partial z - \partial A_z/\partial x = F_y$,
and (iii) $\partial A_y/\partial x - \partial A_x/\partial y = F_z$. Here’s one way to do it: Pick $A_x = 0$, and solve (ii)
and (iii) for $A_y$ and $A_z$. Note that the “constants of integration” are themselves functions
of y and z; they’re constant only with respect to x. Now plug these expressions into (i),
and use the fact that $\nabla \cdot \mathbf{F} = 0$ to obtain

\[ A_y = \int_0^x F_z(x', y, z)\, dx'; \qquad A_z = \int_0^y F_x(0, y', z)\, dy' - \int_0^x F_y(x', y, z)\, dx'. \]

(b) By direct differentiation, check that the A you obtained in part (a) satisfies $\nabla \times \mathbf{A} = \mathbf{F}$. Is
A divergenceless? [This was a very asymmetrical construction, and it would be surprising if
it were, although we know that there exists a vector whose curl is F and whose divergence
is zero.]

(c) As an example, let $\mathbf{F} = y\,\hat{\mathbf{x}} + z\,\hat{\mathbf{y}} + x\,\hat{\mathbf{z}}$. Calculate A, and confirm that $\nabla \times \mathbf{A} = \mathbf{F}$. (For
further discussion, see Prob. 5.56.)


5.4.2 Boundary Conditions 


In Chapter 2, I drew a triangular diagram to summarize the relations 
among the three fundamental quantities of electrostatics: the charge density
$\rho$, the electric field E, and the potential V. A similar figure can be constructed
for magnetostatics (Fig. 5.51), relating the current density J, the
field B, and the potential A. There is one “missing link” in the diagram: 
the equation for A in terms of B. It’s unlikely you would ever need such a 
formula, but in case you are interested, see Probs. 5.55 and 5.56. 
Just as the electric field suffers a discontinuity at a surface charge, so
the magnetic field is discontinuous at a surface current. Only this time
it is the tangential component that changes. For if we apply Eq. 5.50, in
integral form,

\[ \oint \mathbf{B} \cdot d\mathbf{a} = 0, \]

to a wafer-thin pillbox straddling the surface (Fig. 5.52), we get

\[ B^{\perp}_{\text{above}} = B^{\perp}_{\text{below}}. \tag{5.74} \]

As for the tangential components, an Ampèrian loop running perpendicular
to the current (Fig. 5.53) yields

\[ \oint \mathbf{B} \cdot d\mathbf{l} = \left( B^{\parallel}_{\text{above}} - B^{\parallel}_{\text{below}} \right) l = \mu_0 I_{\text{enc}} = \mu_0 K l, \]

or

\[ B^{\parallel}_{\text{above}} - B^{\parallel}_{\text{below}} = \mu_0 K. \tag{5.75} \]

Thus the component of B that is parallel to the surface but perpendicular
to the current is discontinuous in the amount $\mu_0 K$. A similar Ampèrian
loop running parallel to the current reveals that the parallel component is
continuous. These results can be summarized in a single formula:

\[ \mathbf{B}_{\text{above}} - \mathbf{B}_{\text{below}} = \mu_0 (\mathbf{K} \times \hat{\mathbf{n}}), \tag{5.76} \]

where $\hat{\mathbf{n}}$ is a unit vector perpendicular to the surface, pointing “upward.”
Like the scalar potential in electrostatics, the vector potential is continuous
across any boundary:

\[ \mathbf{A}_{\text{above}} = \mathbf{A}_{\text{below}}, \tag{5.77} \]

for $\nabla \cdot \mathbf{A} = 0$ guarantees²⁶ that the normal component is continuous, and
$\nabla \times \mathbf{A} = \mathbf{B}$, in the form

\[ \oint \mathbf{A} \cdot d\mathbf{l} = \int \mathbf{B} \cdot d\mathbf{a} = \Phi, \]

means that the tangential components are continuous (the flux through
an Ampèrian loop of vanishing thickness is zero). But the derivative of A
inherits the discontinuity of B:

\[ \frac{\partial \mathbf{A}_{\text{above}}}{\partial n} - \frac{\partial \mathbf{A}_{\text{below}}}{\partial n} = -\mu_0 \mathbf{K}. \tag{5.78} \]

Problem 5.34. 


(a) Check Eq. 5.76 for the configuration in Ex. 5.9. 
(b) Check Eqs. 5.77 and 5.78 for the configuration in Ex. 5.11. 


Problem 5.35. Prove Eq. 5.78, using Eqs. 5.63, 5.76, and 5.77. [Suggestion: I'd set up Cartesian 
coordinates at the surface, with z perpendicular to the surface and x parallel to the current.] 


5.4.3 Multipole Expansion of the Vector Potential 


If you want an approximate formula for the vector potential of a localized
current distribution, valid at distant points, a multipole expansion is in
order. Remember: the idea of a multipole expansion is to write the potential
as a power series in 1/r, where r is the distance to the point in question
(Fig. 5.54); if r is sufficiently large, the series will be dominated by the lowest
nonvanishing contribution, and the higher terms can be ignored. As we
found in Eq. 3.94,

\[ \frac{1}{|\mathbf{r} - \mathbf{r}'|} = \frac{1}{\sqrt{r^2 + (r')^2 - 2rr'\cos\alpha}} = \frac{1}{r} \sum_{n=0}^{\infty} \left( \frac{r'}{r} \right)^n P_n(\cos\alpha), \tag{5.79} \]


26 Note that Eqs. 5.77 and 5.78 presuppose that A is divergenceless. 
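where $\alpha$ is the angle between r and r′. Accordingly, the vector potential of
a current loop can be written

\[ \mathbf{A}(\mathbf{r}) = \frac{\mu_0 I}{4\pi} \oint \frac{1}{|\mathbf{r} - \mathbf{r}'|}\, d\mathbf{l}' = \frac{\mu_0 I}{4\pi} \sum_{n=0}^{\infty} \frac{1}{r^{n+1}} \oint (r')^n P_n(\cos\alpha)\, d\mathbf{l}', \tag{5.80} \]

or, more explicitly,

\[ \mathbf{A}(\mathbf{r}) = \frac{\mu_0 I}{4\pi} \left[ \frac{1}{r} \oint d\mathbf{l}' + \frac{1}{r^2} \oint r' \cos\alpha\, d\mathbf{l}' + \frac{1}{r^3} \oint (r')^2 \left( \frac{3}{2}\cos^2\alpha - \frac{1}{2} \right) d\mathbf{l}' + \cdots \right]. \tag{5.81} \]

As in the multipole expansion of V, we call the first term (which goes like
1/r) the monopole term, the second (which goes like $1/r^2$) dipole, the third
quadrupole, and so on.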
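To see how quickly the series in Eq. 5.79 converges, here is a short numerical sketch. It builds the Legendre polynomials from the standard three-term recurrence and compares partial sums with the exact inverse separation; the function names and the sample geometry (r = 5, r′ = 1, α = 0.7 rad) are illustrative assumptions.

```python
import math


def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    p_prev, p = 1.0, x
    if n == 0:
        return p_prev
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def inverse_separation_series(r, rp, alpha, n_terms):
    """First n_terms of (1/r) * sum over n of (r'/r)^n P_n(cos alpha); requires r' < r."""
    x = math.cos(alpha)
    return sum((rp / r) ** n * legendre(n, x) for n in range(n_terms)) / r


def inverse_separation_exact(r, rp, alpha):
    """1 / sqrt(r^2 + r'^2 - 2 r r' cos alpha), by the law of cosines."""
    return 1.0 / math.sqrt(r * r + rp * rp - 2.0 * r * rp * math.cos(alpha))


r, rp, alpha = 5.0, 1.0, 0.7
exact = inverse_separation_exact(r, rp, alpha)
for n_terms, name in [(1, "monopole"), (2, "+ dipole"), (3, "+ quadrupole"), (4, "+ octopole")]:
    approx = inverse_separation_series(r, rp, alpha, n_terms)
    print(f"{name:14s} relative error = {abs(approx - exact) / exact:.2e}")
```

Each additional term improves the approximation by roughly a factor of r′/r, which is why the lowest nonvanishing term dominates at large distances.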

Now, the magnetic monopole term is always zero, for the integral is just 
the total vector displacement around a closed loop: 


\[ \oint d\mathbf{l}' = 0. \tag{5.82} \]

This reflects the fact that there are no magnetic monopoles in nature (an
assumption contained in Maxwell’s equation $\nabla \cdot \mathbf{B} = 0$, on which the entire
theory of vector potential is predicated).


In the absence of any monopole contribution, the dominant term is the 
dipole (except in the rare case where it, too, vanishes): 


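\[ \mathbf{A}_{\text{dip}}(\mathbf{r}) = \frac{\mu_0 I}{4\pi r^2} \oint r' \cos\alpha\, d\mathbf{l}' = \frac{\mu_0 I}{4\pi r^2} \oint (\hat{\mathbf{r}} \cdot \mathbf{r}')\, d\mathbf{l}'. \tag{5.83} \]

This integral can be rewritten in a more illuminating way if we invoke
Eq. 1.108, with $\mathbf{c} = \hat{\mathbf{r}}$:

\[ \oint (\hat{\mathbf{r}} \cdot \mathbf{r}')\, d\mathbf{l}' = -\hat{\mathbf{r}} \times \int d\mathbf{a}'. \tag{5.84} \]

Then

\[ \mathbf{A}_{\text{dip}}(\mathbf{r}) = \frac{\mu_0}{4\pi} \frac{\mathbf{m} \times \hat{\mathbf{r}}}{r^2}, \tag{5.85} \]

where m is the magnetic dipole moment:

\[ \mathbf{m} \equiv I \int d\mathbf{a} = I\mathbf{a}. \tag{5.86} \]

Here a is the “vector area” of the loop (Prob. 1.62); if the loop is flat, a is the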
ordinary area enclosed, with the direction assigned by the usual right-hand 
rule (fingers in the direction of the current). 
Example 5.13. Find the magnetic dipole moment of the “bookend-shaped”
loop shown in Fig. 5.55. All sides have length w, and it carries
a current I.


Solution. This wire could be considered the superposition of two
plane square loops (Fig. 5.56). The “extra” sides (AB) cancel when the
two are put together, since the currents flow in opposite directions. The
net magnetic dipole moment is

\[ \mathbf{m} = I w^2\, \hat{\mathbf{y}} + I w^2\, \hat{\mathbf{z}}; \]

its magnitude is $\sqrt{2}\, I w^2$, and it points along the 45° line z = y.
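The same answer can be obtained without splitting the loop, by computing the vector area directly from $\mathbf{a} = \frac{1}{2}\oint \mathbf{r} \times d\mathbf{l}$, which for a polygon is half the sum of the cross products of successive vertices. The sketch below is illustrative only: the vertex coordinates are an assumed placement of the bookend (one square in the xy plane, one in the xz plane, sharing the edge along the x-axis), chosen to reproduce the orientation in the solution, since the figure itself is not reproduced here.

```python
def vector_area(vertices):
    """Vector area (1/2) * sum of r_i x r_(i+1) for a closed polygonal loop."""
    ax = ay = az = 0.0
    n = len(vertices)
    for i in range(n):
        x1, y1, z1 = vertices[i]
        x2, y2, z2 = vertices[(i + 1) % n]   # wrap around to close the loop
        ax += y1 * z2 - z1 * y2
        ay += z1 * x2 - x1 * z2
        az += x1 * y2 - y1 * x2
    return (0.5 * ax, 0.5 * ay, 0.5 * az)


def dipole_moment(vertices, current):
    """m = I a for a loop carrying the given current in the order the vertices are listed."""
    return tuple(current * c for c in vector_area(vertices))


# Assumed geometry: six sides of length w, traversed in the direction of the current.
w, I = 1.0, 1.0
bookend = [(w, 0, 0), (w, w, 0), (0, w, 0), (0, 0, 0), (0, 0, w), (w, 0, w)]
m = dipole_moment(bookend, I)
print("m =", m, " |m| =", sum(c * c for c in m) ** 0.5)
```

Shifting every vertex by the same constant vector leaves the result unchanged, because the extra terms sum to zero around a closed loop.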


It is clear from Eq. 5.86 that the magnetic dipole moment is indepen- 
dent of the choice of origin. You may remember that the electric dipole 
moment is independent of the origin only when the total charge vanishes 
(Section 3.4.3). Since the magnetic monopole moment is always zero, it is 
not too surprising that the magnetic dipole moment is always independent 
of origin. 

Although the dipole term dominates the multipole expansion (unless 
m=0) and thus offers a good approximation to the true potential, it is 
not ordinarily the exact potential; there will be quadrupole, octopole, and 
higher contributions. You might ask, is it possible to devise a current 
distribution whose potential is “pure” dipole — for which Eq. 5.85 is exact? 
Well, yes and no: like the electrical analog, it can be done, but the model 
is a bit contrived. To begin with, you must take an infinitesimally small 
loop at the origin, but then, in order to keep the dipole moment finite, you 
have to crank the current up to infinity, with the product m = Ia held fixed.
[Figure: (a) Field of a “pure” dipole; (b) Field of a “physical” dipole]


In practice, the dipole potential is a suitable approximation whenever the 
distance r greatly exceeds the size of the loop. 

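The magnetic field of a (perfect) dipole is easiest to calculate if we put
m at the origin and let it point in the z direction (Fig. 5.57). According to
Eq. 5.85, the potential at point $(r, \theta, \phi)$ is

\[ \mathbf{A}_{\text{dip}}(\mathbf{r}) = \frac{\mu_0}{4\pi} \frac{m \sin\theta}{r^2}\, \hat{\boldsymbol{\phi}}, \tag{5.87} \]

and hence

\[ \mathbf{B}_{\text{dip}}(\mathbf{r}) = \nabla \times \mathbf{A} = \frac{\mu_0 m}{4\pi r^3} (2\cos\theta\, \hat{\mathbf{r}} + \sin\theta\, \hat{\boldsymbol{\theta}}). \tag{5.88} \]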


Surprisingly, this is identical in structure to the field of an electric dipole 
(Eq. 3.103)! (Up close, however, the field of a physical magnetic dipole — a 
small current loop — looks quite different from the field of a physical electric 
dipole — plus and minus charges a short distance apart. Compare Fig. 5.58 
with Fig. 3.37.) 
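How far away is “far enough” for the dipole formula? A quick numerical comparison helps. The sketch below assumes a circular loop of radius R with moment $m = I\pi R^2$ and compares the standard exact on-axis field, $\mu_0 I R^2 / 2(R^2 + z^2)^{3/2}$, with the radial component of Eq. 5.88 at θ = 0; the function names and sample distances are illustrative.

```python
import math

MU_0 = 4e-7 * math.pi   # permeability of free space, N/A^2


def loop_field_on_axis(z, R, I):
    """Exact field on the axis of a circular current loop, a distance z from its center."""
    return MU_0 * I * R ** 2 / (2.0 * (R ** 2 + z ** 2) ** 1.5)


def dipole_field(r, theta, m):
    """(B_r, B_theta) of a pure dipole of moment m along z, as in Eq. 5.88."""
    pref = MU_0 * m / (4.0 * math.pi * r ** 3)
    return (2.0 * pref * math.cos(theta), pref * math.sin(theta))


R, I = 1.0, 1.0
m = I * math.pi * R ** 2          # dipole moment of a flat circular loop
for z in (1.0, 3.0, 10.0, 30.0):
    exact = loop_field_on_axis(z, R, I)
    approx = dipole_field(z, 0.0, m)[0]
    print(f"z/R = {z:5.1f}: exact / dipole = {exact / approx:.4f}")
```

The ratio equals $z^3/(R^2 + z^2)^{3/2}$, so the dipole formula overestimates the field close in and is good to better than one percent only beyond roughly a dozen loop radii.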


Problem 5.36. Show that the magnetic field of a dipole can be written in coordinate-free form:

\[ \mathbf{B}_{\text{dip}}(\mathbf{r}) = \frac{\mu_0}{4\pi} \frac{1}{r^3} \left[ 3(\mathbf{m} \cdot \hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{m} \right]. \tag{5.89} \]


Problem 5.37. A circular loop of wire, with radius R, lies in the xy plane (centered at the
origin) and carries a current I running counterclockwise as viewed from the positive z-axis.


(a) What is its magnetic dipole moment? 

(b) What is the (approximate) magnetic field at points far from the origin? 

(c) Show that, for points on the z-axis, your answer is consistent with the exact field (Ex. 5.6),
when $z \gg R$.


Problem 5.38. Find the exact magnetic field a distance z above the center of a square loop of
side w, carrying a current I. Verify that it reduces to the field of a dipole, with the appropriate
dipole moment, when $z \gg w$.


Problem 5.39. 


(a) A phonograph record of radius R, carrying a uniform surface charge $\sigma$, is rotating at
constant angular velocity $\omega$. Find its magnetic dipole moment.

(b) Find the magnetic dipole moment of the spinning spherical shell in Ex. 5.11. Show that for 
points r > R the potential is that of a perfect dipole. 


Problem 5.40. I worked out the multipole expansion for the vector potential of a line current
because that’s the most common type, and in some respects the easiest to handle. For a volume 
current J: 


(a) write down the multipole expansion, analogous to Eq. 5.80; 
(b) write down the monopole potential, and prove that it vanishes; 
(c) using Eqs. 1.107 and 5.86, show that the dipole moment can be written

\[ \mathbf{m} = \frac{1}{2} \int (\mathbf{r} \times \mathbf{J})\, d\tau. \tag{5.90} \]


More Problems on Chapter 5 


Problem 5.41. Analyze the motion of a particle (charge q, mass m) in the magnetic field of a
long straight wire carrying a steady current I.²⁷


(a) Is its kinetic energy conserved?

(b) Find the force on the particle, in cylindrical coordinates, with I along the z-axis.

(c) Obtain the equations of motion, that is, equations for $\ddot{s}$, $\ddot{\phi}$, and $\ddot{z}$. The latter two can be
integrated once, yielding equations for $\dot{\phi}$ and $\dot{z}$ (as functions of s). The equation for $\ddot{s}$ is not
so easy; instead, use the fact that the speed is a constant to get the formula for $\dot{s}$.

(d) Suppose $\dot{z}$ is constant. Describe the motion.


27 For a numerical study of this configuration see Prob. 5.66. 
Problem 5.42. Imagine two infinite straight steady currents, $I_1$ and $I_2$, in arbitrary relative
orientations. The distance of closest approach is d, and if you slide one over (without changing
its direction) so that they intersect, they would make an angle $\alpha$. Calculate the magnetic
force of attraction/repulsion between the currents (in the original configuration).²⁸ [Answer:
$\frac{1}{2}\mu_0 I_1 I_2 \cot\alpha$.]


Problem 5.43. It may have occurred to you that since parallel currents attract, the current 
within a single wire should contract into a tiny concentrated stream along the axis. Yet in practice 
the current typically distributes itself quite uniformly over the wire. How do you account for 
this? If the positive charges (density ρ₊) are “nailed down,” and the negative charges (density 
ρ₋) move at speed v (and none of these depends on the distance from the axis), show that 
ρ₋ = −ρ₊γ², where γ = 1/√(1 − (v/c)²) and c² = 1/μ₀ε₀. If the wire as a whole 
is neutral, where is the compensating charge located?²⁹ [Notice that for typical velocities (see 
Prob. 5.21), the two charge densities are essentially unchanged by the current, since γ ≈ 1. 
In plasmas, however, where the positive charges are also free to move, this so-called pinch 
effect can be very significant.] 


Problem 5.44. A current I flows to the right through a rectangular bar of conducting material, 
in the presence of a uniform magnetic field B pointing out of the page (Fig. 5.59). 


(a) If the moving charges are positive, in which direction are they deflected by the magnetic 
field? This deflection results in an accumulation of charge on the upper and lower surfaces of 
the bar, which in turn produces an electric force to counteract the magnetic one. Equilibrium 
occurs when the two exactly cancel. (This is known as the Hall effect.) 

(b) Find the resulting potential difference (the Hall voltage) between the top and bottom of 
the bar, in terms of B, v (the speed of the charges), and the relevant dimensions of the bar.³⁰ 

(c) How would your analysis change if the moving charges were negative? [The Hall effect is 
the classic way of determining the sign of the mobile charge carriers in a material. ] 


Fig. 5.59          Fig. 5.60 


Problem 5.45. A plane wire loop of irregular shape is situated so that part of it is in a uniform 
magnetic field B (in Fig. 5.60 the field occupies the shaded region, and points perpendicular to 
the plane of the loop). The loop carries a current I. Show that the net magnetic force on the 


28 P-M. Binder, F. Cross, and J. K. Silva, Eur. J. Phys. 37, 045206 (2016). 
29 For further discussion, see D. C. Gabuzda, Am. J. Phys. 61, 360 (1993). 
30 The potential within the bar makes an interesting boundary-value problem. See M. J. 
Moelter, J. Evans, G. Elliot, and M. Jackson, Am. J. Phys. 66, 668 (1998). 

loop is F = IBw, where w is the chord subtended. Generalize this result to the case where 
the magnetic field region itself has an irregular shape. What is the direction of the force? 


! Problem 5.46. A circularly symmetrical magnetic field (B depends only on the distance from 
the axis), pointing perpendicular to the page, occupies the shaded region in Fig. 5.61. 


Fig. 5.61 (labels: field region; particle trajectory) 


(a) If the total flux (∫ B · da) is zero, show that a charged particle that starts out at the center 
(with velocity v, in the plane) will emerge from the field region on a radial path (provided it 
escapes at all). On the reverse trajectory, a particle fired toward the center from outside will 
hit its target (if it has sufficient energy), though it may follow a weird route getting there. 
[Hint: Calculate the total angular momentum acquired by the particle, using the Lorentz 
force law.] 

(b) Alternatively, if the speed is insufficient ($v < v_{\rm esc}$), a particle fired out from the center will 
reach a critical radius $s_c$ (still within the field region) and return to the center, or a particle 
fired radially inward from the outside will come to a critical radius and then fly back out, 
without ever making it to the center. Show that the critical radius is given by 

$$s_c = \frac{q}{2\pi m v}\,\Phi(s_c),$$

where m is the mass of the particle, q is its charge, and v is its speed. (If there are multiple 
critical radii, a third possibility is for the particle to be trapped between them.) 

(c) Consider the case of a field that is linear in s/R (that is, B = B₀[(s/R) − a]). First 
determine the constant a, then find the critical radii and the “escape velocity” $v_{\rm esc}$.³¹ 

(d) Suppose the field is uniform (B₀) out to radius a, and then reverses direction (−B₀) out to 
R. Find a, $s_c$, and $v_{\rm esc}$, and sketch the trajectory for (i) v slightly less than $v_{\rm esc}$ and (ii) v 
slightly greater than $v_{\rm esc}$. 


Problem 5.47. Calculate the magnetic force of attraction between the northern and southern 
hemispheres of a spinning charged spherical shell (Ex. 5.11). 
[Answer: (π/4) μ₀σ²ω²R⁴] 


31 For more on this curious problem, see J. Franklin and K. Cole Newton, Am. J. Phys. 84, 
263 (2016). 

Problem 5.48. Consider the motion of a particle with mass m and electric charge $q_e$ in the 
field of a (hypothetical) stationary magnetic monopole $q_m$ at the origin: 

$$\mathbf{B} = \frac{\mu_0}{4\pi}\frac{q_m}{r^2}\,\hat{\mathbf{r}}.$$

(a) Find the acceleration of $q_e$, expressing your answer in terms of $q_e$, $q_m$, m, r (the position of 
the particle), and v (its velocity). 

(b) Show that the speed v = |v| is a constant of the motion. 

(c) Show that the vector quantity 

$$\mathbf{Q} \equiv m(\mathbf{r}\times\mathbf{v}) - \frac{\mu_0 q_e q_m}{4\pi}\,\hat{\mathbf{r}}$$

is a constant of the motion. [Hint: Differentiate it with respect to time, and prove, using 
the equation of motion from (a), that the derivative is zero.] For the physical interpretation 
of Q, see Prob. 8.21. 

(d) Choosing spherical coordinates (r, θ, φ), with the polar (z) axis along Q: 

(i) calculate $\mathbf{Q}\cdot\hat{\boldsymbol{\phi}}$, and show that θ is a constant of the motion (so $q_e$ moves on the 
surface of a cone, something Poincaré discovered in 1896);³² 
(ii) calculate $\mathbf{Q}\cdot\hat{\mathbf{r}}$, and show that the magnitude of Q is 

$$Q = \frac{\mu_0}{4\pi}\left|\frac{q_e q_m}{\cos\theta}\right|;$$

(iii) calculate $\mathbf{Q}\cdot\hat{\boldsymbol{\theta}}$, show that 

$$\frac{d\phi}{dt} = \frac{k}{r^2},$$

and determine the constant k. 

(e) By expressing v² in spherical coordinates, obtain the equation for the trajectory, in the form 

$$\frac{dr}{d\phi} = f(r)$$

(that is: determine the function f(r)). 

(f) Solve this equation for r(φ). 


Problem 5.49. Use the Biot-Savart law (most conveniently in the form of Eq. 5.42 appropriate 
to surface currents) to find the field inside and outside an infinitely long solenoid of radius R, 
with n turns per unit length, carrying a steady current I. 


Problem 5.50. The magnetic field on the axis of a circular current loop (Eq. 5.41) is far from 
uniform (it falls off sharply with increasing z). You can produce a more nearly uniform field by 
using two such loops a distance d apart (Fig. 5.62). 


32 In point of fact, the charge follows a geodesic on the cone. The original paper 
is H. Poincaré, Comptes rendus de l'Academie des Sciences 123, 530 (1896); for a 
more modern treatment, see B. Rossi and S. Olbert, Introduction to the Physics of 
Space, McGraw-Hill, New York (1970). For extensive history see K. T. McDonald, 
http://kirkmcd.princeton.edu/examples/birkeland.pdf. 

Fig. 5.62 


(a) Find the field (B) as a function of z, and show that ∂B/∂z is zero at the point midway 
between them (z = 0). 

(b) If you pick d just right, the second derivative of B will also vanish at the midpoint. This 
arrangement is known as a Helmholtz coil; it's a convenient way of producing locally 
uniform fields in the laboratory. Determine d such that ∂²B/∂z² = 0 at the midpoint, 
and find the resulting magnetic field at the center. [Answer: 8μ₀I/(5√5 R)] 
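
For a numerical feel for part (b), one can tabulate the on-axis field of two coaxial loops and estimate its curvature at the midpoint for a few trial separations. The Python sketch below is illustrative only: it assumes the standard on-axis loop field μ₀IR²/[2(R² + z²)^(3/2)] for Eq. 5.41, places the loops at z = ±d/2, and uses helper names (loop_axis_field, pair_field, second_derivative) that are hypothetical and do not appear in the text.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, T·m/A


def loop_axis_field(z, R, I):
    """On-axis field of one circular loop (assumed form of Eq. 5.41)."""
    return MU0 * I * R**2 / (2.0 * (R**2 + z**2) ** 1.5)


def pair_field(z, R, I, d):
    """On-axis field of two identical loops centered at z = +d/2 and z = -d/2."""
    return loop_axis_field(z - d / 2.0, R, I) + loop_axis_field(z + d / 2.0, R, I)


def second_derivative(f, z, h=1e-3):
    """Central-difference estimate of f''(z)."""
    return (f(z + h) - 2.0 * f(z) + f(z - h)) / h**2


R, I = 1.0, 1.0
for d in (0.5 * R, 1.0 * R, 1.5 * R):
    curvature = second_derivative(lambda z: pair_field(z, R, I, d), 0.0)
    print(f"d = {d:.2f} R: B(0) = {pair_field(0.0, R, I, d):.4e} T, "
          f"B''(0) = {curvature:.3e} T/m^2")
```

Scanning d this way shows the midpoint curvature changing sign, which brackets the separation asked for in part (b); the analytic work is still needed to pin it down exactly.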


Problem 5.51. Use Eq. 5.41 to obtain the magnetic field on the axis of the rotating disk 
in Prob. 5.39a. Show that the dipole field (Eq. 5.88), with the dipole moment you found in 
Prob. 5.39, is a good approximation if z ≫ R. 


Problem 5.52. Suppose you wanted to find the field of a circular loop (Ex. 5.6) at a point r that 
is not directly above the center (Fig. 5.63). You might as well choose your axes so that r lies in 
the yz plane at (0, y, z). The source point is (R cos φ′, R sin φ′, 0), and φ′ runs from 0 
to 2π. Set up the integrals³³ from which you could calculate $B_x$, $B_y$, and $B_z$, and evaluate $B_x$ 
explicitly. 


Problem 5.53. Magnetostatics treats the “source current” (the one that sets up the field) and 
the “recipient current” (the one that experiences the force) so asymmetrically that it is by no 
means obvious that the magnetic force between two current loops is consistent with Newton's 
third law. Show, starting with the Biot-Savart law (Eq. 5.34) and the Lorentz force law (Eq. 5.16), 
that the force on loop 2 due to loop 1 (Fig. 5.64) can be written as 


Fig. 5.64 


33 These are elliptic integrals; see R. H. Good, Eur. J. Phys. 22, 119 (2001). For a numerical 
study see Prob. 5.68. 

$$\mathbf{F}_2 = -\frac{\mu_0}{4\pi}\,I_1 I_2 \oint\!\oint \frac{\hat{𝓇}}{𝓇^2}\; d\mathbf{l}_1\cdot d\mathbf{l}_2. \tag{5.91}$$


In this form, it is clear that F₂ = −F₁, since $\hat{𝓇}$ changes direction when the roles of 1 and 2 are 
interchanged. (If you seem to be getting an “extra” term, it will help to note that $d\mathbf{l}_2\cdot\hat{𝓇} = d𝓇$.) 


Problem 5.54. Consider a plane loop of wire that carries a steady current I; we want to 
calculate the magnetic field at a point in the plane. We might as well take that point to be the 
origin (it could be inside or outside the loop). The shape of the wire is given, in polar coordinates, 
by a specified function r(θ) (Fig. 5.65). 


Fig. 5.65 


(a) Show that the magnitude of the field is³⁴ 

$$B = \frac{\mu_0 I}{4\pi}\oint \frac{d\theta}{r}. \tag{5.92}$$

[Hint: Start with the Biot-Savart law; note that $𝓇 = -\mathbf{r}$, and $d\mathbf{l}\times\hat{\mathbf{r}}$ points perpendicular 
to the plane; show that $|d\mathbf{l}\times\hat{\mathbf{r}}| = dl\,\sin\phi = r\,d\theta$.] 

(b) Test this formula by calculating the field at the center of a circular loop. 

(c) The lituus spiral is defined by 

$$r(\theta) = \frac{a}{\sqrt{\theta}}, \qquad (0 < \theta \le 2\pi)$$

(for some constant a). Sketch this figure, and complete the loop with a straight segment 
along the x-axis. What is the magnetic field at the origin? 

(d) For a conic section with focus at the origin, 

$$r(\theta) = \frac{p}{1 + e\cos\theta},$$

where p is the semilatus rectum (the y-intercept) and e is the eccentricity (e = 0 
for a circle, 0 < e < 1 for an ellipse, e = 1 for a parabola). Show that the field (at the 
origin) is 

$$B = \frac{\mu_0 I}{2p},$$

regardless of the eccentricity.³⁵ 
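
Equation 5.92 is easy to test numerically for closed curves. The following Python sketch (an illustration under stated assumptions, not part of the text) approximates the closed integral of dθ/r by the midpoint rule and applies it to ellipses with the focus at the origin; the function names loop_integral, field_at_origin, and conic are hypothetical, and the values of p, e, and I are arbitrary.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, T·m/A


def loop_integral(r_of_theta, n=20000):
    """Midpoint-rule estimate of the integral of d(theta)/r(theta) over 0..2*pi."""
    h = 2.0 * math.pi / n
    return sum(h / r_of_theta((i + 0.5) * h) for i in range(n))


def field_at_origin(r_of_theta, current):
    """Magnitude of B at the origin from Eq. 5.92, for a closed planar loop r(theta)."""
    return MU0 * current / (4.0 * math.pi) * loop_integral(r_of_theta)


def conic(p, e):
    """Polar equation of a conic with focus at the origin (closed only for e < 1)."""
    return lambda theta: p / (1.0 + e * math.cos(theta))


p, current = 0.2, 3.0
for e in (0.0, 0.3, 0.6):
    print(f"e = {e}: B = {field_at_origin(conic(p, e), current):.6e} T")
```

The printed values can be compared with each other and with μ₀I/2p for the chosen p and I; the sketch only covers closed orbits (e < 1), since the parabola does not form a closed loop.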


34 J. A. Miranda, Am. J. Phys. 68, 254 (2000). 
35 C. Christodoulides, Am. J. Phys. 77, 1195 (2009). 

Problem 5.55. 


(a) One way to fill in the “missing link” in Fig. 5.51 is to exploit the analogy between the defining 
equations for A (viz. ∇ · A = 0, ∇ × A = B) and Maxwell's equations for B (viz. 
∇ · B = 0, ∇ × B = μ₀J). Evidently, A depends on B in exactly the same way that 
B depends on μ₀J (to wit, the Biot-Savart law). Use this observation to write down the 
formula for A in terms of B. 

(b) The electrical analog to your result in (a) is 

$$V(\mathbf{r}) = -\frac{1}{4\pi}\int \frac{\mathbf{E}(\mathbf{r}')\cdot\hat{𝓇}}{𝓇^2}\,d\tau'.$$

Derive it, by exploiting the appropriate analogy. 


! Problem 5.56. Another way to fill in the “missing link” in Fig. 5.51 is to look for a magnetostatic 
analog to Eq. 2.21. The obvious candidate would be 

$$\mathbf{A}(\mathbf{r}) = \int_{\mathcal{O}}^{\mathbf{r}} (\mathbf{B}\times d\mathbf{l}).$$

(a) Test this formula for the simplest possible case: uniform B (use the origin as your reference 
point). Is the result consistent with Prob. 5.27? You could cure this problem by throwing in 
a factor of ½, but the flaw in this equation runs deeper. 

(b) Show that ∫ (B × dl) is not independent of path, by calculating ∮ (B × dl) around the 
rectangular loop in Fig. 5.66. 
As far as I know,³⁶ the best one can do along these lines is the pair of equations 

$$\text{(i)}\quad V(\mathbf{r}) = -\mathbf{r}\cdot\int_0^1 \mathbf{E}(\lambda\mathbf{r})\,d\lambda,$$

$$\text{(ii)}\quad \mathbf{A}(\mathbf{r}) = -\mathbf{r}\times\int_0^1 \lambda\,\mathbf{B}(\lambda\mathbf{r})\,d\lambda.$$

[Equation (i) amounts to selecting a radial path for the integral in Eq. 2.21, while (ii) 
constitutes a more “symmetrical” solution to Prob. 5.33.] 
(c) Use (ii) to find the vector potential for uniform B. 


Fig. 5.66 (rectangular loop; the horizontal side is labeled w) 


36 R. L. Bishop and S. I. Goldberg, Tensor Analysis on Manifolds, Macmillan, New York 
(1968), Sect. 4.5. This is known as the Poincaré gauge (see, for instance, J. D. Jackson, Am. 
J. Phys. 70, 917 (2002)). 

(d) Use (ii) to find the vector potential of an infinite straight wire carrying a steady current I. 
Does (ii) automatically satisfy ∇ · A = 0? [Answer: $(\mu_0 I/2\pi s)(z\,\hat{\mathbf{s}} - s\,\hat{\mathbf{z}})$.] 


Problem 5.57. 


(a) Construct the scalar potential U (r) (Eq. 5.67) for an ideal magnetic dipole m. 

(b) Construct a scalar potential for the spinning spherical shell (Ex. 5.11). [Hint: For r > R this 
is a pure dipole field, as you can see by comparing Eqs. 5.69 and 5.87.] 

(c) Try doing the same for the interior of a solid spinning sphere. [Hint: If you solved Prob. 5.32, 
you already know the field; set it equal to −∇U, and solve for U. What's the trouble?] 


Problem 5.58. Just as ∇ · B = 0 allows us to express B as the curl of a vector potential 
(B = ∇ × A), so ∇ · A = 0 permits us to write A itself as the curl of a “higher” potential: 
A = ∇ × W. (And this hierarchy can be extended ad infinitum.) 


(a) Find the general formula for W (as an integral over B), which holds when B → 0 at ∞. 
(b) Determine W for the case of a uniform magnetic field B. [Hint: See Prob. 5.27.] 
(c) Find W inside and outside an infinite solenoid. [Hint: See Ex. 5.12.] 


Problem 5.59. Prove the following uniqueness theorem: If the current density J is specified 
throughout a volume 𝒱, and either the potential A or the magnetic field B is specified on the 
surface 𝒮 bounding 𝒱, then the magnetic field itself is uniquely determined throughout 𝒱. 
[Hint: First use the divergence theorem to show that 

$$\int \left\{(\nabla\times\mathbf{U})\cdot(\nabla\times\mathbf{V}) - \mathbf{U}\cdot[\nabla\times(\nabla\times\mathbf{V})]\right\}d\tau = \oint [\mathbf{U}\times(\nabla\times\mathbf{V})]\cdot d\mathbf{a}$$

for arbitrary vector functions U and V.] 


Problem 5.60. A magnetic dipole $\mathbf{m} = -m_0\,\hat{\mathbf{z}}$ is situated at the origin, in an otherwise 
uniform magnetic field $\mathbf{B} = B_0\,\hat{\mathbf{z}}$. Show that there exists a spherical surface, centered at the 
origin, through which no magnetic field lines pass. Find the radius of this sphere, and sketch the 
field lines, inside and out. 


Problem 5.61. A thin uniform donut, carrying charge Q and mass M, rotates about its axis 
as shown in Fig. 5.67. 


(a) Find the ratio of its magnetic dipole moment to its angular momentum. This is called the 
gyromagnetic ratio (or magnetomechanical ratio). 


Fig. 5.67 

(b) What is the gyromagnetic ratio for a uniform spinning sphere? [This requires no new 
calculation; simply decompose the sphere into infinitesimal rings, and apply the result of 
part (a).] 

(c) According to quantum mechanics, the angular momentum of a spinning electron is ½ħ, 
where ħ is Planck's constant. What, then, is the electron's magnetic dipole moment, in 
A · m²? [This semiclassical value is actually off by a factor of almost exactly 2. Dirac's 
relativistic electron theory got the 2 right, and Feynman, Schwinger, and Tomonaga later 
calculated tiny further corrections. The determination of the electron's magnetic dipole 
moment remains the finest achievement of quantum electrodynamics, and exhibits perhaps 
the most stunningly precise agreement between theory and experiment in all of 
physics. Incidentally, the quantity (eħ/2m), where e is the charge of the electron and m is 
its mass, is called the Bohr magneton.] 


Problem 5.62. 


(a) Prove that the average magnetic field, over a sphere of radius R, due to steady currents 
inside the sphere, is 

$$\mathbf{B}_{\rm ave} = \frac{\mu_0}{4\pi}\frac{2\mathbf{m}}{R^3}, \tag{5.93}$$

where m is the total dipole moment of the sphere. Contrast the electrostatic result, 
Eq. 3.105. [This is tough, so I'll give you a start: 

$$\mathbf{B}_{\rm ave} = \frac{1}{\frac{4}{3}\pi R^3}\int \mathbf{B}\,d\tau.$$

Write B as (∇ × A), and apply Prob. 1.61b. Now put in Eq. 5.65, and do the surface integral 
first, showing that 

$$\oint \frac{1}{𝓇}\,d\mathbf{a} = \frac{4}{3}\pi\,\mathbf{r}'$$

(see Fig. 5.68). Use Eq. 5.90, if you like.] 
(b) Show that the average magnetic field due to steady currents outside the sphere is the same 
as the field they produce at the center. 


Fig. 5.68 


Problem 5.63. A uniformly charged solid sphere of radius R carries a total charge Q, and is 
set spinning with angular velocity w about the z-axis. 


(a) What is the magnetic dipole moment of the sphere? 
(b) Find the average magnetic field within the sphere (see Prob. 5.62). 
(c) Find the approximate vector potential at a point (r, θ), where r ≫ R. 

(d) Find the exact potential at a point (r, θ) outside the sphere, and check that it is consistent 
with (c). [Hint: Refer to Ex. 5.11.] 

(e) Find the magnetic field at a point (r, θ) inside the sphere (Prob. 5.32), and check that it is 
consistent with (b). 


Problem 5.64. Using Eq. 5.88, calculate the average magnetic field of a dipole over a sphere 
of radius R centered at the origin. Do the angular integrals first. Compare your answer with 
the general theorem in Prob. 5.62. Explain the discrepancy, and indicate how Eq. 5.89 can be 
corrected to resolve the ambiguity at r = 0. (If you get stuck, refer to Prob. 3.52.) Evidently, the 
true field of a magnetic dipole is³⁷ 

$$\mathbf{B}_{\rm dip}(\mathbf{r}) = \frac{\mu_0}{4\pi}\frac{1}{r^3}\left[3(\mathbf{m}\cdot\hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{m}\right] + \frac{2\mu_0}{3}\,\mathbf{m}\,\delta^3(\mathbf{r}). \tag{5.94}$$

Compare the electrostatic analog, Eq. 3.106. 


Problem 5.65. A thin glass rod of radius R and length L carries a uniform surface charge σ. 
It is set spinning about its axis, at an angular velocity ω. Find the magnetic field at a distance 
s ≫ R from the axis, in the xy plane (Fig. 5.69). [Hint: Treat it as a stack of magnetic dipoles. 
Answer: $\mu_0\omega\sigma L R^3 / 4[s^2 + (L/2)^2]^{3/2}$] 


Fig. 5.69 (rod along the z-axis, extending L/2 above and below the xy plane) 


Problem 5.66. As we have seen in Ex. 5.2 and Probs. 5.16, 5.46, and 5.48, the Lorentz force law 
can produce bizarre particle trajectories, even in relatively simple magnetic fields. Analytically 
soluble cases, though, are few and far between, and to explore further we must resort to 
numerical methods. An algorithm particularly well suited to motion in magnetostatic fields is 
magnetic velocity Verlet,³⁸ which can be implemented in Mathematica using the following 
code: 


37 The delta-function term is responsible for the hyperfine splitting in atomic spectra; see, for 
example, D. J. Griffiths, Am. J. Phys. 50, 698 (1982). 

38 A. Chambliss and J. Franklin, Am. J. Phys. 88, 1075 (2020). 

MVVerlet[x0_, v0_, m_, q_, dt_, n_, BF_] := 
  Module[{X, V, alpha, x, v, d, C, j}, 
    X = Table[0., {i, 1, n}];          (* list of positions *) 
    X[[1]] = x0; 
    V = Table[0., {i, 1, n}];          (* list of velocities *) 
    V[[1]] = v0; 
    x = x0; 
    v = v0; 
    alpha = q dt/(2 m); 
    For[j = 2, j <= n, j = j + 1, 
      d = v + alpha Cross[v, BF[x]];   (* half-step velocity *) 
      x = x + d dt; 
      C = BF[x];                       (* field at the new position *) 
      v = (d + alpha Cross[d, C] 
            + alpha^2 C (d . C))/(1 + alpha^2 C . C); 
      X[[j]] = x; 
      V[[j]] = v; 
    ]; 
    Return[{X, V}]; 
  ] 


and 


ShowTrajectory[pts_] := Show[Graphics3D[Table[Line[ 
    {pts[[jj]], pts[[jj + 1]]}], {jj, 1, Length[pts] - 1}]]] 


MVVerlet takes as input parameters 

x0 = {x(0), y(0), z(0)} (the initial position), 
v0 = {vx(0), vy(0), vz(0)} (the initial velocity), 
m (the mass of the particle, typically set to 1), 
q (the charge of the particle, typically set to 1), 
dt (the time step, small), 
n (the number of iterations, large). 


First you specify the magnetic field: 

BF[{x_, y_, z_}] := {Bx(x, y, z), By(x, y, z), Bz(x, y, z)}; 

then type 

{X, V} = MVVerlet[x0, v0, m, q, dt, n, BF]; 

(inserting your chosen parameters), and finally plot the trajectory by invoking 

ShowTrajectory[X] 


(a) Test it, for a uniform magnetic field (you might start with dt = 0.005, n = 5000, 
x0 = {1,0,0}, v0 = {0,1,0.1}, BF[{x_,y_,z_}] := {0,0,1}, but to get a feel 
for how it works, try varying the parameters). 

(b) Check that the speed is constant, using 

vsquared = Table[V[[jj]].V[[jj]], {jj, 1, Length[V]}]; 
ListPlot[vsquared] 

(c) Now apply it to the case of a long straight steady current (Prob. 5.41). First reproduce the 
helical motion predicted in Prob. 5.41d. You might as well set μ₀I/(2π) = 1, x0 = 
{1,0,0}, v0 = {0,.1,vz}, but you have to pick vz just right .... 

(d) Try other initial conditions to explore the rich variety of motions this field admits.³⁹ Here's a 
particularly striking one: x0 = {1,0,0}, v0 = {.05,.05,0}. 
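
For readers working outside Mathematica, the same update rule can be sketched in plain Python. This is a translation under stated assumptions, not the text's code: the names mv_verlet, cross, dot, and wire_field are hypothetical, vectors are plain tuples, and the wire field is written in units where μ₀I/(2π) = 1 with the current along z. Each step forms the half-step velocity d, advances the position, and then solves v − α v × B = d in closed form for the new velocity.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3-vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def mv_verlet(x0, v0, m, q, dt, n, field):
    """Magnetic velocity Verlet: returns lists of n positions and n velocities."""
    alpha = q * dt / (2.0 * m)
    x, v = tuple(x0), tuple(v0)
    xs, vs = [x], [v]
    for _ in range(n - 1):
        kick = cross(v, field(x))
        d = tuple(v[i] + alpha * kick[i] for i in range(3))      # half-step velocity
        x = tuple(x[i] + d[i] * dt for i in range(3))            # full position step
        b = field(x)                                             # field at new position
        dxb, db = cross(d, b), dot(d, b)
        denom = 1.0 + alpha * alpha * dot(b, b)
        v = tuple((d[i] + alpha * dxb[i] + alpha * alpha * db * b[i]) / denom
                  for i in range(3))
        xs.append(x)
        vs.append(v)
    return xs, vs


def wire_field(x):
    """Field of a long straight wire on the z axis, with mu0 I/(2 pi) = 1."""
    s2 = x[0] ** 2 + x[1] ** 2
    return (-x[1] / s2, x[0] / s2, 0.0)


# Uniform-field check, with the parameters suggested in part (a).
xs, vs = mv_verlet((1.0, 0.0, 0.0), (0.0, 1.0, 0.1), 1.0, 1.0, 0.005, 5000,
                   lambda x: (0.0, 0.0, 1.0))
speeds_sq = [dot(v, v) for v in vs]
print("spread in v^2:", max(speeds_sq) - min(speeds_sq))
```

Passing wire_field in place of the uniform field gives a starting point for parts (c) and (d); plotting is left to whatever library is at hand.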


Problem 5.67. Imagine an infinite wire running parallel to the y-axis, at x = a and z = 0, 
and a second wire running parallel to the z-axis, at x = −a and y = 0. Each wire carries a 
steady current I. In addition there is a constant magnetic field B₀.⁴⁰ 


(a) What is the net magnetic field, B(x, y, z)? (Leave it as the sum of three terms.) 

(b) Use the method of Prob. 2.66 to draw the magnetic field line starting at (0.4, 0, 0). Begin 
with tmax = 5; then try 75 and 150. Note that this is just a snippet of a single 
field line! [Set a = 1, μ₀I/2π = 1, B₀ = −(0.1, 0.1, 0.1).] 


Problem 5.68. In Ex. 5.6 we used the Biot-Savart law to calculate the magnetic field on the 
axis of a circular steady current loop of radius R. For points off the axis (Prob. 5.52) the field is 
more complicated;⁴¹ in cylindrical coordinates, 

$$B_z(s,\phi,z) = \frac{\mu_0 I}{2\pi}\,\frac{1}{\sqrt{(R+s)^2+z^2}}\left[\frac{R^2-s^2-z^2}{(R-s)^2+z^2}\,E(k) + K(k)\right],$$

$$B_s(s,\phi,z) = \frac{\mu_0 I}{2\pi}\,\frac{z}{s\sqrt{(R+s)^2+z^2}}\left[\frac{R^2+s^2+z^2}{(R-s)^2+z^2}\,E(k) - K(k)\right],$$

where K(k) is the complete elliptic integral of the first kind (EllipticK[k], in Mathematica), 
E(k) is the complete elliptic integral of the second kind (EllipticE[k]), and 

$$k \equiv \frac{4Rs}{(R+s)^2+z^2}.$$

(a) Using the method of Prob. 2.66, plot the field line starting at (3R/4) ŷ. (Set μ₀I/2π = 1 
and R = 1; let t run from 0 to 10.) 

(b) Now introduce a current 0.1I along the z-axis, and plot the field line for the combined 
field. This time let t run to 200. Note that this is a single field line; eventually it would fill 
in the entire “toroidal” surface. 
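
As a cross-check on the closed-form expressions, one can compare them with a direct numerical Biot-Savart integral around the loop. The Python sketch below is illustrative and makes several assumptions: k is treated as the parameter (the argument Mathematica's EllipticK expects), the elliptic integrals are computed with the arithmetic-geometric mean rather than a library call, the constant c stands for μ₀I/2π, and all function names are hypothetical.

```python
import math


def ellip_ke(m):
    """Complete elliptic integrals K and E for parameter m (0 <= m < 1), via the AGM."""
    a, b, c = 1.0, math.sqrt(1.0 - m), math.sqrt(m)
    total, power = 0.5 * c * c, 0.5        # running sum of 2^(n-1) c_n^2
    while abs(c) > 1e-15:
        a, b, c = 0.5 * (a + b), math.sqrt(a * b), 0.5 * (a - b)
        power *= 2.0
        total += power * c * c
    K = math.pi / (2.0 * a)
    return K, K * (1.0 - total)


def loop_field(s, z, R=1.0, c=1.0):
    """(B_s, B_z) from the elliptic-integral formulas; c = mu0 I/(2 pi), s > 0."""
    plus = (R + s) ** 2 + z ** 2
    minus = (R - s) ** 2 + z ** 2
    K, E = ellip_ke(4.0 * R * s / plus)
    pref = c / math.sqrt(plus)
    Bz = pref * ((R * R - s * s - z * z) / minus * E + K)
    Bs = pref * (z / s) * ((R * R + s * s + z * z) / minus * E - K)
    return Bs, Bz


def loop_field_direct(s, z, R=1.0, c=1.0, n=4000):
    """(B_s, B_z) at (s, 0, z) by midpoint-rule integration of the Biot-Savart law."""
    h = 2.0 * math.pi / n
    Bs = Bz = 0.0
    for i in range(n):
        phi = (i + 0.5) * h
        sep3 = (s * s + R * R - 2.0 * s * R * math.cos(phi) + z * z) ** 1.5
        Bs += R * z * math.cos(phi) / sep3 * h
        Bz += R * (R - s * math.cos(phi)) / sep3 * h
    return 0.5 * c * Bs, 0.5 * c * Bz


print(loop_field(0.5, 0.3))
print(loop_field_direct(0.5, 0.3))
```

Agreement between the two printed pairs at a few off-axis points is a useful sanity test before feeding the closed-form field into a field-line integrator.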


Problem 5.69. In Prob. 5.48 you studied the motion of an electric charge $q_e$ in the field of a 
stationary magnetic monopole $q_m$, and found that it moves on the surface of a cone of opening 
angle θ (measured from the axis), with the monopole at the vertex. In spherical coordinates 
(r, θ, φ), 


39 J. Franklin, D. J. Griffiths, and N. Mann, Am. J. Phys. 90, 513 (2022). 

40 M. Hosada et al., Phys. Rev. E 80, 067202 (2009); M. Lieberherr, footnote 21. 

41 W. R. Smythe, Static and Dynamic Electricity, 3rd ed., Hemisphere, New York (1989), p. 
291. 

$$r(\phi) = \frac{A}{\cos[(\phi-\phi_0)\sin\theta]}, \qquad \text{where} \quad A \equiv \left|\frac{\mu_0 q_e q_m}{4\pi m v}\tan\theta\right|$$

(m is the mass of the particle and v is its speed). 


(a) Plot the trajectory, using Mathematica’s ParametricPlot3D: 


ParametricPlot3D[{x[phi], y[phi], z[phi]}, {phi, -phi0, phi0}] 


(use A = 1, sin 0 = 0.05, and dy = 972). 

(b) Match the plot in (a) using magnetic velocity Verlet (Prob. 5.66). You can set m = 1, $q_e$ = 1, 
and v = 1, but you'll need to figure out the initial position and velocity. 

(c) Notice how the charge approaches the monopole and then bounces back as the field lines 
converge. Of course, magnetic monopoles are (presumably) mythical, but the same “mirror” 
effect⁴² occurs in the dipole field of the Earth, causing electrons to bounce back and forth 
between the north and south (magnetic) poles and producing the “whistlers” that used to 
irritate (and mystify) amateur radio operators. Use the method of part (b) and the field of 
a magnetic dipole ($\mathbf{m} = m\,\hat{\mathbf{z}}$) to explore this phenomenon. Let μ₀m/4π = 100, and 
start with, say, x0 = {1.,0.,0.}, v0 = {0.,1.5,-3}. You'll discover 
a curious twist to the story: the electron does not retrace its path, but gradually circles around the 
globe.⁴³ 


42 See A. Chambliss and J. Franklin, footnote 38. 

43 This is an example of Størmer's problem, the study of charged particle motion in the 
magnetic field of the Earth. C. Størmer, The Polar Aurora, Clarendon Press, Oxford (1955); 
Rossi and Olbert, footnote 32. 

6.1 Magnetization 


6.1.1 Diamagnets, Paramagnets, Ferromagnets 


If you ask the average person what “magnetism” is, you will probably 
be told about refrigerator decorations, compass needles, and the North 
Pole — none of which has any obvious connection with moving charges or 
current-carrying wires. And yet, in classical electrodynamics all magnetic 
phenomena are due to electric charges in motion; if you could examine 
a piece of magnetic material on an atomic scale you would find tiny 
currents: electrons orbiting around nuclei and spinning about their axes.¹ 
For macroscopic purposes, these current loops are so small that we may 
treat them as magnetic dipoles. Ordinarily, they cancel each other out 
because of the random orientation of the atoms. But when a magnetic field 
is applied, a net alignment of these dipoles occurs, and the medium becomes 
magnetically polarized, or magnetized. 

Unlike electric polarization, which is almost always in the same direction 
as E, some materials acquire a magnetization parallel to B (paramagnets) 
and some opposite to B (diamagnets). A few substances (called ferro- 
magnets, in deference to the most common example, iron) retain their 
magnetization even after the external field has been removed — for these, 
the magnetization is not determined by the present field but by the whole 
magnetic “history” of the object. Permanent magnets made of iron are 
the most familiar examples of magnetism, but from a theoretical point of 
view they are the most complicated; I'll save ferromagnetism for the end 
of the chapter, and begin with qualitative models of paramagnetism and 
diamagnetism. 


1 That's the classical picture; quantum mechanics tells a quite different story: the magnetic 
dipole moment associated with the electron's intrinsic spin is localized at a point and fixed in 
magnitude. It is notoriously misleading to model it as a tiny spinning sphere; for one thing, a 
point on the “equator” would be traveling much faster than the speed of light (Prob. 8.19d). 
You can't speed it up (or slow it down) by applying a torque. Electrons are subject to the Pauli 
exclusion principle and the quantization of energy and angular momentum. These radically 
nonclassical objects are essential to a proper understanding of magnetism in matter. But this 
is a book on classical electrodynamics, in which such considerations have no place. I'll do 
my best to provide a plausible account of the mechanisms involved, but remember that the 
resulting theory is unavoidably incomplete. 

6.1.2 Torques and Forces on Magnetic Dipoles 


A magnetic dipole experiences a torque in a magnetic field, just as an electric 
dipole does in an electric field. Let’s calculate the torque on a rectangular 
current loop in a uniform field B. (Since any current loop could be built 
up from infinitesimal rectangles, with all the “internal” sides canceling, as 
suggested in Fig. 6.1, there is no real loss of generality here; but if you 
prefer to start from scratch with an arbitrary shape, see Prob. 6.2.) Center 
the loop at the origin, and tilt it an angle θ from the z-axis toward the y-axis 
(Fig. 6.2). Let B point in the z direction. The forces on the two sloping sides 
cancel (they tend to stretch the loop, but they don’t rotate it). The forces on 
the “horizontal” sides are likewise equal and opposite, so the net force on 
the loop is zero, but they do generate a torque: 


$$\mathbf{N} = aF\sin\theta\,\hat{\mathbf{x}}.$$

The magnitude of the force on each of these segments is 

$$F = IbB,$$

and therefore 

$$\mathbf{N} = IabB\sin\theta\,\hat{\mathbf{x}} = mB\sin\theta\,\hat{\mathbf{x}},$$

or 

$$\mathbf{N} = \mathbf{m}\times\mathbf{B}, \tag{6.1}$$

where m = Iab is the magnetic dipole moment of the loop. Equation 6.1 
gives the torque on any localized current distribution, in the presence of a 
uniform field; in a nonuniform field it is the exact torque (about the center) 
for a perfect dipole of infinitesimal size. 
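
The side-by-side bookkeeping above is easy to reproduce numerically. The Python sketch below is an illustration, not the text's derivation: it adds up the Lorentz force I L × B on each of the four straight sides of the tilted rectangle, together with the torque of that force about the center, and compares the total with m × B. The function names and sample numbers are hypothetical, and the loop normal is assumed to point along (0, sin θ, cos θ), with the sides of length b parallel to x.

```python
import math


def cross(u, v):
    """Cross product of two 3-vectors given as tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dipole_moment(a, b, theta, current):
    """m = I a b n, with the normal n tilted by theta from z toward y."""
    return (0.0, current * a * b * math.sin(theta), current * a * b * math.cos(theta))


def loop_torque(a, b, theta, current, B):
    """Net force and torque (about the center) on the loop in a uniform field B."""
    xhat = (1.0, 0.0, 0.0)
    uhat = (0.0, math.cos(theta), -math.sin(theta))   # in-plane sloping direction

    def point(cx, cu):
        return tuple(cx * xhat[i] + cu * uhat[i] for i in range(3))

    # Corners ordered so the current circulates right-handedly about the normal.
    corners = [point(-b / 2, -a / 2), point(b / 2, -a / 2),
               point(b / 2, a / 2), point(-b / 2, a / 2)]
    force, torque = [0.0] * 3, [0.0] * 3
    for i in range(4):
        p, q = corners[i], corners[(i + 1) % 4]
        side = tuple(q[k] - p[k] for k in range(3))
        mid = tuple(0.5 * (q[k] + p[k]) for k in range(3))
        f = tuple(current * c for c in cross(side, B))  # F = I L x B on this side
        n = cross(mid, f)                               # uniform force acts at midpoint
        for k in range(3):
            force[k] += f[k]
            torque[k] += n[k]
    return tuple(force), tuple(torque)


a, b, theta, current = 0.2, 0.3, 0.7, 2.0
B = (0.0, 0.0, 1.5)
force, torque = loop_torque(a, b, theta, current, B)
print("net force :", force)
print("net torque:", torque)
print("m x B     :", cross(dipole_moment(a, b, theta, current), B))
```

With B along z the only surviving torque component is along x, as in the derivation; trying a field with x and y components as well shows that the agreement with m × B does not depend on that choice.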

Notice that Eq. 6.1 is identical in form to the electrical analog, Eq. 4.4: 
N = p × E. In particular, the torque is again in such a direction as to 
line the dipole up parallel to the field. It is this torque that accounts 
for paramagnetism. Since every electron constitutes a magnetic dipole — 
picture it, if you like (just don’t tell anybody), as a tiny spinning charged 
sphere — you might expect paramagnetism to be a universal phenomenon. 
Actually, quantum mechanics (specifically, the Pauli exclusion principle) 
tends to lock the electrons within a given atom together in pairs with 
opposing spins,² and this effectively neutralizes the torque on the 
combination. As a result, paramagnetism most often occurs in atoms 
or molecules with an odd number of electrons, where the “extra” unpaired 
member is subject to the magnetic torque. Even here, the alignment 
is far from complete, since random thermal collisions tend to upset 
the order. 

In a uniform field, the net force on a current loop is zero: 


$$\mathbf{F} = I\oint (d\mathbf{l}\times\mathbf{B}) = I\left(\oint d\mathbf{l}\right)\times\mathbf{B} = \mathbf{0};$$


the constant B comes outside the integral, and the net displacement ∮ dl 
around a closed loop vanishes. In a nonuniform field this is no longer the 
case. For example, suppose a circular wire ring of radius R, carrying a 
current I, is suspended above a short solenoid in the “fringing” region 
(Fig. 6.3). Here B has a radial component, and there is a net downward 
force on the loop (Fig. 6.4): 


$$F = 2\pi I R B\cos\theta. \tag{6.2}$$
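
A brief sketch of where Eq. 6.2 comes from, assuming (as the cos θ suggests) that θ is the angle between B and the plane of the ring at the wire, so that the radial component there is B cos θ, and that B has the same magnitude all the way around by symmetry. The vertical component of B pushes each element radially, and those forces cancel in pairs around the ring; only the radial component contributes to the net force:

```latex
d\mathbf{F} = I\,d\mathbf{l}\times\mathbf{B},
\qquad
|dF_{\text{vertical}}| = I\,dl\,B_{\text{radial}} = I\,dl\,B\cos\theta
\quad\Longrightarrow\quad
F = \oint I B\cos\theta\,dl = 2\pi R\,I B\cos\theta .
```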


2 This is not always true for the outermost electrons in unfilled shells. 

Fig. 6.5 (a) Magnetic dipole (Gilbert model). (b) Electric dipole. (c) Magnetic dipole (Ampère model). 


For an infinitesimal loop, with dipole moment m, in a field B, the force is 


$$\mathbf{F} = \nabla(\mathbf{m}\cdot\mathbf{B}), \tag{6.3}$$


as you'll see in Prob. 6.4. Once again the magnetic formula is identical 
to its electrical “twin,” if we write the latter in the form F = ∇(p · E).³ 
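
To see how Eq. 6.3 relates to the other familiar form of the electric result, F = (p · ∇)E, it may help to expand the gradient with the product rule. The following is a short sketch that assumes m is a constant vector (it is not differentiated) and uses Ampère's law for steady currents, ∇ × B = μ₀J:

```latex
\nabla(\mathbf{m}\cdot\mathbf{B})
  = \mathbf{m}\times(\nabla\times\mathbf{B}) + (\mathbf{m}\cdot\nabla)\mathbf{B}
  = \mu_0\,\mathbf{m}\times\mathbf{J} + (\mathbf{m}\cdot\nabla)\mathbf{B}.
```

In electrostatics the corresponding curl term vanishes (∇ × E = 0), so the two electric forms agree; in the magnetic case they differ wherever an external current density J is present at the location of the dipole.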

If you're starting to get a sense of déjà vu, perhaps you will have more 
respect for those early physicists who thought magnetic dipoles consisted 
of positive and negative magnetic “charges” (north and south “poles,” 
they called them), separated by a small distance, just like electric dipoles 
(Fig. 6.5a). They wrote down a “Coulomb’s law” for the attraction and 
repulsion of these poles, and developed the whole of magnetostatics in 
exact analogy to electrostatics. It’s not a bad model, for many purposes — 
it gives the correct field of a dipole (at least, away from the origin), the 
right torque on a dipole (at least, on a stationary dipole), and the proper 
force on a dipole (at least, in the absence of external currents). But it’s 
bad physics, because there’s no such thing as a single magnetic north pole 
or south pole. If you break a bar magnet in half, you don’t get a north 
pole in one hand and a south pole in the other; you get two complete 
magnets. Magnetism is not due to magnetic monopoles, but rather to moving 
electric charges; magnetic dipoles are tiny current loops (Fig. 6.5c), and 
it’s an extraordinary thing, really, that the formulas involving m bear any 
resemblance to the corresponding formulas for p. Sometimes it is easier 
to think in terms of the Gilbert model of a magnetic dipole (separated 
monopoles), instead of the physically correct Ampère model (current loop). 
Indeed, this picture occasionally offers a quick and clever solution to an 
otherwise cumbersome problem (you just copy the corresponding result 
from electrostatics, changing p to m, 1/ε₀ to μ₀, and E to B). But whenever 
the close-up features of the dipole come into play, the two models can 
yield strikingly different answers. My advice is to use the Gilbert model, 
if you like, to get an intuitive “feel” for a problem, but never rely on it for 
quantitative results. 


3 In the spirit of the footnote to Eq. 4.5, it would be safer to write Eq. 6.3 as F = m × (∇ × 
B) + (m · ∇)B, since m is not to be differentiated here. 

Problem 6.1. Calculate the torque exerted on the square loop shown in Fig. 6.6, due to the 
circular loop (assume r is much larger than a or b). If the square loop is free to rotate, what will 
its equilibrium orientation be? 


Problem 6.2. Starting from the Lorentz force law, in the form of Eq. 5.16, show that the torque 
on any steady current distribution (not just a square loop) in a uniform field B is m × B. 


Problem 6.3. Find the force of attraction between two magnetic dipoles, $\mathbf{m}_1$ and $\mathbf{m}_2$, oriented
as shown in Fig. 6.7, a distance r apart, (a) using Eq. 6.2 and (b) using Eq. 6.3.


[Figs. 6.7 and 6.8]


Problem 6.4. Derive Eq. 6.3. [Here's one way to do it: Assume the dipole is an infinitesimal
square, of side $\epsilon$ (if it's not, chop it up into squares, and apply the argument to each one). Choose
axes as shown in Fig. 6.8, and calculate $\mathbf{F} = I \oint (d\mathbf{l} \times \mathbf{B})$ along each of the four sides. Expand
B in a Taylor series; on the right-hand side, for instance,

$$\mathbf{B} = \mathbf{B}(0, \epsilon, z) \cong \mathbf{B}(0, 0, z) + \epsilon \left.\frac{\partial \mathbf{B}}{\partial y}\right|_{(0,0,z)}.$$

For more sophisticated methods, see Prob. 6.22 or Prob. 6.23.]


Problem 6.5. A uniform current density $\mathbf{J} = J_0\hat{\mathbf{z}}$ fills a slab straddling the yz plane, from
$x = -a$ to $x = +a$. A magnetic dipole $\mathbf{m} = m_0\hat{\mathbf{x}}$ is situated at the origin.


(a) Find the force on the dipole, using Eq. 6.3.

(b) Do the same for a dipole pointing in the y direction: $\mathbf{m} = m_0\hat{\mathbf{y}}$.

(c) In the electrostatic case, the expressions $\mathbf{F} = \nabla(\mathbf{p} \cdot \mathbf{E})$ and $\mathbf{F} = (\mathbf{p} \cdot \nabla)\mathbf{E}$ are equivalent
(prove it), but this is not the case for the magnetic analogs (explain why). As an example,
calculate $(\mathbf{m} \cdot \nabla)\mathbf{B}$ for the configurations in (a) and (b).


6.1.3 Effect of a Magnetic Field on Atomic Orbits


Electrons not only spin, they also revolve around the nucleus — for simplicity, 
let’s assume the orbit is a circle of radius R (Fig. 6.9). Although technically 
this orbital motion does not constitute a steady current, in practice the 
period $T = 2\pi R/v$ is so short that unless you blink awfully fast, it’s going
to look like a steady current:

$$I = -\frac{e}{T} = -\frac{ev}{2\pi R}.$$

(The minus sign accounts for the negative charge of the electron.) Accordingly,
the orbital dipole moment ($I\pi R^2$) is

$$\mathbf{m} = -\frac{1}{2} evR\,\hat{\mathbf{z}}. \tag{6.4}$$


Like any other magnetic dipole, this one is subject to a torque (m x B) 
when you turn on a magnetic field. But it’s a lot harder to tilt the entire 
orbit than it is the spin, so the orbital contribution to paramagnetism is 
small. There is, however, a more significant effect on the orbital motion: The 
electron speeds up or slows down, depending on the orientation of B. For 
whereas the centripetal acceleration $v^2/R$ is ordinarily sustained by electrical
forces alone,⁴

$$\frac{1}{4\pi\epsilon_0}\frac{e^2}{R^2} = m_e\frac{v^2}{R}, \tag{6.5}$$

in the presence of a magnetic field there is an additional force, $-e(\mathbf{v} \times \mathbf{B})$.
For the sake of argument, let’s say that B is perpendicular to the plane of
the orbit, as shown in Fig. 6.10; then

$$\frac{1}{4\pi\epsilon_0}\frac{e^2}{R^2} + e\bar{v}B = m_e\frac{\bar{v}^2}{R}. \tag{6.6}$$

Under these conditions, the new speed $\bar{v}$ is greater than v:

$$e\bar{v}B = \frac{m_e}{R}(\bar{v}^2 - v^2) = \frac{m_e}{R}(\bar{v} + v)(\bar{v} - v),$$


4 To avoid confusion with the magnetic dipole moment m, I’ll write the electron mass with
subscript: $m_e$.

or, assuming the change $\Delta v = \bar{v} - v$ is small,

$$\Delta v = \frac{eRB}{2m_e}. \tag{6.7}$$

When B is turned on, then, the electron speeds up.⁵
A change in orbital speed means a change in the dipole moment
(Eq. 6.4):

$$\Delta\mathbf{m} = -\frac{1}{2} e(\Delta v)R\,\hat{\mathbf{z}} = -\frac{e^2R^2}{4m_e}\mathbf{B}. \tag{6.8}$$


Notice that the change in m is opposite to the direction of B. (An electron 
circling the other way would have a dipole moment pointing upward, 
but such an orbit would be slowed down by the field, so the change is 
still opposite to B.) Ordinarily, the electron orbits are randomly oriented, 
and the orbital dipole moments cancel out. But in the presence of a 
magnetic field, each atom picks up a little “extra” dipole moment, and 
these increments are all antiparallel to the field. This is the mechanism 
responsible for diamagnetism. It is a universal phenomenon, affecting all 
atoms. However, it is typically much weaker than paramagnetism, and is 
therefore observed mainly in atoms with even numbers of electrons, where 
paramagnetism is usually absent. 
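
To get a feel for the sizes involved, here is a rough Python sketch that evaluates Eqs. 6.7 and 6.8 and compares the small-change estimate with the exact root of the quadratic obtained by subtracting Eq. 6.5 from Eq. 6.6. The orbit radius, speed, and field strength are assumed, Bohr-like illustrative numbers; the function names are my own.

```python
import math

E_CHARGE = 1.602176634e-19   # magnitude of the electron charge (C)
M_E = 9.1093837e-31          # electron mass (kg)

def delta_v_approx(radius, b_field):
    """Eq. 6.7: small-change estimate of the speed increase, eRB / (2 m_e)."""
    return E_CHARGE * radius * b_field / (2.0 * M_E)

def delta_v_exact(radius, b_field, v):
    """Solve vbar^2 - (eBR/m_e) vbar - v^2 = 0 for vbar, and return vbar - v."""
    w = E_CHARGE * b_field * radius / M_E                 # shorthand for eBR / m_e
    v_bar = 0.5 * (w + math.sqrt(w * w + 4.0 * v * v))    # positive root
    return v_bar - v

def delta_m(radius, b_field):
    """Magnitude of the induced moment change in Eq. 6.8: e^2 R^2 B / (4 m_e)."""
    return E_CHARGE**2 * radius**2 * b_field / (4.0 * M_E)

R, V, B = 5.29e-11, 2.19e6, 1.0   # assumed: Bohr-like orbit in a 1 T field
print(delta_v_approx(R, B), delta_v_exact(R, B, V), delta_v_approx(R, B) / V)
print(delta_m(R, B))
```

With these assumed numbers the fractional change in speed is of order a few parts per million, which is why treating $\Delta v$ as small is a safe approximation.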

In deriving Eq. 6.8, I assumed that the orbit remains circular, with its 
original radius R. I cannot offer a justification for this at the present stage. 
If the atom is stationary while the field is turned on, then my assumption 
can be proved — this is not magnetostatics, however, and the details will 
have to await Chapter 7 (see Prob. 7.55). If the atom is moved into the field, 
the situation is enormously more complicated. But never mind — I’m only 
trying to give you a qualitative account of diamagnetism. Assume, if you 
prefer, that the velocity remains the same while the radius changes — the 
formula (Eq. 6.8) is altered (by a factor of 2), but the qualitative conclusion 
is unaffected. The truth is that this classical model is fundamentally flawed 
(diamagnetism is really a quantum phenomenon),⁶ so there’s not much point


5 I said (Eq. 5.11) that magnetic fields do no work, and are incapable of speeding a particle
up. I stand by that. However, as we shall see in Chapter 7, a changing magnetic field induces
an electric field, and it is the latter that accelerates the electrons in this instance.

in refining the details. What is important is the empirical fact that in
diamagnetic materials the induced dipole moments point opposite to the 
magnetic field. 


6.1.4 Magnetization 


In the presence of a magnetic field, matter becomes magnetized; that is, 
upon microscopic examination, it will be found to contain many tiny 
dipoles, with a net alignment along some direction. We have discussed two 
mechanisms that account for this magnetic polarization: (1) paramagnetism 
(the dipoles associated with the spins of unpaired electrons experience a 
torque tending to line them up parallel to the field) and (2) diamagnetism 
(the speed of the electrons is altered in such a way as to change the orbital 
dipole moment in a direction opposite to the field). Whatever the cause, we 
describe the state of magnetic polarization by the quantity 


M = magnetic dipole moment per unit volume. (6.9) 


The vector M is called the magnetization; it plays a role analogous to 
the polarization P in electrostatics. In the following section, we will not 
worry about how the magnetization got there — it could be paramagnetism, 
diamagnetism, or even ferromagnetism — we shall take M as given, and 
calculate the field this magnetization itself produces. 

Incidentally, it may have surprised you to learn that materials other 
than the famous ferromagnetic trio (iron, nickel, and cobalt) are affected 
by a magnetic field at all. You cannot, of course, pick up a piece of 
wood or aluminum with a magnet. The reason is that diamagnetism and 
paramagnetism are extremely weak: it takes a delicate experiment and a 
powerful magnet to detect them at all. If you were to suspend a piece of 
paramagnetic material above a solenoid, as in Fig. 6.3, the induced magne- 
tization would be upward, and hence the force downward. By contrast, the 
magnetization of a diamagnetic object would be downward and the force 
upward. In general, when a sample is placed in a region of nonuniform field, 
paramagnets are attracted into the field, whereas diamagnets are repelled 
away. But the actual forces are pitifully weak — in a typical experimental 
arrangement the force on a comparable sample of iron would be $10^4$ or
$10^5$ times as great. That’s why it was reasonable for us to calculate the field
inside a piece of copper wire, say, in Chapter 5, without worrying about the
effects of magnetization.⁷


6 S. L. O'Dell and R. K. P. Zia, Am. J. Phys. 54, 32 (1986); R. Peierls, Surprises in Theoretical
Physics, Princeton University Press, Princeton, NJ (1979), Sect. 4.3; R. P. Feynman, R. B.
Leighton, and M. Sands, The Feynman Lectures on Physics, Vol. 2, Addison-Wesley, New
York (1966), Sects. 34-36.

7 In 1997, Andre Geim managed to levitate a live frog (diamagnetic) for 30 minutes; he was
awarded the 2000 Ig Nobel prize for this achievement, and later (2010) the Nobel prize for
research on graphene. See M. V. Berry and A. K. Geim, Eur. J. Phys. 18, 307 (1997) and
Geim, Physics Today September 1998, p. 36.

Problem 6.6. Of the following materials, which would you expect to be paramagnetic and
which diamagnetic: aluminum, copper, copper chloride (CuCl₂), carbon, lead, nitrogen (N₂),
salt (NaCl), sodium, sulfur, water? (Actually, copper is slightly diamagnetic; otherwise they're
all what you'd expect.)⁸


6.2 The Field of a Magnetized Object 


6.2.1 Bound Currents 


Suppose we have a piece of magnetized material; the magnetic dipole 
moment per unit volume, M, is given. What field does this object produce? 
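Well, the vector potential of a single dipole m is given by Eq. 5.85:

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \frac{\mathbf{m} \times \hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}. \tag{6.10}$$

In the magnetized object, each volume element $d\tau'$ carries a dipole moment
$\mathbf{M}\,d\tau'$, so the total vector potential is (Fig. 6.11)

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int \frac{\mathbf{M}(\mathbf{r}') \times \hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\, d\tau'. \tag{6.11}$$

That does it, in principle. But, as in the electrical case (Section 4.2.1), the
integral can be cast in a more illuminating form by exploiting the identity

$$\nabla' \frac{1}{\mathscr{r}} = \frac{\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}.$$


8 In truth, the determination is more complicated than I have suggested. Although an
individual atom (or molecule) with an odd number of electrons obviously has an unpaired
electron, and is therefore paramagnetic, those with an even number can also have unpaired
electrons (this is governed by Hund’s rule). Atomic oxygen, for example, is paramagnetic.
Moreover, in molecules and solids, pairing may occur between shared electrons; thus
bismuth (atomic number 83) is strongly diamagnetic in its solid phase.

With this,

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int \left[ \mathbf{M}(\mathbf{r}') \times \left( \nabla' \frac{1}{\mathscr{r}} \right) \right] d\tau'.$$

Integrating by parts, using product rule 7, gives

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \left\{ \int \frac{1}{\mathscr{r}} [\nabla' \times \mathbf{M}(\mathbf{r}')]\, d\tau' - \int \nabla' \times \left[ \frac{\mathbf{M}(\mathbf{r}')}{\mathscr{r}} \right] d\tau' \right\}.$$

Problem 1.61b invites us to express the latter as a surface integral,

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int \frac{1}{\mathscr{r}} [\nabla' \times \mathbf{M}(\mathbf{r}')]\, d\tau' + \frac{\mu_0}{4\pi} \oint \frac{1}{\mathscr{r}} [\mathbf{M}(\mathbf{r}') \times d\mathbf{a}']. \tag{6.12}$$

The first integral looks just like the potential of a volume current,

$$\mathbf{J}_b = \nabla \times \mathbf{M}, \tag{6.13}$$

while the second looks like the potential of a surface current,

$$\mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}}, \tag{6.14}$$

where $\hat{\mathbf{n}}$ is the normal unit vector. With these definitions,

$$\mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi} \int_{\mathcal{V}} \frac{\mathbf{J}_b(\mathbf{r}')}{\mathscr{r}}\, d\tau' + \frac{\mu_0}{4\pi} \oint_{\mathcal{S}} \frac{\mathbf{K}_b(\mathbf{r}')}{\mathscr{r}}\, da'. \tag{6.15}$$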

What this means is that the potential (and hence also the field) is the
same as would be produced by a volume current $\mathbf{J}_b = \nabla \times \mathbf{M}$ throughout
the material, plus a surface current $\mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}}$ on the boundary. Instead of
integrating the contributions of all the infinitesimal dipoles, using Eq. 6.11,
we first determine the bound currents, and then find the field they produce, in
the same way we would calculate the field of any other volume and surface
currents. Notice the striking parallel with the electrical case: there the field
of a polarized object was the same as that of a bound volume charge
$\rho_b = -\nabla \cdot \mathbf{P}$ plus a bound surface charge $\sigma_b = \mathbf{P} \cdot \hat{\mathbf{n}}$.


Example 6.1. Find the magnetic field of a uniformly magnetized sphere.


Solution. Choosing the z-axis along the direction of M (Fig. 6.12), we have

$$\mathbf{J}_b = \nabla \times \mathbf{M} = 0, \qquad \mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}} = M \sin\theta\,\hat{\boldsymbol{\phi}}.$$

Now, a rotating spherical shell, of uniform surface charge $\sigma$, corresponds
to a surface current density

$$\mathbf{K} = \sigma\mathbf{v} = \sigma\omega R \sin\theta\,\hat{\boldsymbol{\phi}}.$$

It follows, therefore, that the field of a uniformly magnetized sphere is
identical to the field of a spinning spherical shell, with the identification
$\sigma R\boldsymbol{\omega} \to \mathbf{M}$. Referring back to Ex. 5.9,

$$\mathbf{B} = \frac{2}{3}\mu_0\mathbf{M}, \tag{6.16}$$

inside the sphere, while the field outside is the same as that of a perfect
dipole,

$$\mathbf{m} = \frac{4}{3}\pi R^3\mathbf{M}.$$

Notice that the internal field is uniform, like the electric field inside a
uniformly polarized sphere (Eq. 4.14), although the actual formulas for
the two cases are curiously different ($\frac{2}{3}$ in place of $-\frac{1}{3}$).⁹ The external
fields are also analogous: pure dipole in both instances.
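
As an independent check on Eq. 6.16, one can add up the on-axis fields of the rings of bound surface current $K_b = M \sin\theta$ directly. The following Python sketch does this with a midpoint rule; it is only an illustration, and the function name, the ring count, and the test points are my own assumptions.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, approximately 4*pi*1e-7 T*m/A

def b_axis_magnetized_sphere(z, M, R, n_rings=20000):
    """On-axis B from rings of bound current K_b = M sin(theta) on a sphere of radius R."""
    total = 0.0
    d_theta = math.pi / n_rings
    for i in range(n_rings):
        theta = (i + 0.5) * d_theta                        # midpoint of each strip
        ring_current = M * math.sin(theta) * R * d_theta   # dI = K_b * (R dtheta)
        a = R * math.sin(theta)                            # ring radius
        z_ring = R * math.cos(theta)                       # height of the ring
        total += MU0 * ring_current * a**2 / (2.0 * (a**2 + (z - z_ring) ** 2) ** 1.5)
    return total

M, R = 1.0, 1.0   # illustrative units
for z in (0.0, 0.5, 3.0):
    print(z, b_axis_magnetized_sphere(z, M, R) / (MU0 * M))
```

For points inside the sphere the ratio should come out at 2/3, independent of z; outside it should follow the dipole form $(2/3)(R/z)^3$ on the axis.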


Problem 6.7. An infinitely long circular cylinder carries a uniform magnetization M parallel 
to its axis. Find the magnetic field (due to M) inside and outside the cylinder. 


[Figs. 6.13 and 6.14]


Problem 6.8. A very long circular cylinder of radius R carries a magnetization $\mathbf{M} = ks^2\,\hat{\boldsymbol{\phi}}$,
where k is a constant (Fig. 6.13). Find the magnetic field due to M, for points inside and outside
the cylinder (and far from the ends).


Problem 6.9. A short circular cylinder of radius a and length L carries a “frozen-in” uniform
magnetization M parallel to its axis. Find the bound current, and sketch the magnetic field of
the cylinder. (Make three sketches: one for $L \gg a$, one for $L \ll a$, and one for $L \approx a$.)
Compare this bar magnet with the bar electret of Prob. 4.1.


9 It is no accident that the same factors appear in the “contact” term for the fields of electric
and magnetic dipoles (Eqs. 3.106 and 5.94). In fact, one good way to model a perfect dipole
is to take the limit ($R \to 0$) of a uniformly polarized/magnetized sphere.

Problem 6.10. An iron rod of length L and square cross section (side length a) is given a
uniform longitudinal magnetization M, and then bent around into a circle with a narrow gap
(width w), as shown in Fig. 6.14. Find the magnetic field at the center of the gap, assuming
$w \ll a \ll L$. [Hint: Treat it as the superposition of a complete torus plus a square loop with
reversed current.]


6.2.2 Physical Interpretation of Bound Currents 


In the last section, we found that the field of a magnetized object is identical
to the field that would be produced by a certain distribution of “bound”
currents, $\mathbf{J}_b$ and $\mathbf{K}_b$. I want to show you how these bound currents arise
physically. This will be a heuristic argument; the rigorous derivation has
already been given. Figure 6.15 depicts a thin slab of uniformly magnetized
material, with the dipoles represented by tiny current loops. Notice that all
the “internal” currents cancel: every time there is one going to the right, a
contiguous one is going to the left. However, at the edge there is no adjacent
loop to do the canceling. The whole thing, then, is equivalent to a single
ribbon of current I flowing around the boundary (Fig. 6.16).

What is this current, in terms of M? Say that each of the tiny loops has
area a and thickness t (Fig. 6.17). In terms of the magnetization M, its
dipole moment is m = Mat. In terms of the circulating current I, however,
m = Ia. Therefore I = Mt, so the surface current is $K_b = I/t = M$. Using the
outward-drawn unit vector $\hat{\mathbf{n}}$ (Fig. 6.16), the direction of $\mathbf{K}_b$ is conveniently
indicated by the cross product

$$\mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}}.$$

(This expression also records the fact that there is no current on the top
or bottom surface of the slab; here M is parallel to $\hat{\mathbf{n}}$, so the cross product
vanishes.)

This bound surface current is exactly what we obtained in Section 6.2.1. 
It is a peculiar kind of current, in the sense that no single charge makes the 
whole trip — on the contrary, each charge moves only in a tiny loop within 
a single atom. Nevertheless, the net effect is a macroscopic current flowing 
over the surface of the magnetized object. We call it a “bound” current to 
remind ourselves that every charge is attached to a particular atom, but it’s 
a perfectly genuine current, and it produces a magnetic field in the same 
way any other current does. 

When the magnetization is nonuniform, the internal currents no longer 
cancel. Figure 6.18a shows two adjacent chunks of magnetized material, 
with a larger arrow on the one to the right suggesting greater magnetization 
at that point. On the surface where they join, there is a net current in the 
x direction, given by 


$$I_x = [M_z(y + dy) - M_z(y)]\,dz = \frac{\partial M_z}{\partial y}\,dy\,dz.$$

The corresponding volume current density is therefore

$$(J_b)_x = \frac{\partial M_z}{\partial y}.$$

By the same token, a nonuniform magnetization in the y direction would
contribute an amount $-\partial M_y/\partial z$ (Fig. 6.18b), so

$$(J_b)_x = \frac{\partial M_z}{\partial y} - \frac{\partial M_y}{\partial z}.$$

In general, then,

$$\mathbf{J}_b = \nabla \times \mathbf{M},$$

consistent, again, with the result of Section 6.2.1.
Incidentally, like any other steady current, $\mathbf{J}_b$ should obey the conservation
law 5.33:

$$\nabla \cdot \mathbf{J}_b = 0.$$

Does it? Yes; the divergence of a curl is always zero.


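
For a magnetization you cannot differentiate by hand, the bound volume current can be estimated numerically. Here is a minimal Python sketch using central differences; the helper `curl`, the step size, and the sample magnetization (with $M_z$ growing linearly in y, as in Fig. 6.18a) are illustrative assumptions rather than anything prescribed by the text.

```python
def curl(field, point, h=1e-5):
    """Central-difference curl of a vector field; field(x, y, z) returns (Fx, Fy, Fz)."""
    def partial(component, axis):
        plus, minus = list(point), list(point)
        plus[axis] += h
        minus[axis] -= h
        return (field(*plus)[component] - field(*minus)[component]) / (2.0 * h)

    return (
        partial(2, 1) - partial(1, 2),   # dFz/dy - dFy/dz
        partial(0, 2) - partial(2, 0),   # dFx/dz - dFz/dx
        partial(1, 0) - partial(0, 1),   # dFy/dx - dFx/dy
    )

M0, L = 1.0e3, 0.1   # assumed values: A/m and m

def magnetization(x, y, z):
    """Hypothetical nonuniform magnetization: M_z increases linearly with y."""
    return (0.0, 0.0, M0 * (1.0 + y / L))

# J_b should come out close to (M0 / L, 0, 0), i.e. a current in the x direction
print(curl(magnetization, (0.0, 0.02, 0.0)))
```

The only nonzero component is $(J_b)_x = \partial M_z/\partial y$, in line with the heuristic argument above.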
6.2.3 The Magnetic Field inside Matter 


Like the electric field, the true microscopic magnetic field inside matter 
fluctuates wildly from point to point and instant to instant. When we speak 
of “the” magnetic field in matter, we mean the macroscopic field: the average 
over regions large enough to contain many atoms. (The magnetization 
M is “smoothed out” in the same sense.) It is this macroscopic field 
that one obtains when the methods of Section 6.2.1 are applied to points 
inside magnetized material, as you can prove for yourself in the following 
problem. 


Problem 6.11. In Section 6.2.1, we began with the potential of a perfect dipole (Eq. 6.10),
whereas in fact we are dealing with physical dipoles. Show, by the method of Section 4.2.3, that 
we nonetheless get the correct macroscopic field. 


6.3 The Auxiliary Field H 


6.3.1 Ampère’s Law in Magnetized Materials


In Section 6.2, we found that the effect of magnetization is to establish
bound currents $\mathbf{J}_b = \nabla \times \mathbf{M}$ within the material and $\mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}}$ on
the surface. The field due to magnetization of the medium is just the field
produced by these bound currents. We are now ready to put everything
together: the field attributable to bound currents, plus the field due to
everything else, which I shall call the free current. The free current might
flow through wires embedded in the magnetized substance or, if the latter is
a conductor, through the material itself. In any event, the total current can
be written as

$$\mathbf{J} = \mathbf{J}_b + \mathbf{J}_f. \tag{6.17}$$


There is no new physics in Eq. 6.17; it is simply a convenience to separate 
the current into these two parts, because they got there by quite different 
means: the free current is there because somebody hooked up a wire to a 
battery — it involves actual transport of charge; the bound current is there 
because of magnetization — it results from the conspiracy of many aligned 
atomic dipoles. 

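In view of Eqs. 6.13 and 6.17, Ampère’s law can be written

$$\frac{1}{\mu_0}(\nabla \times \mathbf{B}) = \mathbf{J} = \mathbf{J}_f + \mathbf{J}_b = \mathbf{J}_f + (\nabla \times \mathbf{M}),$$

or, collecting together the two curls,

$$\nabla \times \left( \frac{1}{\mu_0}\mathbf{B} - \mathbf{M} \right) = \mathbf{J}_f.$$

The quantity in parentheses is designated by the letter H:

$$\mathbf{H} \equiv \frac{1}{\mu_0}\mathbf{B} - \mathbf{M}. \tag{6.18}$$

In terms of H, then, Ampère’s law reads

$$\nabla \times \mathbf{H} = \mathbf{J}_f, \tag{6.19}$$

or, in integral form,

$$\oint \mathbf{H} \cdot d\mathbf{l} = I_{f_{\text{enc}}}, \tag{6.20}$$

where $I_{f_{\text{enc}}}$ is the total free current passing through the Ampèrian loop.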

The vector H plays a role in magnetostatics analogous to D in electro- 
statics: just as D allowed us to write Gauss’s law in terms of the free charge 
alone, H permits us to express Ampère’s law in terms of the free current
alone — and free current is what we control directly. Bound current, like 
bound charge, comes along for the ride — the material gets magnetized, and 
this results in bound currents; we cannot turn them on or off independently, 
as we can free currents. In applying Eq. 6.20, all we need to worry about is 
the free current, which we know about because we put it there. In particular, 
when symmetry permits, we can calculate H immediately from Eq. 6.20 by 
the usual Ampère’s law methods. (For example, Probs. 6.7 and 6.8 can be
done in one line by noting that H = 0.) 


Example 6.2. A long copper rod of radius R carries a uniformly distributed
(free) current I (Fig. 6.19). Find H inside and outside the rod.


Solution. Copper is weakly diamagnetic, so the dipoles will line up
opposite to the field. This results in a bound current running antiparallel
to I within the wire, and parallel to I along the surface (Fig. 6.20).
Just how great these bound currents will be we are not yet in a position
to say, but in order to calculate H, it is sufficient to realize that all the
currents are longitudinal, so B, M, and therefore also H, are circumferential.
Applying Eq. 6.20 to an Ampèrian loop of radius s < R,

$$H(2\pi s) = I_{f_{\text{enc}}} = I\,\frac{\pi s^2}{\pi R^2},$$

so, inside the wire,

$$\mathbf{H} = \frac{I}{2\pi R^2}\, s\,\hat{\boldsymbol{\phi}} \qquad (s \le R). \tag{6.21}$$

Outside the wire

$$\mathbf{H} = \frac{I}{2\pi s}\,\hat{\boldsymbol{\phi}} \qquad (s \ge R). \tag{6.22}$$

In the latter region (as always, in empty space) M = 0, so

$$\mathbf{B} = \mu_0\mathbf{H} = \frac{\mu_0 I}{2\pi s}\,\hat{\boldsymbol{\phi}} \qquad (s \ge R),$$

the same as for a nonmagnetized wire (Ex. 5.7). Inside the wire, B
cannot be determined at this stage, since we have no way of knowing 
M (though in practice the magnetization in copper is so slight that for 
most purposes we can ignore it altogether). 
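
The result of this example is easy to tabulate. The short Python sketch below (the function name and the 10 A, 1 mm numbers are illustrative assumptions) evaluates the circumferential H at a few radii and compares it with the enclosed free current divided by the circumference, which is all Eq. 6.20 asks for.

```python
import math

def h_phi(s, current, radius):
    """Circumferential H (A/m) at distance s from the axis of a rod with uniform free current."""
    if s <= radius:
        return current * s / (2.0 * math.pi * radius**2)   # Eq. 6.21
    return current / (2.0 * math.pi * s)                   # Eq. 6.22

I_FREE, R = 10.0, 1.0e-3   # assumed: 10 A through a rod of radius 1 mm
for s in (0.5e-3, 1.0e-3, 2.0e-3):
    enclosed = I_FREE * min(s, R) ** 2 / R**2   # free current through a loop of radius s
    print(s, h_phi(s, I_FREE, R), enclosed / (2.0 * math.pi * s))
```

Note that nothing about copper enters: H depends only on the free current, which is the whole point of introducing it.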


As it turns out, H is a more useful quantity than D. In the laboratory, you
will frequently hear people talking about H (more often even than B), but
you will never hear anyone speak of D (only E). The reason is this: To build
an electromagnet you run a certain (free) current through a coil. The current
is the thing you read on the dial, and this determines H (or at any rate, the
line integral of H); B depends on the specific materials you used and even,
if iron is present, on the history of your magnet. On the other hand, if you 
want to set up an electric field, you do not plaster a known free charge on the 
plates of a parallel-plate capacitor; rather, you connect them to a battery of 
known voltage. It’s the potential difference you read on your dial, and that 
determines E (or rather, the line integral of E); D depends on the details of 
the dielectric you're using. If it were easy to measure charge, and hard to 
measure potential, then you’d find experimentalists talking about D instead 
of E. So the relative familiarity of H, as contrasted with D, derives from 
purely practical considerations; theoretically, they’re on an equal footing. 
Many authors call H, not B, the “magnetic field.” Then they have to 
invent a new word for B: the “flux density,” or magnetic “induction” (an 
absurd choice, since that term already has at least two other meanings in 
electrodynamics). Anyway, B is indisputably the fundamental quantity (and 
it’s the thing that goes in the Lorentz force law), so I shall continue to call it 
the “magnetic field.” The quantity H has no sensible name: just call it “H.”¹⁰


Problem 6.12. An infinitely long cylinder, of radius R, carries a “frozen-in” magnetization, 
parallel to the axis, 


$$\mathbf{M} = ks\,\hat{\mathbf{z}},$$


where k is a constant and s is the distance from the axis; there is no free current anywhere. Find
the magnetic field inside and outside the cylinder by two different methods:


(a) As in Section 6.2, locate all the bound currents, and calculate the field they produce.

(b) Use Ampère’s law (in the form of Eq. 6.20) to find H, and then get B from Eq. 6.18. (Notice
that the second method is much faster, and avoids any explicit reference to the bound
currents.)


Problem 6.13. Suppose the field inside a large piece of magnetic material is $\mathbf{B}_0$, so that
$\mathbf{H}_0 = (1/\mu_0)\mathbf{B}_0 - \mathbf{M}$, where M is a “frozen-in” magnetization.


(a) Now a small spherical cavity is hollowed out of the material (Fig. 6.21). Find the field at the
center of the cavity, in terms of $\mathbf{B}_0$ and M. Also find H at the center of the cavity, in terms
of $\mathbf{H}_0$ and M.

(b) Do the same for a long needle-shaped cavity running parallel to M. 

(c) Do the same for a thin wafer-shaped cavity perpendicular to M. 


10 For those who disagree, I quote A. Sommerfeld’s Electrodynamics, Academic Press, New
York (1952), p. 45: “The unhappy term ‘magnetic field’ for H should be avoided as far
as possible. It seems to us that this term has led into error none less than Maxwell
himself.” The H due to magnetization (as opposed to free currents) is sometimes called
the demagnetization field, because it tends to oppose M (see, for example, Prob. 6.14).

[Fig. 6.21: (a) Sphere, (b) Needle, (c) Wafer]


Assume the cavities are small enough so M, $\mathbf{B}_0$, and $\mathbf{H}_0$ are essentially constant. Compare
Prob. 4.16. [Hint: Carving out a cavity is the same as superimposing an object of the same shape
but opposite magnetization.]


6.3.2 A Deceptive Parallel 


Equation 6.19 looks just like Ampère’s original law (Eq. 5.56), except that
the total current is replaced by the free current, and B is replaced by $\mu_0\mathbf{H}$.
As in the case of D, however, I must warn you against reading too much into
this correspondence. It does not say that $\mu_0\mathbf{H}$ is “just like B, only its source
is $\mathbf{J}_f$ instead of J.” For the curl alone does not determine a vector field;
you must also know the divergence. And whereas $\nabla \cdot \mathbf{B} = 0$, the divergence
of H is not, in general, zero. In fact, from Eq. 6.18,

$$\nabla \cdot \mathbf{H} = -\nabla \cdot \mathbf{M}. \tag{6.23}$$

Only when the divergence of M vanishes (everywhere) is the parallel
between B and $\mu_0\mathbf{H}$ faithful. In particular, there is (in general) no “Biot-Savart”
law giving H in terms of the free current, and there is no vector
potential for H.

If you think I’m being pedantic, consider the example of the bar magnet,
a short cylinder of iron that carries a permanent uniform magnetization
M parallel to its axis. (See Probs. 6.9 and 6.14.) In this case there is no
free current anywhere, and a naive application of Eq. 6.20 might lead you
to suppose that H = 0, and hence that $\mathbf{B} = \mu_0\mathbf{M}$ inside the magnet and
B = 0 outside, which is nonsense. It is quite true that the curl of H vanishes
everywhere, but the divergence does not. (Can you see where $\nabla \cdot \mathbf{M} \neq 0$?)
Advice: When you are asked to find B or H in a problem involving magnetic
materials, first look for symmetry. If the problem exhibits cylindrical, plane,
solenoidal, or toroidal symmetry, then you can get H directly from Eq. 6.20
by the usual Ampère’s law methods. (Evidently, in such cases $\nabla \cdot \mathbf{M}$ is
automatically zero, since the free current alone determines the answer.) If
the requisite symmetry is absent, you'll have to think of another approach,
and in particular you must not assume that H is zero just because there is 
no free current in sight. 
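
To make the warning concrete, here is a small Python sketch of H on the axis of a bar magnet. It assumes the standard on-axis formula for a finite solenoid, applied to the bound surface current $K_b = M$ of a uniformly magnetized cylinder; the function names and dimensions are my own illustrative choices.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, approximately 4*pi*1e-7 T*m/A

def b_axis_bar_magnet(z, M, length, a):
    """On-axis B of a uniformly magnetized cylinder, treated as a solenoid with K_b = M."""
    top, bottom = z + length / 2.0, z - length / 2.0
    return 0.5 * MU0 * M * (top / math.hypot(top, a) - bottom / math.hypot(bottom, a))

def h_axis_bar_magnet(z, M, length, a):
    """H = B/mu0 - M (Eq. 6.18), with M nonzero only inside the magnet."""
    m_here = M if abs(z) < length / 2.0 else 0.0
    return b_axis_bar_magnet(z, M, length, a) / MU0 - m_here

a = 0.01                      # assumed radius: 1 cm
for z in (0.0, 2 * a):        # center of the magnet, and a point beyond its end
    print(z, h_axis_bar_magnet(z, M=1.0, length=2 * a, a=a))
```

With no free current anywhere, H is nevertheless nonzero: inside the magnet it points opposite to M, and outside it points along M.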


6.3.3 Boundary Conditions 


The magnetostatic boundary conditions of Section 5.4.2 can be rewritten 
in terms of H and the free current. From Eq. 6.23 it follows that 


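$$H^{\perp}_{\text{above}} - H^{\perp}_{\text{below}} = -(M^{\perp}_{\text{above}} - M^{\perp}_{\text{below}}), \tag{6.24}$$

while Eq. 6.19 says

$$\mathbf{H}^{\parallel}_{\text{above}} - \mathbf{H}^{\parallel}_{\text{below}} = \mathbf{K}_f \times \hat{\mathbf{n}}. \tag{6.25}$$

In the presence of materials, these are sometimes more useful than the
corresponding boundary conditions on B (Eqs. 5.74 and 5.76):

$$B^{\perp}_{\text{above}} - B^{\perp}_{\text{below}} = 0, \tag{6.26}$$

and

$$\mathbf{B}^{\parallel}_{\text{above}} - \mathbf{B}^{\parallel}_{\text{below}} = \mu_0(\mathbf{K} \times \hat{\mathbf{n}}). \tag{6.27}$$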


You might want to check them, for Ex. 6.2 or Prob. 6.14. 
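
As one more illustration (for a different configuration than the two just suggested), the Python sketch below checks all four conditions at the surface of the uniformly magnetized sphere of Ex. 6.1, using the interior field of Eq. 6.16 and the exterior dipole field. The function name and the sample angle are assumptions of mine; "above" is taken to mean just outside r = R.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, approximately 4*pi*1e-7 T*m/A

def sphere_surface_fields(theta, M):
    """(radial, polar) components of B, M, and H just inside and just outside r = R."""
    b_in = (2.0 / 3.0) * MU0 * M                      # Eq. 6.16, directed along z
    inside = {
        "B": (b_in * math.cos(theta), -b_in * math.sin(theta)),
        "M": (M * math.cos(theta), -M * math.sin(theta)),
    }
    # Outside: pure dipole with m = (4/3) pi R^3 M, evaluated at r = R
    outside = {
        "B": ((2.0 / 3.0) * MU0 * M * math.cos(theta),
              (1.0 / 3.0) * MU0 * M * math.sin(theta)),
        "M": (0.0, 0.0),
    }
    for side in (inside, outside):
        side["H"] = tuple(b / MU0 - m for b, m in zip(side["B"], side["M"]))  # Eq. 6.18
    return inside, outside

inside, outside = sphere_surface_fields(theta=0.7, M=1.0e3)
print("jump in B_perp:", outside["B"][0] - inside["B"][0])
print("jump in H_perp:", outside["H"][0] - inside["H"][0])
print("jump in H_par: ", outside["H"][1] - inside["H"][1])
print("jump in B_par: ", outside["B"][1] - inside["B"][1])
```

The perpendicular component of B and the parallel component of H are continuous (there is no free surface current), while the jump in $H^{\perp}$ equals $M\cos\theta$ and the jump in $B^{\parallel}$ equals $\mu_0 M \sin\theta$, the bound surface current times $\mu_0$.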


Problem 6.14. For the bar magnet of Prob. 6.9, make careful sketches of M, B, and H, 
assuming L is about 2a. Compare Prob. 4.17. 


Problem 6.15. If $\mathbf{J}_f = 0$ everywhere, the curl of H vanishes (Eq. 6.19), and we can express
H as the gradient of a scalar potential W:

$$\mathbf{H} = -\nabla W.$$

According to Eq. 6.23, then,

$$\nabla^2 W = \nabla \cdot \mathbf{M},$$

so W obeys Poisson's equation, with $-\nabla \cdot \mathbf{M}$ as the “source.” This opens up all the machinery
of Chapter 3. As an example, find the field inside a uniformly magnetized sphere (Ex. 6.1) by
separation of variables. [Hint: $\nabla \cdot \mathbf{M} = 0$ everywhere except at the surface (r = R), so W
satisfies Laplace's equation in the regions r < R and r > R; use Eq. 3.65, and from Eq. 6.24
figure out the appropriate boundary condition on W.]


6.4 Linear and Nonlinear Media 


6.4.1 Magnetic Susceptibility and Permeability 


In paramagnetic and diamagnetic materials, the magnetization is sustained
by the field; when B is removed, M disappears. In fact, for most substances
the magnetization is proportional to the field, provided the field is not
too strong. For notational consistency with the electrical case (Eq. 4.30),
I should express the proportionality thus:

$$\mathbf{M} = \frac{1}{\mu_0}\chi_m\mathbf{B} \quad \text{(incorrect!)} \tag{6.28}$$

But custom dictates that it be written in terms of H, instead of B:

$$\mathbf{M} = \chi_m\mathbf{H}. \tag{6.29}$$

The constant of proportionality $\chi_m$ is called the magnetic susceptibility; it
is a dimensionless quantity that varies from one substance to another:
positive for paramagnets and negative for diamagnets. Typical values are
around $10^{-5}$ (see Table 6.1).

Materials that obey Eq. 6.29 are called linear media. In view of Eq. 6.18,

$$\mathbf{B} = \mu_0(\mathbf{H} + \mathbf{M}) = \mu_0(1 + \chi_m)\mathbf{H} \tag{6.30}$$

for linear media. Thus B is also proportional to H:¹¹

$$\mathbf{B} = \mu\mathbf{H}, \tag{6.31}$$

where

$$\mu \equiv \mu_0(1 + \chi_m). \tag{6.32}$$

The quantity $\mu$ is called the permeability of the material.¹² In a vacuum,
where there is no matter to magnetize, the susceptibility $\chi_m$ vanishes, and
the permeability is $\mu_0$. That’s why $\mu_0$ is called the permeability of free space.


Table 6.1. Magnetic susceptibilities (unless otherwise specified, values are for 1 atm, 20°C).
Data from Handbook of Chemistry and Physics, 91st ed., CRC Press, Boca Raton (2010)
and other references.

Diamagnetic                         Paramagnetic
Material          Susceptibility    Material                    Susceptibility
Bismuth           −1.7 × 10⁻⁴       Oxygen (O₂)                 1.7 × 10⁻⁶
Gold              −3.4 × 10⁻⁵       Sodium                      8.5 × 10⁻⁶
Silver            −2.4 × 10⁻⁵       Aluminum                    2.2 × 10⁻⁵
Copper            −9.7 × 10⁻⁶       Tungsten                    7.0 × 10⁻⁵
Water             −9.0 × 10⁻⁶       Platinum                    2.7 × 10⁻⁴
Carbon dioxide    −1.1 × 10⁻⁸       Oxygen (liquid) (−200°C)    3.9 × 10⁻³
Hydrogen (H₂)     −2.1 × 10⁻⁹       Gadolinium                  4.8 × 10⁻¹
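
The bookkeeping in Eqs. 6.28 through 6.32 can be spelled out in a few lines of Python. This is only a sketch: the function names are mine, and `chi_in_terms_of_b` is the constant that would appear in Eq. 6.28 if one insisted on writing M in terms of B (it follows from combining Eqs. 6.29 and 6.30).

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, approximately 4*pi*1e-7 T*m/A

def permeability(chi_m):
    """Eq. 6.32: mu = mu0 (1 + chi_m)."""
    return MU0 * (1.0 + chi_m)

def chi_in_terms_of_b(chi_m):
    """Constant c in M = c B / mu0, obtained from M = chi_m H and B = mu0 (1 + chi_m) H."""
    return chi_m / (1.0 + chi_m)

# A few susceptibilities as listed in Table 6.1
samples = {"copper": -9.7e-6, "aluminum": 2.2e-5, "gadolinium": 4.8e-1}
for name, chi in samples.items():
    kind = "paramagnetic" if chi > 0 else "diamagnetic"
    print(name, kind, permeability(chi) / MU0, chi_in_terms_of_b(chi))
```

For typical materials the two constants are nearly indistinguishable, since $\chi_m$ is so small; only for something like gadolinium does the choice between H and B make a visible difference.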


11 Physically, therefore, Eq. 6.28 would say exactly the same as Eq. 6.29, only the constant
$\chi_m$ would have a different value. Equation 6.29 is a little more convenient, because
experimentalists find it handier to work with H than B.

12 If you factor out $\mu_0$, what's left is called the relative permeability: $\mu_r \equiv 1 + \chi_m = \mu/\mu_0$.
By the way, formulas for H in terms of B (Eq. 6.31, in the case of linear media) are called
constitutive relations, just like those for D in terms of E.

Example 6.3. An infinite solenoid (n turns per unit length, current I) is
filled with linear material of susceptibility $\chi_m$. Find the magnetic field
inside the solenoid.


Solution. Since B is due in part to bound currents (which we don’t yet
know), we cannot compute it directly. However, this is one of those
symmetrical cases in which we can get H from the free current alone,
using Ampère’s law in the form of Eq. 6.20:

$$\mathbf{H} = nI\,\hat{\mathbf{z}}$$

(Fig. 6.22). According to Eq. 6.31, then,

$$\mathbf{B} = \mu_0(1 + \chi_m)nI\,\hat{\mathbf{z}}.$$

If the medium is paramagnetic, the field is somewhat enhanced; if it’s
diamagnetic, the field is slightly reduced. This reflects the fact that the
bound surface current

$$\mathbf{K}_b = \mathbf{M} \times \hat{\mathbf{n}} = \chi_m(\mathbf{H} \times \hat{\mathbf{n}}) = \chi_m nI\,\hat{\boldsymbol{\phi}}$$

is in the same direction as I, in the former case ($\chi_m > 0$), and opposite
in the latter ($\chi_m < 0$).
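
The chain of reasoning in this example (free current, then H, then M and B, then the bound current) translates directly into code. The Python sketch below is illustrative only; the function name, the dictionary keys, and the sample numbers are assumptions of mine.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space, approximately 4*pi*1e-7 T*m/A

def filled_solenoid(n, current, chi_m):
    """Axial fields inside an infinite solenoid filled with linear material."""
    h = n * current                  # from Ampere's law for H: free current only
    m = chi_m * h                    # Eq. 6.29
    b = MU0 * (1.0 + chi_m) * h      # Eq. 6.31
    k_bound = m                      # signed K_b along phi, since M is along the axis
    return {"H": h, "M": m, "B": b, "K_b": k_bound}

# Assumed numbers: 1000 turns/m, 2 A, and the susceptibility of aluminum
fields = filled_solenoid(n=1000.0, current=2.0, chi_m=2.2e-5)

# Cross-check: B equals mu0 times the total (free + bound) surface current per unit length
print(fields["B"], MU0 * (1000.0 * 2.0 + fields["K_b"]))
```

The cross-check expresses the same physics as the text: the bound surface current simply adds to (or subtracts from) the free current in the windings.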


You might suppose that linear media escape the defect in the parallel
between B and H: since M and H are now proportional to B, does it not
follow that their divergence, like B’s, must always vanish? Unfortunately, it
does not;¹³ at the boundary between two materials of different permeability,
the divergence of M can actually be infinite. For instance, at the end of a
cylinder of linear paramagnetic material, M is zero on one side but not on
the other. For the “Gaussian pillbox” shown in Fig. 6.23, $\oint \mathbf{M} \cdot d\mathbf{a} \neq 0$,
and hence, by the divergence theorem, $\nabla \cdot \mathbf{M}$ cannot vanish everywhere
within it.

[Fig. 6.23: Gaussian pillbox straddling the boundary between the paramagnet and vacuum]


13 Formally, $\nabla \cdot \mathbf{H} = \nabla \cdot \left(\frac{1}{\mu}\mathbf{B}\right) = \frac{1}{\mu}\nabla \cdot \mathbf{B} + \mathbf{B} \cdot \nabla\left(\frac{1}{\mu}\right) = \mathbf{B} \cdot \nabla\left(\frac{1}{\mu}\right)$, so H is not divergenceless (in
general) at points where $\mu$ is changing.

Incidentally, the volume bound current density in a homogeneous linear
material is proportional to the free current density:

$$\mathbf{J}_b = \nabla \times \mathbf{M} = \nabla \times (\chi_m\mathbf{H}) = \chi_m\mathbf{J}_f. \tag{6.33}$$


In particular, unless free current actually flows through the material, all 
bound current will be at the surface. 


Problem 6.16. A coaxial cable consists of two very long cylindrical tubes, separated by linear
insulating material of magnetic susceptibility $\chi_m$. A current I flows down the inner conductor
and returns along the outer one; in each case, the current distributes itself uniformly over the
surface (Fig. 6.24). Find the magnetic field in the region between the tubes. As a check, calculate
the magnetization and the bound currents, and confirm that (together, of course, with the free
currents) they generate the correct field.


Fig. 6.24 


Problem 6.17. A (free) current I flows down a long straight wire of radius a. If the wire is
made of linear material (copper, say, or aluminum) with susceptibility $\chi_m$, and the current is
distributed uniformly, what is the magnetic field a distance s from the axis? Find all the bound
currents. What is the net bound current flowing down the wire?


Problem 6.18. A sphere of linear magnetic material is placed in an otherwise uniform magnetic
field $\mathbf{B}_0$. Find the field inside the sphere. [Hint: See Prob. 6.15 or Prob. 4.23.]


Problem 6.19. On the basis of the naïve model presented in Section 6.1.3, estimate the 
magnetic susceptibility of a diamagnetic metal such as copper. Compare your answer with the 
empirical value in Table 6.1, and comment on any discrepancy. 


6.4.2 Ferromagnetism 


In a linear medium, the alignment of atomic dipoles is maintained by 
a magnetic field imposed from the outside. Ferromagnets, which are
emphatically not linear,¹⁴ require no external fields to sustain the
magnetization; the alignment is “frozen in.” Like paramagnetism,
ferromagnetism involves the magnetic dipoles associated with the spins of unpaired
electrons. The new feature, which makes ferromagnetism so different from 
paramagnetism, is the interaction between nearby dipoles: in a ferromag- 
net, each dipole “likes” to point in the same direction as its neighbors. The 
reason for this preference is essentially quantum mechanical, and I shall 
not endeavor to explain it here; it is enough to know that the correlation 
is so strong as to align virtually 100 percent of the unpaired electron spins. 
If you could somehow magnify a piece of iron and “see” the individual 
dipoles as tiny arrows, it would look something like Fig. 6.25, with all the 
spins pointing the same way. 

But if that is true, why isn’t every wrench and skillet a powerful magnet? 
The answer is that the alignment occurs in relatively small patches, called 
domains. Each domain contains billions of dipoles, all lined up (these 
domains are actually visible under a microscope, using suitable etching 
techniques — see Fig. 6.26), but the domains themselves are randomly ori- 
ented. The household wrench contains an enormous number of domains, 
and their magnetic fields cancel, so the wrench as a whole is not magnetized. 
(Actually, the orientation of domains is not completely random; within a 
given crystal, there may be some preferential alignment along the crystal 
axes. But there will be just as many domains pointing one way as the 
other, so there is still no large-scale magnetization. Moreover, the crystals 
themselves are randomly oriented within any sizable chunk of metal.) 

How, then, would you produce a permanent magnet, such as they sell 
in toy stores? If you put a piece of iron into a strong magnetic field, the 
torque N = m x B tends to align the dipoles parallel to the field. Since they 
like to stay parallel to their neighbors, most of the dipoles will resist this 
torque. However, at the boundary between two domains, there are competing 
neighbors, and the torque will throw its weight on the side of the domain 
most nearly parallel to the field; this domain will win some converts, at the
expense of the less favorably oriented one. The net effect of the magnetic
field, then, is to move the domain boundaries. Domains parallel to the field
grow, and the others shrink. If the field is strong enough, one domain takes
over entirely, and the iron is said to be saturated.

14 In this sense, it is misleading to speak of the susceptibility or permeability of a ferromagnet.
The terms are used for such materials, but they refer to the proportionality factor between
a differential increase in H and the resulting differential change in M (or B); moreover, they
are not constants, but functions of H.

It turns out that this process (the shifting of domain boundaries in 
response to an external field) is not entirely reversible: when the field is 
switched off, there will be some return to randomly oriented domains, but 
it is far from complete — there remains a preponderance of domains in the 
original direction. You now have a permanent magnet. 

A simple way to accomplish this, in practice, is to wrap a coil of wire 
around the object to be magnetized (Fig. 6.27). Run a current I through
the coil; this provides the external magnetic field (pointing to the left in
the diagram). As you increase the current, the field increases, the domain
boundaries move, and the magnetization grows. Eventually, you reach the
saturation point, with all the dipoles aligned, and a further increase in
current has no effect on M (Fig. 6.28, point b).

[Fig. 6.28: hysteresis loop, M plotted against I, marking the saturation points (b and e) and the permanent-magnet points (c and f).]


Now suppose you reduce the current. Instead of retracing the path
back to M = 0, there is only a partial return to randomly oriented
domains; M decreases, but even with the current off there is some residual
magnetization (point c). The wrench is now a permanent magnet. If you
want to eliminate the remaining magnetization, you’ll have to run a current
backward through the coil (a negative I). Now the external field points to
the right, and as you increase I (negatively), M drops down to zero (point d).
If you turn I still higher, you soon reach saturation in the other direction,
with all the dipoles now pointing to the right (point e). At this stage, switching
off the current will leave the wrench with a permanent magnetization to
the right (point f). To complete the story, turn I on again in the positive
sense: M returns to zero (point g), and eventually to the forward saturation
point (b).

The path we have traced out is called a hysteresis loop. Notice that the
magnetization of the wrench depends not only on the applied field (that is,
on I), but also on its previous magnetic “history.”¹⁵ For instance, at three
different times in our experiment the current was zero (a, c, and f), yet the
magnetization was different for each of them. Actually, it is customary to
draw hysteresis loops as plots of B against H, rather than M against I. (If
our coil is approximated by a long solenoid, with n turns per unit length,
then H = nI, so H and I are proportional. Meanwhile, $\mathbf{B} = \mu_0(\mathbf{H} + \mathbf{M})$,
but in practice M is huge compared to H, so to all intents and purposes B
is proportional to M.¹⁶)

To make the units consistent (teslas), I have plotted ($\mu_0 H$) horizontally
(Fig. 6.29); notice, however, that the vertical scale is $10^4$ times greater than
the horizontal one. Roughly speaking, $\mu_0\mathbf{H}$ is the field our coil would
have produced in the absence of any iron; B is what we actually got, and
compared to $\mu_0\mathbf{H}$, it is gigantic. A little current goes a long way, when
you have ferromagnetic materials around. That’s why anyone who wants
to make a powerful electromagnet will wrap the coil around an iron core. It
doesn’t take much of an external field to move the domain boundaries, and
when you do that, you have all the dipoles in the iron working with you.

15 Etymologically, the word hysteresis has nothing to do with the word history, nor with the
word hysteria. It derives from a Greek verb meaning “lag behind.”

16 For very large H, with M constant (saturated), B becomes proportional to H, so whereas
the wings in Fig. 6.28 extend out horizontally, in Fig. 6.29 they increase linearly.

[Fig. 6.29: hysteresis loop, B plotted against $\mu_0 H$.]

One final point about ferromagnetism: it all follows, remember, from the 
fact that the dipoles within a given domain line up parallel to one another. 
Random thermal motions compete with this ordering, but as long as the 
temperature doesn’t get too high, they cannot budge the dipoles out of 
line. It’s not surprising, though, that very high temperatures do destroy the 
alignment. What is surprising is that this occurs at a precise temperature 
(770°C, for iron). Below this temperature (called the Curie point), iron is 
ferromagnetic; above, it is paramagnetic. The abrupt change is an example 
of a phase transition, like the freezing or boiling of water. 


Problem 6.20. How would you go about demagnetizing a permanent magnet (such as the
wrench we have been discussing, at point c in the hysteresis loop)? That is, how could you restore
it to its original state, with M = 0 at I = 0?


Problem 6.21.
(a) Show that the energy of a magnetic dipole m in a magnetic field B is

$U = -\mathbf{m}\cdot\mathbf{B}.$ (6.34)

[Assume that the magnitude of the dipole moment is fixed, and all you have to do is move
it into place and rotate it into its final orientation. The energy required to keep the current
flowing is a different problem, which we will confront in Chapter 7.] Compare Eq. 4.6.

(b) Show that the interaction energy of two magnetic dipoles separated by a displacement r is
given by

$U = \frac{\mu_0}{4\pi}\frac{1}{r^3}\left[\mathbf{m}_1\cdot\mathbf{m}_2 - 3(\mathbf{m}_1\cdot\hat{\mathbf{r}})(\mathbf{m}_2\cdot\hat{\mathbf{r}})\right].$ (6.35)

Compare Eq. 4.7.

Fig. 6.30

(c) Express your answer to (b) in terms of the angles $\theta_1$ and $\theta_2$ in Fig. 6.30, and use the result
to find the stable configuration two dipoles would adopt if held a fixed distance apart, but
left free to rotate.

(d) Suppose you had a large collection of compass needles, mounted on pins at regular intervals
along a straight line. How would they point (assuming the Earth’s magnetic field can
be neglected)? [A rectangular array of compass needles aligns itself spontaneously, and
this is sometimes used as a demonstration of “ferromagnetic” behavior on a large scale.
It’s a bit of a fraud, however, since the mechanism here is purely classical, and much
weaker than the quantum mechanical exchange forces that are actually responsible for
ferromagnetism.¹⁷]


More Problems on Chapter 6 


! Problem 6.22. In Prob. 6.4, you calculated the force on a dipole by “brute force.” Here’s a more
elegant approach. First write B(r) as a Taylor expansion about the center of the loop:

$\mathbf{B}(\mathbf{r}) \cong \mathbf{B}(\mathbf{r}_0) + [(\mathbf{r} - \mathbf{r}_0)\cdot\nabla_0]\,\mathbf{B}(\mathbf{r}_0),$

where $\mathbf{r}_0$ is the position of the dipole and $\nabla_0$ denotes differentiation with respect to $\mathbf{r}_0$. Put
this into the Lorentz force law (Eq. 5.16) to obtain

$\mathbf{F} = I\oint d\mathbf{l}\times[(\mathbf{r}\cdot\nabla_0)\,\mathbf{B}(\mathbf{r}_0)].$

Or, numbering the Cartesian coordinates from 1 to 3,

$F_i = I\sum_{j,k,l=1}^{3}\epsilon_{ijk}\left\{\oint r_l\,dl_j\right\}\left[\nabla_{0_l}B_k(\mathbf{r}_0)\right],$

where $\epsilon_{ijk}$ is the Levi-Civita symbol (+1 if ijk = 123, 231, or 312; −1 if ijk = 132,
213, or 321; 0 otherwise), in terms of which the cross product can be written
$(\mathbf{A}\times\mathbf{B})_i = \sum_{j,k=1}^{3}\epsilon_{ijk}A_jB_k$. Use Eq. 1.108 to evaluate the integral. Note that

$\sum_{j=1}^{3}\epsilon_{ijk}\epsilon_{ljm} = \delta_{il}\delta_{km} - \delta_{im}\delta_{kl},$

where $\delta_{ij}$ is the Kronecker delta (Prob. 3.57).


17 For an intriguing exception, see B. Parks, Am. J. Phys. 74, 351 (2006), Sect. II.

Problem 6.23. Here’s a three-line proof of Eq. 6.3:¹⁸ Use Prob. 1.61f to show that the force on
a current loop can be written $\mathbf{F} = I\int\nabla(\mathbf{B}\cdot d\mathbf{a})$. (The integration is over the area enclosed
by the loop, and $\nabla$ acts only on B.) Now suppose the loop is very small, so derivatives of B are
essentially constant over the area: $\nabla[\mathbf{B}(\mathbf{r})\cdot d\mathbf{a}] \cong \nabla_0[\mathbf{B}(\mathbf{r}_0)\cdot d\mathbf{a}]$, where $\mathbf{r}_0$ is the “center”
of the loop. Take it from there ....


Problem 6.24. A familiar toy consists of donut-shaped permanent magnets (magnetization
parallel to the axis), which slide frictionlessly on a vertical rod (Fig. 6.31). Treat the magnets as
dipoles, with mass $m_d$ and dipole moment m.

(a) If you put two back-to-back magnets on the rod, the upper one will “float,” the magnetic
force upward balancing the gravitational force downward. At what height (z) does it float?

(b) If you now add a third magnet (parallel to the bottom one), what is the ratio of the
two heights? (Determine the actual number, to three significant digits.) [Answer: (a)
$[3\mu_0 m^2/2\pi m_d g]^{1/4}$; (b) 0.8501]

Fig. 6.31 (panels a and b)


Problem 6.25. Imagine two charged magnetic dipoles (charge q, dipole moment m), con- 
strained to move on the z-axis (same as Prob. 6.24a, but without gravity). Electrically they repel, 
but magnetically (if both m’s point in the z direction) they attract. 


(a) Find the equilibrium separation distance. 

(b) What is the equilibrium separation for two electrons in this orientation? [Answer: $4.72\times10^{-13}$ m]

(c) Does there exist, then, a stable bound state of two electrons? 


18 Suggested by K. T. McDonald.

Problem 6.26. Notice the following parallel:

$\nabla\cdot\mathbf{D} = 0,\quad \nabla\times\mathbf{E} = 0,\quad \epsilon_0\mathbf{E} = \mathbf{D} - \mathbf{P}$ (no free charge),
$\nabla\cdot\mathbf{B} = 0,\quad \nabla\times\mathbf{H} = 0,\quad \mu_0\mathbf{H} = \mathbf{B} - \mu_0\mathbf{M}$ (no free current).

Thus, the transcription $\mathbf{D}\to\mathbf{B}$, $\mathbf{E}\to\mathbf{H}$, $\mathbf{P}\to\mu_0\mathbf{M}$, $\epsilon_0\to\mu_0$ turns an electrostatic
problem into an analogous magnetostatic one. Use this, together with your knowledge of the
electrostatic results, to rederive:


(a) the magnetic field inside a uniformly magnetized sphere (Eq. 6.16); 

(b) the magnetic field inside a sphere of linear magnetic material in an otherwise uniform 
magnetic field (Prob. 6.18); 

(c) the average magnetic field over a sphere, due to steady currents within the sphere 
(Eq. 5.93). 


Problem 6.27. Compare Eqs. 2.15, 4.9, and 6.11. Notice that if $\rho$, P, and M are uniform, the
same integral is involved in all three:

$\int\frac{\hat{\mathscr{r}}}{\mathscr{r}^2}\,d\tau'.$

Therefore, if you happen to know the electric field of a uniformly charged object, you can 
immediately write down the scalar potential of a uniformly polarized object, and the vector 
potential of a uniformly magnetized object, of the same shape. Use this observation to obtain V 
inside and outside a uniformly polarized sphere (Ex. 4.2), and A inside and outside a uniformly 
magnetized sphere (Ex. 6.1). 


Fig. 6.32 


Problem 6.28. At the interface between one linear magnetic material and another, the
magnetic field lines bend (Fig. 6.32). Show that $\tan\theta_2/\tan\theta_1 = \mu_2/\mu_1$, assuming there
is no free current at the boundary. Compare Eq. 4.68.


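Problem 6.29. A magnetic dipole m is embedded at the center of a sphere (radius R) of
linear magnetic material (permeability $\mu$). Show that the magnetic field inside the sphere
($0 < r \le R$) is

$\frac{\mu}{4\pi}\left\{\frac{1}{r^3}\left[3(\mathbf{m}\cdot\hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{m}\right] + \frac{2(\mu_0 - \mu)\,\mathbf{m}}{(2\mu_0 + \mu)R^3}\right\}.$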


What is the field outside the sphere? 
Problem 6.30. You are asked to referee a grant application, which proposes to determine
whether the magnetization of iron is due to “Ampère” dipoles (current loops) or “Gilbert” dipoles
(separated magnetic monopoles). The experiment will involve a cylinder of iron (radius R
and length L = 10R), uniformly magnetized along the direction of its axis. If the dipoles are
Ampère type, the magnetization is equivalent to a surface bound current $\mathbf{K}_b = M\hat{\boldsymbol{\phi}}$; if they
are Gilbert type, the magnetization is equivalent to surface monopole densities $\sigma_b = \pm M$
at the two ends. Unfortunately, these two configurations produce identical magnetic fields, at
exterior points. However, the interior fields are radically different: in the first case B is in the
same general direction as M, whereas in the second it is roughly opposite to M. The applicant
proposes to measure this internal field by carving out a small cavity and finding the torque on
a tiny compass needle placed inside.

Assuming that the obvious technical difficulties can be overcome, and that the question 
itself is worthy of study, would you advise funding this experiment? If so, what shape cavity 
would you recommend? If not, what is wrong with the proposal? [Hint: Refer to Probs. 4.11, 4.16, 
6.9, and 6.13.] 


Electrodynamics 


7.1 Electromotive Force 


7.1.1 Ohm's Law 


To make a current flow, you have to push on the charges. How fast they 
move, in response to a given push, depends on the nature of the material. 
For most substances, the current density J is proportional to the force per
unit charge, f:

$\mathbf{J} = \sigma\mathbf{f}.$ (7.1)

The proportionality factor $\sigma$ (not to be confused with surface charge) is
an empirical constant that varies from one material to another; it’s called
the conductivity of the medium. Actually, the handbooks usually list the
reciprocal of $\sigma$, called the resistivity: $\rho = 1/\sigma$ (not to be confused with
charge density; I’m sorry, but we’re running out of Greek letters, and
this is the standard notation). Some typical values are listed in Table 7.1.
Notice that even insulators conduct slightly, though the conductivity of
a metal is astronomically greater; in fact, for most purposes metals can
be regarded as perfect conductors, with $\sigma = \infty$, while for insulators we can
pretend $\sigma = 0$.

In principle, the force that drives the charges to produce the current could 
be anything — chemical, gravitational, or trained ants with tiny harnesses. 
For our purposes, though, it’s usually an electromagnetic force that does the 
job. In this case Eq. 7.1 becomes 


$\mathbf{J} = \sigma(\mathbf{E} + \mathbf{v}\times\mathbf{B}).$ (7.2)

Ordinarily, the velocity of the charges is sufficiently small that the second
term can be ignored:

$\mathbf{J} = \sigma\mathbf{E}.$ (7.3)


(However, in plasmas, for instance, the magnetic contribution to f can be 
significant.) Equation 7.3 is called Ohm’s law, though the physics behind it 
is really contained in Eq. 7.1, of which 7.3 is just a special case. 

I know: you’re confused because I said E = 0 inside a conductor
(Section 2.5.1). But that’s for stationary charges (J = 0). Moreover, for
perfect conductors $\mathbf{E} = \mathbf{J}/\sigma = 0$ even if current is flowing. In practice, metals
are such good conductors that the electric field required to drive current in
them is negligible. Thus we routinely treat the connecting wires in electric
circuits (for example) as equipotentials. Resistors, by contrast, are made
from poorly conducting materials.

Table 7.1. Resistivities, in ohm-meters (all values are for 1 atm, 20°C). Data from Handbook of
Chemistry and Physics, 91st ed., CRC Press, Boca Raton (2010) and other references.

Material       Resistivity       Material         Resistivity
Conductors:                      Semiconductors:
Silver         1.59 × 10^-8      Sea water        0.2
Copper         1.68 × 10^-8      Germanium        0.46
Gold           2.21 × 10^-8      Diamond          2.7
Aluminum       2.65 × 10^-8      Silicon          2500
Iron           9.61 × 10^-8      Insulators:
Mercury        9.61 × 10^-7      Water (pure)     8.3 × 10^3
Nichrome       1.08 × 10^-6      Glass            10^9 to 10^14
Manganese      1.44 × 10^-6      Rubber           10^13 to 10^15
Graphite       1.6 × 10^-5       Teflon           10^22 to 10^24


Example 7.1. A cylindrical resistor of cross-sectional area A and
length L is made from material with conductivity $\sigma$. (See Fig. 7.1;
as indicated, the cross section need not be circular, but I do assume
it is the same all the way down.) If we stipulate that the potential is
constant over each end, and the potential difference between the ends
is V, what current flows?

Fig. 7.1

Solution. As it turns out, the electric field is uniform within the wire
(I’ll prove this in a moment). It follows from Eq. 7.3 that the current
density is also uniform, so

$I = JA = \sigma EA = \frac{\sigma A}{L}V.$


Example 7.2. Two long coaxial metal cylinders (radii a and b) are separated
by material of conductivity $\sigma$ (Fig. 7.2). If they are maintained
at a potential difference V, what current flows from one to the other,
in a length L?

Solution. The field between the cylinders is

$\mathbf{E} = \frac{\lambda}{2\pi\epsilon_0 s}\,\hat{\mathbf{s}},$

where $\lambda$ is the charge per unit length on the inner cylinder. The current
is therefore

$I = \int\mathbf{J}\cdot d\mathbf{a} = \sigma\int\mathbf{E}\cdot d\mathbf{a} = \frac{\sigma}{\epsilon_0}\lambda L.$

(The integral is over any surface enclosing the inner cylinder.) Meanwhile,
the potential difference between the cylinders is

$V = -\int_b^a\mathbf{E}\cdot d\mathbf{l} = \frac{\lambda}{2\pi\epsilon_0}\ln\left(\frac{b}{a}\right),$

so

$I = \frac{2\pi\sigma L}{\ln(b/a)}\,V.$
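As a cross-check on Ex. 7.2, the same resistance can be built up another way: slice the material into thin coaxial shells, treat each one as a resistor of length ds and area $2\pi sL$ in the sense of Ex. 7.1, and add them in series. The sketch below does this numerically; the function names and the sample dimensions are illustrative assumptions, not part of the original example.

```python
import math


def coax_resistance_formula(a, b, length, sigma):
    """Closed-form result implied by Ex. 7.2: R = ln(b/a) / (2 pi sigma L)."""
    return math.log(b / a) / (2.0 * math.pi * sigma * length)


def coax_resistance_shells(a, b, length, sigma, n_shells=20000):
    """Sum thin cylindrical shells in series.

    Each shell of thickness ds at radius s contributes
    dR = ds / (sigma * 2 pi s L), i.e. Ex. 7.1 with length ds and area 2 pi s L.
    """
    ds = (b - a) / n_shells
    total = 0.0
    for k in range(n_shells):
        s_mid = a + (k + 0.5) * ds  # midpoint radius of shell k
        total += ds / (sigma * 2.0 * math.pi * s_mid * length)
    return total


# Hypothetical geometry and conductivity (SI units), chosen only for illustration.
a, b, length, sigma = 0.01, 0.03, 1.0, 5.0
r_formula = coax_resistance_formula(a, b, length, sigma)
r_shells = coax_resistance_shells(a, b, length, sigma)

print(r_formula, r_shells)
```

The two numbers agree to many digits, which is just the statement that the series sum is a Riemann sum for $\int_a^b ds/(2\pi\sigma s L)$.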


As these examples illustrate, the total current flowing from one electrode
to the other is proportional to the potential difference between them:

$V = IR.$ (7.4)

This, of course, is the more familiar version of Ohm’s law. The constant
of proportionality R is called the resistance; it’s a function of the geometry
of the arrangement and the conductivity of the medium between the electrodes.
(In Ex. 7.1, $R = L/\sigma A$; in Ex. 7.2, $R = \ln(b/a)/2\pi\sigma L$.) Resistance
is measured in ohms ($\Omega$): an ohm is a volt per ampere. Notice that the
proportionality between V and I is a direct consequence of Eq. 7.3: if you
want to double V, you simply double the charge on the electrodes; that
doubles E, which (for an ohmic material) doubles J, which doubles I.
For steady currents and uniform conductivity,

$\nabla\cdot\mathbf{E} = \frac{1}{\sigma}\nabla\cdot\mathbf{J} = 0$ (7.5)

(Eq. 5.33), and therefore the charge density is zero; any unbalanced charge
resides on the surface. (We proved this long ago, for the case of stationary
charges, using the fact that E = 0; turns out, it is still true when steady
currents flow, for media satisfying Eq. 7.3.) It follows, in particular, that
Laplace’s equation holds within a homogeneous ohmic material carrying
a steady current, so all the tools and tricks of Chapter 3 are available for
calculating the potential.


Example 7.3. I asserted that the field in Ex. 7.1 is uniform. Let’s prove 
it. 


Solution. Within the cylinder V obeys Laplace’s equation. What are
the boundary conditions? At the left-hand end the potential is constant;
we may as well set it equal to zero. At the right-hand end the potential
is likewise constant (call it $V_0$). On the cylindrical surface, $\mathbf{J}\cdot\hat{\mathbf{n}} = 0$, or
else charge would be leaking out into the surrounding space (which we
take to be nonconducting). Therefore $\mathbf{E}\cdot\hat{\mathbf{n}} = 0$, and hence $\partial V/\partial n = 0$.
With V or its normal derivative specified on all surfaces, the field is
uniquely determined (Prob. 3.5). But it’s easy to guess one potential
that obeys Laplace’s equation and fits these boundary conditions:

$V(z) = \frac{V_0 z}{L},$

where z is measured along the axis. The uniqueness theorem guarantees
that this is the solution. The corresponding field is

$\mathbf{E} = -\nabla V = -\frac{V_0}{L}\,\hat{\mathbf{z}},$

which is indeed uniform.

Contrast the enormously more difficult problem that arises if the
conducting material is removed, leaving only a metal plate at either end
(Fig. 7.3). Evidently, in the present case charge arranges itself over the
surface of the wire in just such a way as to produce a nice uniform field
within.¹
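The uniqueness argument of Ex. 7.3 can also be watched at work numerically. The sketch below is an illustration under simplifying assumptions, not the author’s method: it replaces the cylinder by a two-dimensional rectangular slab, discretizes Laplace’s equation on a small grid, fixes V on the two ends, imposes $\partial V/\partial n = 0$ on the sides by mirroring, and relaxes (Gauss-Seidel sweeps) from a deliberately poor starting guess. The grid size, sweep count, and function names are all hypothetical choices.

```python
def relax_potential(nz=21, nx=11, v0=1.0, sweeps=3000):
    """Relax Laplace's equation on an nz-by-nx grid.

    Index i runs along the axis (z), index j across the slab (x).
    Ends: V = 0 at i = 0 and V = v0 at i = nz - 1 (fixed values).
    Sides: dV/dn = 0, imposed by mirroring the neighbouring column.
    """
    # Poor initial guess: zero everywhere except the right-hand end.
    v = [[0.0] * nx for _ in range(nz)]
    for j in range(nx):
        v[nz - 1][j] = v0

    for _ in range(sweeps):
        for i in range(1, nz - 1):
            for j in range(nx):
                left = v[i][j - 1] if j > 0 else v[i][1]
                right = v[i][j + 1] if j < nx - 1 else v[i][nx - 2]
                # Each interior value becomes the average of its four neighbours.
                v[i][j] = 0.25 * (v[i - 1][j] + v[i + 1][j] + left + right)
    return v


def max_deviation_from_linear(v, v0=1.0):
    """Largest difference between the relaxed grid and V = v0 * z / L."""
    nz, nx = len(v), len(v[0])
    return max(
        abs(v[i][j] - v0 * i / (nz - 1)) for i in range(nz) for j in range(nx)
    )


grid = relax_potential()
deviation = max_deviation_from_linear(grid)
print(deviation)
```

Whatever the starting guess, the relaxed potential settles onto the linear ramp $V_0 z/L$, whose gradient (the field) is the same at every grid point.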


I don’t suppose there is any formula in physics more familiar than Ohm’s
law, and yet it’s not really a true law, in the sense of Coulomb’s or Ampère’s;
rather, it is a “rule of thumb” that applies pretty well to many substances.
You’re not going to win a Nobel prize for finding an exception. In fact, when
you stop to think about it, it’s a little surprising that Ohm’s law ever holds.
After all, a given field E produces a force qE (on a charge q), and according
to Newton’s second law, the charge will accelerate. But if the charges are
accelerating, why doesn’t the current increase with time, growing larger and
larger the longer you leave the field on? Ohm’s law implies, on the contrary,
that a constant field produces a constant current, which suggests a constant
velocity. Doesn’t that contradict Newton’s law?

1 Calculating this surface charge is not easy. See, for example, J. D. Jackson, Am. J. Phys. 64,
855 (1996). Nor is it a simple matter to determine the field outside the wire; see Prob. 7.45.

No, for we are forgetting the frequent collisions electrons make as they 
pass down the wire. It’s a little like this: Suppose you’re driving down a 
street with a stop sign at every intersection, so that, although you accelerate 
constantly in between, you are obliged to start all over again with each new 
block. Your average speed is then a constant, in spite of the fact that (save 
for the periodic abrupt stops) you are always accelerating. If the length of
a block is $\lambda$ and your acceleration is a, the time it takes to go a block is

$t = \sqrt{\frac{2\lambda}{a}},$

and hence your average velocity is

$v_{\text{ave}} = \frac{1}{2}at = \sqrt{\frac{\lambda a}{2}}.$


But wait! That’s no good either! It says that the velocity is proportional 
to the square root of the acceleration, and therefore that the current should 
be proportional to the square root of the field! There’s another twist to 
the story: In practice, the charges are already moving very fast because of 
their thermal energy. But the thermal velocities have random directions, and 
average to zero. The drift velocity we are concerned with is a tiny extra bit 
(Prob. 5.21). So the time between collisions is actually much shorter than
we supposed; if we assume for the sake of argument that all charges travel
the same distance $\lambda$ between collisions, then

$t = \frac{\lambda}{v_{\text{thermal}}},$

and therefore

$v_{\text{ave}} = \frac{1}{2}at = \frac{a\lambda}{2v_{\text{thermal}}}.$

If there are n molecules per unit volume, and f free electrons per molecule,
each with charge q and mass m, the current density is

$\mathbf{J} = nfq\,\mathbf{v}_{\text{ave}} = \frac{nfq\lambda}{2v_{\text{thermal}}}\frac{\mathbf{F}}{m} = \left(\frac{nf\lambda q^2}{2mv_{\text{thermal}}}\right)\mathbf{E}.$ (7.6)

I don’t claim that the term in parentheses is an accurate formula for the
conductivity,² but it does indicate the basic ingredients, and it correctly
predicts that conductivity is proportional to the density of the moving
charges and (ordinarily) decreases with increasing temperature.
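To get a feel for the two qualitative predictions of Eq. 7.6, here is a rough numerical sketch. Everything beyond the formula itself is an assumption made for illustration: the thermal speed is estimated from equipartition, $v_{\text{thermal}} = \sqrt{3kT/m}$, and the carrier density and the distance between collisions are hypothetical round numbers, not measured values.

```python
import math

K_B = 1.380649e-23       # Boltzmann constant, J/K
M_E = 9.1093837e-31      # electron mass, kg
Q_E = 1.602176634e-19    # elementary charge, C


def thermal_speed(temperature, mass=M_E):
    """Assumed equipartition estimate: v_thermal = sqrt(3 k T / m)."""
    return math.sqrt(3.0 * K_B * temperature / mass)


def drude_conductivity(n, f, mean_free_path, temperature, q=Q_E, m=M_E):
    """Term in parentheses in Eq. 7.6: n f lambda q^2 / (2 m v_thermal)."""
    return n * f * mean_free_path * q**2 / (2.0 * m * thermal_speed(temperature, m))


# Hypothetical inputs: n ~ 8.5e28 per cubic meter, one free electron each,
# and an assumed 1 nm between collisions.
N_DENSITY, F_FREE, LAMBDA = 8.5e28, 1, 1.0e-9

sigma_room = drude_conductivity(N_DENSITY, F_FREE, LAMBDA, 293.0)
sigma_hot = drude_conductivity(N_DENSITY, F_FREE, LAMBDA, 600.0)
sigma_dense = drude_conductivity(2 * N_DENSITY, F_FREE, LAMBDA, 293.0)

print(sigma_room, sigma_hot, sigma_dense)
```

In this model the temperature enters only through $v_{\text{thermal}}$, so the conductivity falls off as $1/\sqrt{T}$, while doubling the carrier density simply doubles it.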


2 This classical model (due to Drude) bears little resemblance to the modern quantum theory
of conductivity. See, for instance, D. Park’s Introduction to the Quantum Theory, 3rd ed.,
McGraw-Hill, New York (1992), Chap. 15.
As a result of all the collisions, the work done by the electrical force is
converted into heat in the resistor. Since the work done per unit charge is
V and the charge flowing per unit time is I, the power delivered is

$P = VI = I^2R.$ (7.7)

This is the Joule heating law. With I in amperes and R in ohms, P comes
out in watts (joules per second).
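A quick arithmetic illustration of Eq. 7.7, combined with V = IR: the two expressions for the power must agree for any ohmic resistor. The voltage and resistance below are made-up numbers, and the helper names are hypothetical.

```python
def power_from_voltage_and_current(voltage, current):
    """P = V I, in watts for volts and amperes."""
    return voltage * current


def power_from_current_and_resistance(current, resistance):
    """P = I^2 R, in watts for amperes and ohms."""
    return current**2 * resistance


# Made-up example: 12 V across an 8-ohm resistor.
V, R = 12.0, 8.0
I = V / R  # Ohm's law, V = I R, gives 1.5 A

p_vi = power_from_voltage_and_current(V, I)
p_i2r = power_from_current_and_resistance(I, R)

print(p_vi, p_i2r)  # both 18.0 W
```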


Fig. 7.4 (panels a and b)


Problem 7.1. Two concentric metal spherical shells, of radius a and b, respectively, are separated
by weakly conducting material of conductivity $\sigma$ (Fig. 7.4a).

(a) If they are maintained at a potential difference V, what current flows from one to the other?

(b) What is the resistance between the shells?

(c) Notice that if $b \gg a$ the outer radius (b) is irrelevant. How do you account for that? Exploit
this observation to determine the current flowing between two metal spheres, each of
radius a, immersed deep in the sea and held quite far apart (Fig. 7.4b), if the potential
difference between them is V. (This arrangement can be used to measure the conductivity
of sea water.)


Problem 7.2. A capacitor C has been charged up to potential $V_0$; at time t = 0, it is connected
to a resistor R, and begins to discharge (Fig. 7.5a).

Fig. 7.5 (panels a and b)

(a) Determine the charge on the capacitor as a function of time, Q(t). What is the current
through the resistor, I(t)?

(b) What was the original energy stored in the capacitor (Eq. 2.55)? By integrating Eq. 7.7,
confirm that the heat delivered to the resistor is equal to the energy lost by the capacitor.

Now imagine charging up the capacitor, by connecting it (and the resistor) to a battery of
voltage $V_0$, at time t = 0 (Fig. 7.5b).

(c) Again, determine Q(t) and I(t).

(d) Find the total energy output of the battery ($\int V_0 I\,dt$). Determine the heat delivered to
the resistor. What is the final energy stored in the capacitor? What fraction of the work done
by the battery shows up as energy in the capacitor? [Notice that the answer is independent




Fig. 7.6 


Problem 7.3. 


(a) Two metal objects are embedded in weakly conducting (nonpolarizable) material of
conductivity $\sigma$ (Fig. 7.6). Show that the resistance between them is related to the capacitance
of the arrangement by

$R = \frac{\epsilon_0}{\sigma C}.$

(b) Suppose you connected a battery between 1 and 2, and charged them up to a potential
difference $V_0$. If you then disconnect the battery, the charge will gradually leak off. Show
that $V(t) = V_0e^{-t/\tau}$, and find the time constant, $\tau$, in terms of $\epsilon_0$ and $\sigma$.


Problem 7.4. Suppose the conductivity of the material separating the cylinders in Ex. 7.2 is
not uniform; specifically, $\sigma(s) = k/s$, for some constant k. Find the resistance between the
cylinders. [Hint: Because $\sigma$ is a function of position, Eq. 7.5 does not hold, the charge density is
not zero in the resistive medium, and E does not go like 1/s. But we do know that for steady
currents I is the same across each cylindrical surface. Take it from there.]


7.1.2 Electromotive Force 


If you think about a typical electric circuit — a battery hooked up to a light 
bulb, say (Fig. 7.7) — a perplexing question arises: In practice, the current 
is the same all the way around the loop; why is this the case, when the only 
obvious driving force is inside the battery? Off hand, you might expect a 
large current in the battery and none at all in the lamp. Who’s doing the 
pushing, in the rest of the circuit, and how does it happen that this push 
is exactly right to produce the same current in each segment? What’s more, 
given that the charges in a typical wire move (literally) at a snail’s pace (see 
Prob. 5.21), why doesn’t it take half an hour for the current to reach the light 
bulb? How do all the charges know to start moving at the same instant? 


Answer: If the current were not the same all the way around (for instance, 
during the first split second after the switch is closed), then charge would 
be piling up somewhere, and — here’s the crucial point — the electric field 
of this accumulating charge is in such a direction as to even out the flow. 
Suppose, for instance, that the current into the bend in Fig. 7.8 is greater 
than the current out. Then charge piles up at the “knee,” and this produces 
a field aiming away from the kink.³ This field opposes the current flowing in
(slowing it down) and promotes the current flowing out (speeding it up) until 
these currents are equal, at which point there is no further accumulation of 
charge, and equilibrium is established. It’s a beautiful system, automatically 
self-correcting to keep the current uniform, and it does it all so quickly that, 
in practice, you can safely assume the current is the same all around the 
circuit, even in systems that oscillate at radio frequencies. 

There are really two forces involved in driving charge around a circuit: the
source, $q\mathbf{f}_s$, which is ordinarily confined to one portion of the loop (a battery,
say), and an electrostatic force, $q\mathbf{E}$, which serves to smooth out the flow
and communicate the influence of the source to distant parts of the circuit:

$\mathbf{f} = \mathbf{f}_s + \mathbf{E}.$ (7.8)

The physical agency responsible for $\mathbf{f}_s$ can be many different things: in a
battery it’s a chemical force; in a piezoelectric crystal, mechanical pressure
is converted into an electrical impulse; in a thermocouple it’s a temperature
gradient that does the job; in a photoelectric cell it’s light; and in a Van de
Graaff generator the electrons are literally loaded onto a conveyer belt and
swept along. Whatever the mechanism, its net effect is determined by the
line integral of f around the circuit:

$\mathcal{E} = \oint\mathbf{f}\cdot d\mathbf{l} = \oint\mathbf{f}_s\cdot d\mathbf{l}.$ (7.9)

(Because $\oint\mathbf{E}\cdot d\mathbf{l} = 0$ for electrostatic fields, it doesn’t matter whether you
use f or $\mathbf{f}_s$.) The quantity $\mathcal{E}$ is called the electromotive force, or emf, of the
circuit. It’s a lousy term, since this is not a force at all; it’s the integral of a
force per unit charge. Some people prefer the word electromotance, but emf
is so established that I think we’d better stick with it.


3 The amount of charge involved is surprisingly small — see W. G. V. Rosser, Am. J. Phys. 38, 
265 (1970); nevertheless, the resulting field can be detected experimentally — see R. Jacobs, 
A. de Salazar, and A. Nassar, Am. J. Phys. 78, 1432 (2010).

Within an ideal source of emf (a resistanceless battery,⁴ for instance),
the net force on the charges is zero (Eq. 7.1 with $\sigma = \infty$), so $\mathbf{E} = -\mathbf{f}_s$. The
potential difference between the terminals ($a$ and $b$) is therefore

$$V = -\int_a^b \mathbf{E}\cdot d\mathbf{l} = \int_a^b \mathbf{f}_s\cdot d\mathbf{l} = \oint \mathbf{f}_s\cdot d\mathbf{l} = \mathcal{E} \tag{7.10}$$


(we can extend the integral to the entire loop because $\mathbf{f}_s = 0$ outside the
source). The function of a battery, then, is to establish and maintain 
a voltage difference equal to the electromotive force (a 6 V battery, for 
example, holds the positive terminal 6 V above the negative terminal). The 
resulting electrostatic field drives current around the rest of the circuit 
(notice, however, that inside the battery $\mathbf{f}_s$ drives current in the direction
opposite to $\mathbf{E}$).⁵

Because it’s the line integral of $\mathbf{f}_s$, $\mathcal{E}$ can be interpreted as the work done
per unit charge, by the source — indeed, in some books electromotive force 
is defined this way. However, as you'll see in the next section, there is some 
subtlety involved in this interpretation, so I prefer Eq. 7.9. 


Problem 7.5. A battery of emf $\mathcal{E}$ and internal resistance $r$ is hooked up to a variable “load”
resistance, $R$. If you want to deliver the maximum possible power to the load, what resistance
$R$ should you choose? (You can’t change $\mathcal{E}$ and $r$, of course.)




Fig. 7.9 


Problem 7.6. A rectangular loop of wire is situated so that one end (height $h$) is between the
plates of a parallel-plate capacitor (Fig. 7.9), oriented parallel to the field $\mathbf{E}$. The other end is way
outside, where the field is essentially zero. What is the emf in this loop? If the total resistance 
is R, what current flows? Explain. [Warning: This is a trick question, so be careful; if you have 
invented a perpetual motion machine, there's probably something wrong with it.] 


7.1.3 Motional Emf 


In the last section, I listed several possible sources of electromotive force, 
batteries being the most familiar. But I did not mention the commonest one 


4 Real batteries have a certain internal resistance, $r$, and the potential difference between
their terminals is $\mathcal{E} - Ir$, when a current $I$ is flowing. For an illuminating discussion of how
batteries work, see D. Roberts, Am. J. Phys. 51, 829 (1983).

5 Current in an electric circuit is somewhat analogous to the flow of water in a closed system
of pipes, with gravity playing the role of the electrostatic field, and a pump (lifting the water
up against gravity) in the role of the battery. In this story, height is analogous to voltage.




of all: the generator. Generators exploit motional emfs, which arise when 
you move a wire through a magnetic field. Figure 7.10 suggests a primitive 
model for a generator. In the shaded region there is a uniform magnetic field 
B, pointing into the page, and the resistor R represents whatever it is (maybe 
a light bulb or a toaster) we’re trying to drive current through. If the entire 
loop is pulled to the right with speed v, the mobile charges in segment ab 
experience a magnetic force whose vertical component qvB drives current 
around the loop, in the clockwise direction. The emf is 


$$\mathcal{E} = \oint \mathbf{f}_{\text{mag}}\cdot d\mathbf{l} = vBh, \tag{7.11}$$


where $h$ is the width of the loop. (The horizontal segments $bc$ and $ad$
contribute nothing, since the force there is perpendicular to the wire.) 

Notice that the integral you perform to calculate $\mathcal{E}$ (Eq. 7.9 or 7.11) is
carried out at one instant of time — take a “snapshot” of the loop, if you 
like, and work from that. Thus dl, for the segment ab in Fig. 7.10, points 
straight up, even though the loop is moving to the right. You can’t quarrel 
with this —it’s simply the way emf is defined — but it is important to be clear 
about it. 

In particular, although the magnetic force is responsible for establishing 
the emf, it is not doing any work — magnetic forces never do work. Who, then, 
is supplying the energy that heats the resistor? Answer: The person who’s 
pulling on the loop, of course. But this is subtle, so listen closely. With the 
current flowing, the free charges in segment ab have a vertical velocity (call 
it u) in addition to the horizontal velocity v they inherit from the motion 
of the loop. Accordingly, the magnetic force has a component quB to the 
left. To counteract this, the person pulling on the wire must exert a force 
per unit charge 


$$f_{\text{pull}} = uB$$


to the right (Fig. 7.11). This force is transmitted to the charge by the 
structure of the wire. 

Meanwhile, the particle is actually moving in the direction of the resultant
velocity $\mathbf{w}$, and the distance it goes is $(h/\cos\theta)$. The work done per
unit charge (by the person pulling on the loop, on a charge as it ascends the
“vertical” segment) is therefore

$$\int \mathbf{f}_{\text{pull}}\cdot d\mathbf{l} = (uB)\left(\frac{h}{\cos\theta}\right)\sin\theta = vBh = \mathcal{E}$$

[Fig. 7.12: (a) Integration path for computing $\mathcal{E}$ (follow the wire at one instant of time). (b) Integration path for calculating work done (follow the charge around the loop).]


($\sin\theta$ coming from the dot product). As it turns out, then, the work done
per unit charge is exactly equal to the emf, though the integrals are taken 
along entirely different paths (Fig. 7.12), and completely different forces are 
involved. To calculate the emf, you integrate around the loop at one instant, 
but to calculate the work done you follow a charge in its journey around the 
loop; $\mathbf{f}_{\text{pull}}$ contributes nothing to the emf, because it is perpendicular to the
wire, whereas $\mathbf{f}_{\text{mag}}$ contributes nothing to work because it is perpendicular
to the net motion of the charge.⁶
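
The equality of the two integrals can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function name and the numerical values are assumptions, not part of the text. It evaluates the snapshot integral $vBh$ and the work integral $(uB)(h/\cos\theta)\sin\theta$, with $\tan\theta = v/u$.

```python
import math

def emf_and_work(v, u, B, h):
    """Return (emf, work per unit charge) for the moving loop of Fig. 7.10.

    v: speed of the loop, u: drift speed of the charge along segment ab,
    B: field magnitude, h: length of segment ab (illustrative SI values).
    """
    emf = v * B * h                    # Eq. 7.11: snapshot integral of f_mag along the wire
    theta = math.atan2(v, u)           # angle between the resultant velocity w and the wire
    f_pull = u * B                     # force per unit charge supplied by the puller
    path = h / math.cos(theta)         # distance the charge travels while climbing ab
    work = f_pull * path * math.sin(theta)   # f_pull . dl along the actual trajectory
    return emf, work

emf, work = emf_and_work(v=2.0, u=0.5, B=0.3, h=0.1)
print(emf, work)   # the two numbers agree to rounding error
```

Changing $u$ (the drift speed) changes $\theta$ and the path length, but not the product, which is why the work per unit charge always matches the emf.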

There is a particularly nice way of expressing the emf generated in a 
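moving loop. Let $\Phi$ be the flux of $\mathbf{B}$ through the loop:

$$\Phi \equiv \int \mathbf{B}\cdot d\mathbf{a}. \tag{7.12}$$

For the rectangular loop in Fig. 7.10,

$$\Phi = Bhx.$$

As the loop moves, the flux decreases:

$$\frac{d\Phi}{dt} = Bh\frac{dx}{dt} = -Bhv.$$

(The minus sign accounts for the fact that $dx/dt$ is negative.) But this is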
precisely the emf (Eq. 7.11); the emf generated in the loop is minus the rate 
of change of flux through the loop: 


$$\mathcal{E} = -\frac{d\Phi}{dt}. \tag{7.13}$$

This is the flux rule for motional emf.

6 For further discussion, see E. P. Mosca, Am. J. Phys. 42, 295 (1974).

[Fig. 7.13: a loop at time $t$ and at time $t + dt$, showing surface $\mathcal{S}$, the ribbon joining the two positions, and an enlargement of $d\mathbf{a}$.]

Apart from its delightful simplicity, the flux rule has the virtue of 
applying to nonrectangular loops moving in arbitrary directions through 
nonuniform magnetic fields; in fact, the loop need not even maintain a fixed 
shape. 


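Proof. Figure 7.13 shows a loop of wire at time $t$, and also a short
time $dt$ later. Suppose we compute the flux at time $t$, using surface $\mathcal{S}$,
and the flux at time $t + dt$, using the surface consisting of $\mathcal{S}$ plus the
“ribbon” that connects the new position of the loop to the old. The
change in flux, then, is

$$d\Phi = \Phi(t + dt) - \Phi(t) = \Phi_{\text{ribbon}} = \int_{\text{ribbon}} \mathbf{B}\cdot d\mathbf{a}.$$

Focus your attention on point $P$: in time $dt$, it moves to $P'$. Let $\mathbf{v}$ be
the velocity of the wire, and $\mathbf{u}$ the velocity of a charge down the wire;
$\mathbf{w} = \mathbf{v} + \mathbf{u}$ is the resultant velocity of a charge at $P$. The infinitesimal
element of area on the ribbon can be written as

$$d\mathbf{a} = (\mathbf{v}\times d\mathbf{l})\,dt$$

(see inset in Fig. 7.13). Therefore

$$\frac{d\Phi}{dt} = \oint \mathbf{B}\cdot(\mathbf{v}\times d\mathbf{l}).$$

Since $\mathbf{w} = (\mathbf{v} + \mathbf{u})$ and $\mathbf{u}$ is parallel to $d\mathbf{l}$, we can just as well write this as

$$\frac{d\Phi}{dt} = \oint \mathbf{B}\cdot(\mathbf{w}\times d\mathbf{l}).$$

Now, the scalar triple product can be rewritten:

$$\mathbf{B}\cdot(\mathbf{w}\times d\mathbf{l}) = -(\mathbf{w}\times\mathbf{B})\cdot d\mathbf{l},$$

so

$$\frac{d\Phi}{dt} = -\oint (\mathbf{w}\times\mathbf{B})\cdot d\mathbf{l}.$$

But $(\mathbf{w}\times\mathbf{B})$ is the magnetic force per unit charge, $\mathbf{f}_{\text{mag}}$, so

$$\frac{d\Phi}{dt} = -\oint \mathbf{f}_{\text{mag}}\cdot d\mathbf{l},$$

and the integral of $\mathbf{f}_{\text{mag}}$ is the emf:

$$\mathcal{E} = -\frac{d\Phi}{dt}. \qquad \Box$$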


There is a sign ambiguity in the definition of emf (Eq. 7.9): Which way 
around the loop are you supposed to integrate? There is a compensating 
ambiguity in the definition of flux (Eq. 7.12): Which is the positive direction 
for da? In applying the flux rule, sign consistency is governed (as always) 
by your right hand: if your fingers define the positive direction around the 
loop, then your thumb indicates the direction of da. Should the emf come 
out negative, it means the current will flow in the negative direction around 
the circuit. 
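
As a rough numerical illustration of the flux rule and its sign convention, the sketch below moves a rectangular loop along $+x$ through a nonuniform field and compares $\oint(\mathbf{v}\times\mathbf{B})\cdot d\mathbf{l}$ with $-d\Phi/dt$. Everything specific here is an assumption made for the example: the field profile $B_z(x) = B_0 e^{-x/L}$, the loop dimensions, and the function names. The positive direction is taken counterclockwise as seen from $+z$, so $d\mathbf{a}$ points along $+z$.

```python
import math

B0, L = 0.5, 0.2                 # hypothetical field profile B_z(x) = B0 * exp(-x / L)
h, width, v = 0.1, 0.05, 3.0     # loop height (y), loop width (x), speed along +x

def B(x):
    return B0 * math.exp(-x / L)

def flux(x_left, n=2000):
    """Flux through the loop (da along +z), midpoint rule in x."""
    dx = width / n
    return h * sum(B(x_left + (i + 0.5) * dx) for i in range(n)) * dx

def emf_lorentz(x_left):
    """Counterclockwise line integral of v x B at one instant.

    v x B = -v B y_hat, so only the two sides parallel to y contribute:
    the right side is traversed along +y, the left side along -y.
    """
    return v * h * (B(x_left) - B(x_left + width))

def emf_flux_rule(x_left, dt=1e-6):
    """-dPhi/dt from a centered finite difference as the loop advances."""
    return -(flux(x_left + v * dt) - flux(x_left - v * dt)) / (2 * dt)

x0 = 0.1
print(emf_lorentz(x0), emf_flux_rule(x0))
```

Both numbers come out positive: the flux along $+z$ is dropping, so the current runs counterclockwise, in the positive direction fixed by the right-hand rule.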

The flux rule is a nifty shortcut for calculating motional emfs. It does 
not contain any new physics — just the Lorentz force law. But it can lead 
to error or ambiguity if you’re not careful. The flux rule assumes you have 
a single wire loop — it can move, rotate, stretch, or distort (continuously), 
but beware of switches, sliding contacts, or extended conductors allowing a 
variety of current paths. A standard “flux rule paradox” involves the circuit 
in Fig. 7.14. When the switch is thrown (from a to b) the flux through the 
circuit doubles, but there’s no motional emf (no conductor moving through 
a magnetic field), and the ammeter (A) records no current. 


Example 7.4. A metal disk of radius $a$ rotates with angular velocity
$\omega$ about a vertical axis, through a uniform field $\mathbf{B}$, pointing up. A
circuit is made by connecting one end of a resistor to the axle and 
the other end to a sliding contact, which touches the outer edge of the 
disk (Fig. 7.15). Find the current in the resistor. 




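Solution. The speed of a point on the disk at a distance $s$ from the
axis is $v = \omega s$, so the force per unit charge is $\mathbf{f}_{\text{mag}} = \mathbf{v}\times\mathbf{B} = \omega s B\,\hat{\mathbf{s}}$. The
emf is therefore

$$\mathcal{E} = \int_0^a f_{\text{mag}}\,ds = \omega B\int_0^a s\,ds = \frac{\omega B a^2}{2},$$

and the current is

$$I = \frac{\mathcal{E}}{R} = \frac{\omega B a^2}{2R}.$$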


Example 7.4 (the Faraday disk, or Faraday dynamo) involves a motional 
emf that you can’t calculate (at least, not directly) from the flux rule. The 
flux rule assumes the current flows along a well-defined path, whereas in 
this example the current spreads out over the whole disk. It’s not even clear 
what the “flux through the circuit” would mean in this context. 
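
For a sense of scale, the radial integral in Ex. 7.4 can be done numerically and compared with the closed form. This is a minimal sketch; the function name and the numbers (a 5 cm disk at 50 revolutions per second in a 0.1 T field, with a 2 Ω resistor) are assumed for illustration only.

```python
import math

def disk_emf(omega, B, a, n=1000):
    """Integrate f_mag = omega * s * B from the axle (s = 0) to the rim (s = a), midpoint rule."""
    ds = a / n
    return sum(omega * ((i + 0.5) * ds) * B for i in range(n)) * ds

omega = 2 * math.pi * 50      # rad/s, illustrative
B, a, R = 0.1, 0.05, 2.0      # tesla, metres, ohms (assumed values)

emf = disk_emf(omega, B, a)
closed_form = omega * B * a**2 / 2
current = emf / R
print(emf, closed_form, current)
```

The integration runs along a radius from axle to rim, which is one convenient path; the current itself spreads over the disk, as noted above.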

Even more tricky is the case of eddy currents. Take a chunk of aluminum 
(say), and shake it around in a nonuniform magnetic field. Currents will 
be generated in the material, and you will feel a kind of “viscous drag” — 
as though you were pulling the block through molasses (this is the force I 
called $\mathbf{f}_{\text{pull}}$ in the discussion of motional emf). Eddy currents are notoriously
difficult to calculate,⁷ but easy and dramatic to demonstrate. You may have
witnessed the classic experiment in which an aluminum disk mounted as a

7 See, for example, W. M. Saslow, Am. J. Phys. 60, 693 (1992).

pendulum on a horizontal axis swings down and passes between the poles
of a magnet (Fig. 7.16a). When it enters the field region it suddenly slows 
way down. To confirm that eddy currents are responsible, one repeats the 
demonstration using a disk that has many slots cut in it, to prevent the 
flow of large-scale currents (Fig. 7.16b). This time the disk swings freely, 
unimpeded by the field. 


Example 7.5.⁸ There are precious few eddy current problems that can
be treated analytically. Here’s a rare example, solved by Hertz (at the
age of 23) in 1880. Imagine a uniform, electrically neutral, spherical
shell, of inner radius $R_1$, outer radius $R_2$, and conductivity $\sigma$, centered
at the origin and spinning with constant angular velocity $\boldsymbol{\omega}$, in the
presence of a uniform magnetic field $\mathbf{B}$.⁹ Find the eddy currents in the
shell, in the steady state (fields, charges, currents all constant in time),
if (a) $\boldsymbol{\omega}$ and $\mathbf{B}$ are in the same direction and (b) they are perpendicular.


Solution. In electrostatics the charge density in a conductor is zero 
(any excess charge goes to the surface), and this is still true for 
steady currents in conductors at rest, as we found in Eq. 7.5. But for 
a conductor moving through a magnetic field we cannot ignore the 
velocity term in Eq. 7.2. Indeed, taking the divergence, and invoking 
Gauss’s law and the continuity equation, 


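$$\nabla\cdot\mathbf{J} = \sigma[\nabla\cdot\mathbf{E} + \nabla\cdot(\mathbf{v}\times\mathbf{B})] = \sigma\left[\frac{1}{\epsilon_0}\rho + \nabla\cdot(\mathbf{v}\times\mathbf{B})\right] = -\frac{\partial\rho}{\partial t}.$$

In the steady state $\rho$ is constant, and we are left with

$$\rho = -\epsilon_0\nabla\cdot(\mathbf{v}\times\mathbf{B}). \tag{7.14}$$

(a) We might as well choose axes such that $\mathbf{B} = B\hat{\mathbf{z}}$ and $\boldsymbol{\omega} = \omega\hat{\mathbf{z}}$. For
any point in a rigid rotating object, $\mathbf{v} = \boldsymbol{\omega}\times\mathbf{r}$, so, using the BAC-CAB
rule,

$$\mathbf{v}\times\mathbf{B} = (\boldsymbol{\omega}\times\mathbf{r})\times\mathbf{B} = \mathbf{r}(\mathbf{B}\cdot\boldsymbol{\omega}) - \boldsymbol{\omega}(\mathbf{B}\cdot\mathbf{r}) = \omega B(\mathbf{r} - z\hat{\mathbf{z}}) = \omega B(x\hat{\mathbf{x}} + y\hat{\mathbf{y}}), \tag{7.15}$$

and therefore

$$\rho = -2\epsilon_0\omega B. \tag{7.16}$$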


(There will be compensating charge at the two surfaces, of course.) The 
electric potential satisfies Poisson’s equation 


$$\nabla^2 V = -\frac{1}{\epsilon_0}\rho = 2\omega B \quad (R_1 < r < R_2) \tag{7.17}$$


8 This example can be skipped without loss of continuity. For details and practical applications
see M. A. Nurge et al., Am. J. Phys. 85, 596 (2017).

9 Assume that any magnetic fields produced by the eddy currents themselves are negligible,
or that they are included in $\mathbf{B}$.

and the radial current vanishes at the boundaries (otherwise charge
would be leaking off into space):


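$$J_r = \sigma[\mathbf{E} + \mathbf{v}\times\mathbf{B}]\cdot\hat{\mathbf{r}} = 0 \quad (\text{at } r = R_1 \text{ and } R_2). \tag{7.18}$$

Thus

$$E_r = -\omega B(x\hat{\mathbf{x}} + y\hat{\mathbf{y}})\cdot\hat{\mathbf{r}} = -\frac{\omega B}{r}(x^2 + y^2) = -\omega B r\sin^2\theta \tag{7.19}$$

at each surface:

$$\left.\frac{\partial V}{\partial r}\right|_{R_1} = \omega B R_1\sin^2\theta, \qquad \left.\frac{\partial V}{\partial r}\right|_{R_2} = \omega B R_2\sin^2\theta. \tag{7.20}$$

Having specified the charge density (Eq. 7.16) and the normal derivative
of $V$ on the boundaries (Eq. 7.20), the field in the shell is uniquely
determined (Prob. 3.5). In fact, it’s easy to guess the potential:

$$V(r,\theta) = \frac{\omega B}{2}r^2\sin^2\theta + K = \frac{\omega B}{2}(x^2 + y^2) + K \tag{7.21}$$

(for some constant $K$), and the field is

$$\mathbf{E} = -\nabla V = -\omega B(x\hat{\mathbf{x}} + y\hat{\mathbf{y}}) = -\mathbf{v}\times\mathbf{B} \tag{7.22}$$

(Eq. 7.15). As it turns out, then, the electric force exactly cancels the
magnetic force; there is no eddy current in this configuration,¹⁰

$$\mathbf{J} = 0.$$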
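(b) This time we’ll take $\boldsymbol{\omega} = \omega\hat{\mathbf{x}}$ and $\mathbf{B} = B\hat{\mathbf{y}}$. Then

$$\mathbf{v}\times\mathbf{B} = \mathbf{r}(\mathbf{B}\cdot\boldsymbol{\omega}) - \boldsymbol{\omega}(\mathbf{B}\cdot\mathbf{r}) = -\omega B y\,\hat{\mathbf{x}},$$

which is divergenceless, so the charge density vanishes (Eq. 7.14):

$$\rho = 0$$

(there may still be induced surface charges). Laplace’s equation holds
within the shell,

$$\nabla^2 V = 0,$$

and Eq. 7.19 becomes

$$E_r = -(\mathbf{v}\times\mathbf{B})\cdot\hat{\mathbf{r}} = \omega B y(\hat{\mathbf{x}}\cdot\hat{\mathbf{r}}) = \omega B r\sin^2\theta\sin\phi\cos\phi$$

at each boundary:

$$\left.\frac{\partial V}{\partial r}\right|_{R_1} = -\omega B R_1\sin^2\theta\sin\phi\cos\phi, \qquad \left.\frac{\partial V}{\partial r}\right|_{R_2} = -\omega B R_2\sin^2\theta\sin\phi\cos\phi.$$

The solution is

$$V(r,\theta,\phi) = -\frac{\omega B}{2}r^2\sin^2\theta\sin\phi\cos\phi + K = -\frac{\omega B}{2}xy + K \quad (R_1 < r < R_2),$$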


10 There is a convection current, since $\rho$ (and the surface charges) are dragged around by the
spinning of the shell, but (as you can check for yourself) the ratio $J_{\text{conv}}/J_{\text{cond}}$ is typically of
the order of $\epsilon_0\omega/\sigma$, which is utterly negligible for (say) aluminum at 100 rpm.

and the electric field is

$$\mathbf{E} = -\nabla V = \frac{\omega B}{2}(y\hat{\mathbf{x}} + x\hat{\mathbf{y}}).$$

The eddy current is

$$\mathbf{J} = \sigma[\mathbf{E} + (\mathbf{v}\times\mathbf{B})] = \frac{\sigma\omega B}{2}(-y\hat{\mathbf{x}} + x\hat{\mathbf{y}}) = \frac{\sigma\omega B}{2}r\sin\theta\,\hat{\boldsymbol{\phi}}.$$

Remarkably, this is just what you would get if the shell had been a
nonconductor carrying a uniform charge density $\rho = B\sigma/2$ and spinning
with angular velocity $\omega$ about the $z$-axis. That’s right: the $z$-axis!
Incidentally, for $\boldsymbol{\omega}$ and $\mathbf{B}$ in arbitrary orientations the formula is

$$\mathbf{J} = -\frac{\sigma}{2}[\mathbf{r}\times(\boldsymbol{\omega}\times\mathbf{B})].$$
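
A quick way to gain confidence in the result of part (b) is to evaluate $\sigma[\mathbf{E} + \mathbf{v}\times\mathbf{B}]$ at a sample point and compare it with the general formula $-(\sigma/2)\,\mathbf{r}\times(\boldsymbol{\omega}\times\mathbf{B})$. The sketch below does this with plain tuples; the helper names, the sample point, and the numerical values (including the rough conductivity for aluminum) are assumptions for illustration.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def eddy_current_b(r, omega, B, sigma):
    """J = sigma [E + v x B] for omega along x and B along y, with E = (omega B / 2)(y, x, 0)."""
    x, y, z = r
    E = (0.5 * omega * B * y, 0.5 * omega * B * x, 0.0)
    v = cross((omega, 0.0, 0.0), r)        # v = omega x r
    vxB = cross(v, (0.0, B, 0.0))          # equals (-omega B y, 0, 0)
    return tuple(sigma * (E[i] + vxB[i]) for i in range(3))

def eddy_current_general(r, omega_vec, B_vec, sigma):
    """J = -(sigma / 2) r x (omega x B), the formula for arbitrary orientations."""
    c = cross(r, cross(omega_vec, B_vec))
    return tuple(-0.5 * sigma * ci for ci in c)

sigma = 3.5e7                 # S/m, roughly aluminum (assumed)
omega, B = 10.0, 5e-5         # rad/s and tesla, illustrative
r = (0.03, 0.04, 0.02)        # a sample point inside the shell, metres

J_direct = eddy_current_b(r, omega, B, sigma)
J_formula = eddy_current_general(r, (omega, 0.0, 0.0), (0.0, B, 0.0), sigma)
J_radial = sum(J_direct[i] * r[i] for i in range(3))   # proportional to J . r_hat
print(J_direct, J_formula, J_radial)
```

The radial component vanishes at every point, not just on the two surfaces, which is consistent with the current circulating about the $z$-axis. Setting $\boldsymbol{\omega}$ parallel to $\mathbf{B}$ in `eddy_current_general` returns zero, in agreement with part (a).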


Problem 7.7. A metal bar of mass $m$ slides frictionlessly on two parallel conducting rails a
distance $l$ apart (Fig. 7.17). A resistor $R$ is connected across the rails, and a uniform magnetic
field $\mathbf{B}$, pointing into the page, fills the entire region.


(a) If the bar moves to the right at speed $v$, what is the current in the resistor? In what direction
does it flow?

(b) What is the magnetic force on the bar? In what direction?

(c) If the bar starts out with speed $v_0$ at time $t = 0$, and is left to slide, what is its speed at a
later time $t$?

(d) The initial kinetic energy of the bar was, of course, $\frac{1}{2}mv_0^2$. Check that the energy delivered
to the resistor is exactly $\frac{1}{2}mv_0^2$.


Fig. 7.17 


Problem 7.8. A square loop of wire (side length $a$) lies on a table, a distance $s$ from a very
long straight wire, which carries a current $I$, as shown in Fig. 7.18.


(a) Find the flux of $\mathbf{B}$ through the loop.

(b) If someone now pulls the loop directly away from the wire, at speed $v$, what emf is
generated? In what direction (clockwise or counterclockwise) does the current flow?

(c) What if the loop is pulled to the right at speed $v$?

Fig. 7.18 


Problem 7.9. An infinite number of different surfaces can be fit to a given boundary line, and 
yet, in defining the magnetic flux through a loop, $\Phi = \int\mathbf{B}\cdot d\mathbf{a}$, I never specified the particular
surface to be used. Justify this apparent oversight. 


Problem 7.10. A square loop (side length a) is mounted on a vertical shaft and rotated at 
angular velocity $\omega$ (Fig. 7.19). A uniform magnetic field $\mathbf{B}$ points to the right. Find $\mathcal{E}(t)$ for this
alternating current generator. 


Problem 7.11. A square loop is cut out of a thick sheet of aluminum. It is then placed so that 
the top portion is in a uniform magnetic field B, and is allowed to fall under gravity (Fig. 7.20). 
(In the diagram, shading indicates the field region; B points into the page.) If the magnetic field 
is 1 T (a pretty standard laboratory field), find the terminal velocity of the loop (in m/s). Find 
the velocity of the loop as a function of time. How long does it take (in seconds) to reach, say, 
90 percent of the terminal velocity? What would happen if you cut a tiny slit in the ring, breaking
the circuit? [Note: The dimensions of the loop cancel out; determine the actual numbers, in the 
units indicated.] 


Problem 7.12. Calculate the net magnetic torque on the eddy current in Ex. 7.5b. This torque 
will act as a brake on the rotation. What is the time constant for the (exponential) damping? 
Assume that the angular velocity lost in a given rotation is a small fraction of the total. Find the 
actual number, for aluminum (mass density $2.7\times10^3$ kg/m³) in the Earth’s magnetic field.




Fig. 7.19 Fig. 7.20


7.2 Electromagnetic Induction


7.2.1 Faraday’s Law 


In 1831, Michael Faraday reported on a series of experiments, including 
three that (with some violence to history) can be characterized as follows: 


Experiment 1 He pulled a loop of wire to the right through a magnetic field
(Fig. 7.21a). A current flowed in the loop. 


Experiment 2 He moved the magnet to the left, holding the loop still 
(Fig. 7.21b). Again, a current flowed in the loop. 


Experiment 3 With both the loop and the magnet at rest (Fig. 7.21c), he 
changed the strength of the field (he used an electromagnet, and varied the 
current in the coil). Once again, current flowed in the loop. 


The first experiment, of course, is a straightforward case of motional
emf; according to the flux rule,

$$\mathcal{E} = -\frac{d\Phi}{dt}.$$


I don’t think it will surprise you to learn that exactly the same emf arises in 
Experiment 2 — all that really matters is the relative motion of the magnet 
and the loop. Indeed, in the light of special relativity it has to be so. But 
Faraday knew nothing of relativity, and in classical electrodynamics this 
simple reciprocity is a remarkable coincidence. For if the loop moves, it’s a
magnetic force that sets up the emf, but if the loop is stationary, the force 
cannot be magnetic — stationary charges experience no magnetic forces. In 
that case, what is responsible? What sort of field exerts a force on charges 
at rest? Well, electric fields do, of course, but there doesn’t seem to be any 
electric field in sight. 
Faraday had an ingenious inspiration: 


A changing magnetic field induces an electric field.


[Fig. 7.21: panels (a), (b), and (c); panel (c) is labeled “Changing magnetic field.”]

It is this induced¹¹ electric field that accounts for the emf in Experiment 2.¹²
Indeed, if (as Faraday found empirically) the emf is again equal to the rate
of change of the flux,

$$\mathcal{E} = \oint \mathbf{E}\cdot d\mathbf{l} = -\frac{d\Phi}{dt}, \tag{7.23}$$

then $\mathbf{E}$ is related to the change in $\mathbf{B}$ by the equation

$$\oint \mathbf{E}\cdot d\mathbf{l} = -\int \frac{\partial\mathbf{B}}{\partial t}\cdot d\mathbf{a}. \tag{7.24}$$

This is Faraday’s law, in integral form. We can convert it to differential form
by applying Stokes’ theorem (see footnote to Eq. 2.14):

$$\nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}. \tag{7.25}$$

Note that Faraday’s law reduces to the old rule $\oint\mathbf{E}\cdot d\mathbf{l} = 0$ (or, in differential
form, $\nabla\times\mathbf{E} = 0$) in the static case (constant $\mathbf{B}$) as, of course, it should.

In Experiment 3, the magnetic field changes for entirely different reasons, 
but according to Faraday’s law an electric field will again be induced, giving 
rise to an emf $-d\Phi/dt$. Indeed, one can subsume all three cases (and for that
matter any combination of them) into a kind of universal flux rule: 


Whenever (and for whatever reason) the magnetic flux through a loop 
changes, an emf 


$$\mathcal{E} = -\frac{d\Phi}{dt} \tag{7.26}$$

will appear in the loop. 


Many people call this “Faraday’s law.”¹³ Maybe I’m overly fastidious,
but I find this confusing. There are really two totally different mechanisms 
underlying Eq. 7.26, and to identify them both as “Faraday’s law” is a little 
like saying that because identical twins look alike we ought to call them 
by the same name. In Faraday’s first experiment it’s the Lorentz force law 
at work; the emf is magnetic. But in the other two it’s an electric field 
(induced by the changing magnetic field) that does the job. Viewed in this 


11 “Induce” is a subtle and slippery verb. It carries a faint odor of causation (“produce” would 
make this explicit) without quite committing itself. There is a sterile ongoing debate in the 
literature as to whether a changing magnetic field should be regarded as an independent 
“source” of electric fields (along with electric charge) — after all, the magnetic field itself is 
due to electric currents. It’s like asking whether the postman is the “source” of my mail. 
Well, sure — he delivered it to my door. On the other hand, Grandma wrote the letter. 
Ultimately, p and J are the sources of all electromagnetic fields, and a changing magnetic 
field merely delivers electromagnetic news from currents somewhere else. But it is often 
convenient to think of a changing magnetic field “producing” an electric field, and it won’t 
hurt you as long as you understand that this is the condensed version of a more complicated 
story. For a nice discussion, see S. E. Hill, Phys. Teach. 48, 410 (2010). 

12 You might argue that the magnetic field in Experiment 2 is not really changing, just moving.
But if you sit at a fixed location, the field you experience changes as the magnet passes by.
Electrical engineers like to say that the emf is generated by the “cutting” of magnetic field
lines.

light, it is quite astonishing that all three processes yield the same formula
for the emf. In fact, it was precisely this “coincidence” that led Einstein 
to the special theory of relativity — he sought a deeper understanding of 
what is, in classical electrodynamics, a peculiar accident. But that’s a story 
for Chapter 12. In the meantime, I shall reserve the term “Faraday’s law” 
for electric fields induced by changing magnetic fields, and I do not regard 
Experiment 1 as an instance of Faraday’s law. 


Example 7.6. A long cylindrical magnet of length L and radius a 
carries a uniform magnetization M parallel to its axis. It passes at 
constant velocity v through a circular wire ring of slightly larger 
diameter (Fig. 7.22). Graph the emf induced in the ring, as a function 
of time. 




Solution. The magnetic field is the same as that of a long solenoid with
surface current $\mathbf{K}_b = M\hat{\boldsymbol{\phi}}$. So the field inside is $\mathbf{B} = \mu_0\mathbf{M}$, except near
the ends, where it starts to spread out. The flux through the ring is zero
when the magnet is far away; it builds up to a maximum of $\mu_0 M\pi a^2$ as
the leading end passes through; and it drops back to zero as the trailing
end emerges (Fig. 7.23a). The emf is (minus) the derivative of $\Phi$ with
respect to time, so it consists of two spikes, as shown in Fig. 7.23b. 


Keeping track of the signs in Faraday’s law can be a real headache. For 
instance, in Ex. 7.6 we would like to know which way around the ring the 
induced current flows. In principle, the right-hand rule does the job (we 
called $\Phi$ positive to the left, in Fig. 7.22, so the positive direction for current
in the ring is counterclockwise, as viewed from the left; since the first spike in
Fig. 7.23b is negative, the first current pulse flows clockwise, and the second
counterclockwise). But there’s a handy rule, called Lenz’s law, whose sole
purpose is to help you get the directions right:¹⁴


nature abhors a change in flux. 


The induced current will flow in such a direction that the flux it produces 
tends to cancel the change. (As the front end of the magnet in Ex. 7.6 enters 
the ring, the flux increases, so the current in the ring must generate a field to 
the right — it therefore flows clockwise.) Notice that it is the change in flux, 
not the flux itself, that nature abhors (when the tail end of the magnet exits 
the ring, the flux drops, so the induced current flows counterclockwise, in an 
effort to restore it). Faraday induction is a kind of “inertial” phenomenon: 
A conducting loop “likes” to maintain a constant flux through it; if you try 
to change the flux, the loop responds by sending a current around in such 
a direction as to frustrate your efforts. (It doesn’t succeed completely; the 
flux produced by the induced current is typically only a tiny fraction of the 
original. All Lenz’s law tells you is the direction of the flow.) 


Example 7.7. (The “jumping ring” demonstration.) If you wind a 
solenoidal coil around an iron core (the iron is there to beef up the 
magnetic field), place a metal ring on top, and plug it in, the ring will 
jump several feet in the air (Fig. 7.24). Why? 




Solution. Before you turned on the current, the flux through the ring 
was zero. Afterward a flux appeared (upward, in the diagram), and the 
emf generated in the ring led to a current (in the ring) which, according 
to Lenz’s law, was in such a direction that its field tended to cancel this 
new flux. This means that the current in the loop is opposite to the 
current in the solenoid. And opposite currents repel, so the ring flies
off.¹⁵


14 Lenz’s law applies to motional emfs, too, but for them it is usually easier to get the direction 
of the current from the Lorentz force law. 

15 For further discussion of the jumping ring (and the related “floating ring”), see C. S. 
Schneider and J. P. Ertel, Am. J. Phys. 66, 686 (1998); P. J. H. Tjossem and E. C. Brost, 
Am. J. Phys. 79, 353 (2011).

Problem 7.13. A long solenoid, of radius $a$, is driven by an alternating current, so that the
field inside is sinusoidal: $\mathbf{B}(t) = B_0\cos(\omega t)\hat{\mathbf{z}}$. A circular loop of wire, of radius $a/2$ and
resistance $R$, is placed inside the solenoid, and coaxial with it. Find the current induced in the
loop, as a function of time.


Problem 7.14. A square loop of wire, with sides of length a, lies in the first quadrant of the 
xy plane, with one corner at the origin. In this region, there is a nonuniform time-dependent 
magnetic field $\mathbf{B}(y,t) = ky^3t^2\hat{\mathbf{z}}$ (where $k$ is a constant). Find the emf induced in the loop.


Problem 7.15. As a lecture demonstration a short cylindrical bar magnet is dropped down a 
vertical aluminum pipe of slightly larger diameter, about 2 m long. It takes several seconds to 
emerge at the bottom, whereas an otherwise identical piece of unmagnetized iron makes the 
trip in a fraction of a second. Explain why the magnet falls more slowly.¹⁶


7.2.2 The Induced Electric Field 


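Faraday’s law generalizes the electrostatic rule $\nabla\times\mathbf{E} = 0$ to the time-dependent
régime. The divergence of $\mathbf{E}$ is still given by Gauss’s law
($\nabla\cdot\mathbf{E} = \rho/\epsilon_0$). If $\mathbf{E}$ is a pure Faraday field (due exclusively to a changing $\mathbf{B}$,
with $\rho = 0$), then

$$\nabla\cdot\mathbf{E} = 0, \qquad \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}.$$

This is mathematically identical to magnetostatics,

$$\nabla\cdot\mathbf{B} = 0, \qquad \nabla\times\mathbf{B} = \mu_0\mathbf{J}.$$

Conclusion: Faraday-induced electric fields are determined by $-(\partial\mathbf{B}/\partial t)$ in
exactly the same way as magnetostatic fields are determined by $\mu_0\mathbf{J}$. The
analog to Biot-Savart is¹⁷

$$\mathbf{E} = -\frac{1}{4\pi}\int\frac{(\partial\mathbf{B}/\partial t)\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,d\tau = -\frac{1}{4\pi}\frac{\partial}{\partial t}\int\frac{\mathbf{B}\times\hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}\,d\tau, \tag{7.27}$$

and if symmetry permits, we can use all the tricks associated with Ampère’s
law in integral form ($\oint\mathbf{B}\cdot d\mathbf{l} = \mu_0 I_{\text{enc}}$), only now it’s Faraday’s law in integral
form:

$$\oint\mathbf{E}\cdot d\mathbf{l} = -\frac{d\Phi}{dt}. \tag{7.28}$$

The rate of change of (magnetic) flux through the Amperian loop plays the
role formerly assigned to $\mu_0 I_{\text{enc}}$.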

16 For a discussion of this amazing demonstration see K. D. Hahn et al., Am. J. Phys. 66, 1066
(1998); G. Donoso, C. L. Ladera, and P. Martin, Am. J. Phys. 79, 193 (2011).

17 Magnetostatics holds only for time-independent currents, but there is no such restriction
on $\partial\mathbf{B}/\partial t$.

Example 7.8. A uniform magnetic field $\mathbf{B}(t)$, pointing straight up, fills
the shaded circular region of Fig. 7.25. If $\mathbf{B}$ is changing with time, what
is the induced electric field?


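Solution. $\mathbf{E}$ points in the azimuthal direction, just like the magnetic
field inside a long straight wire carrying a uniform current density.
Draw an Amperian loop of radius $s$, and apply Faraday’s law:

$$\oint\mathbf{E}\cdot d\mathbf{l} = E(2\pi s) = -\frac{d\Phi}{dt} = -\frac{d}{dt}\left(\pi s^2 B(t)\right) = -\pi s^2\frac{dB}{dt}.$$

Therefore,

$$\mathbf{E} = -\frac{s}{2}\frac{dB}{dt}\hat{\boldsymbol{\phi}}.$$

If $\mathbf{B}$ is increasing, $\mathbf{E}$ runs clockwise, as viewed from above.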




Example 7.9. A line charge $\lambda$ is glued onto the rim of a wheel of radius
$b$, which is then suspended horizontally, as shown in Fig. 7.26, so that it
is free to rotate (the spokes are made of some nonconducting material,
wood, maybe). In the central region, out to radius $a$, there is a uniform
magnetic field $\mathbf{B}_0$, pointing up. Now someone turns the field off. What
happens?

Solution. The changing magnetic field will induce an electric field, 
curling around the axis of the wheel. This electric field exerts a force 
on the charges at the rim, and the wheel starts to turn. According to 
Lenz’s law, it will rotate in such a direction that its field tends to restore 
the upward flux. The motion, then, is counterclockwise, as viewed from 
above. 

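Faraday’s law, applied to the loop at radius $b$, says

$$\oint\mathbf{E}\cdot d\mathbf{l} = E(2\pi b) = -\frac{d\Phi}{dt} = -\pi a^2\frac{dB}{dt}, \quad\text{or}\quad \mathbf{E} = -\frac{a^2}{2b}\frac{dB}{dt}\hat{\boldsymbol{\phi}}.$$

The torque on a segment of length $d\mathbf{l}$ is $(\mathbf{r}\times\mathbf{F})$, or $b\lambda E\,dl$. The total
torque on the wheel is therefore

$$N = b\lambda\left(-\frac{a^2}{2b}\frac{dB}{dt}\right)\oint dl = -b\lambda\pi a^2\frac{dB}{dt},$$

and the angular momentum imparted to the wheel is

$$\int N\,dt = -\lambda\pi a^2 b\int_{B_0}^{0}dB = \lambda\pi a^2 bB_0.$$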
It doesn’t matter how quickly or slowly you turn off the field; the 
resulting angular velocity of the wheel is the same regardless. (If you 
find yourself wondering where the angular momentum came from, 
you’re getting ahead of the story! Wait for the next chapter.)

Note that it’s the electric field that did the rotating. To convince you 
of this, I deliberately set things up so that the magnetic field is zero at 
the location of the charge. The experimenters may tell you they never 
put in any electric field — all they did was switch off the magnetic field. 
But when they did that, an electric field automatically appeared, and 
it’s this electric field that turned the wheel. 
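
The claim that the turn-off rate does not matter can be tested numerically by integrating the torque $N(t) = -\lambda\pi a^2 b\,(dB/dt)$ for two quite different switching histories. The sketch below is only illustrative: the parameter values and the two profiles (a 1 ms linear ramp and a 10 s cosine taper) are assumptions, not taken from the text.

```python
import math

# Assumed SI values: line charge, field-region radius, wheel radius, initial field
lam, a, b, B0 = 2e-6, 0.05, 0.2, 0.8

def angular_impulse(B_of_t, T, n=20000):
    """Integrate N(t) = -lam * pi * a^2 * b * dB/dt over 0..T (midpoint rule, centered derivative)."""
    dt = T / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        dBdt = (B_of_t(t + 0.5 * dt) - B_of_t(t - 0.5 * dt)) / dt
        total += -lam * math.pi * a**2 * b * dBdt * dt
    return total

def linear_ramp(T):
    """Field falling linearly from B0 to 0 in time T."""
    return lambda t: B0 * (1.0 - t / T)

def cosine_taper(T):
    """Field falling smoothly from B0 to 0 in time T."""
    return lambda t: 0.5 * B0 * (1.0 + math.cos(math.pi * t / T))

fast = angular_impulse(linear_ramp(1e-3), 1e-3)
slow = angular_impulse(cosine_taper(10.0), 10.0)
expected = lam * math.pi * a**2 * b * B0
print(fast, slow, expected)
```

Both histories deliver the same angular momentum, $\lambda\pi a^2 bB_0$, because the time integral of $dB/dt$ depends only on the initial and final values of the field.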


I must warn you, now, of a small fraud that tarnishes many applications 
of Faraday’s law: electromagnetic induction, of course, occurs only when 
the magnetic fields are changing, and yet we would like to use the apparatus 
of magnetostatics (Ampère’s law, the Biot-Savart law, and the rest) to
calculate those magnetic fields. Technically, any result derived in this way 
is only approximately correct. But in practice the error is usually negligible, 
unless the field fluctuates extremely rapidly, or you are interested in points 
very far from the source. Even the case of a wire snipped by a pair of 
scissors (Prob. 7.19) is static enough for Ampére’s law to apply. This régime, 
in which magnetostatic rules can be used to calculate the magnetic field 
on the right-hand side of Faraday’s law, is called quasistatic. Generally 
speaking, it is only when we come to electromagnetic waves and radiation 
that we must worry seriously about the breakdown of magnetostatics 
itself. 


Example 7.10. An infinitely long straight wire carries a slowly varying
current I(t). Determine the induced electric field, as a function of the
distance s from the wire.¹⁸


Solution. In the quasistatic approximation, the magnetic field is
$(\mu_0 I/2\pi s)$, and it circles around the wire. Like the B-field of a
solenoid, E here runs parallel to the axis. For the rectangular “Amperian
loop” in Fig. 7.27, Faraday’s law gives

$$ \oint \mathbf{E}\cdot d\mathbf{l} = E(s_0)\,l - E(s)\,l = -\frac{d}{dt}\int \mathbf{B}\cdot d\mathbf{a} = -\frac{\mu_0 l}{2\pi}\frac{dI}{dt}\int_{s_0}^{s}\frac{1}{s'}\,ds' = -\frac{\mu_0 l}{2\pi}\frac{dI}{dt}\,(\ln s - \ln s_0). $$

[Fig. 7.27: the wire carrying current I, with the rectangular Amperian loop of length l beside it]

Thus

$$ \mathbf{E}(s) = \left[\frac{\mu_0}{2\pi}\frac{dI}{dt}\ln s + K\right]\hat{\mathbf{z}}, \tag{7.29} $$

where K is a constant (that is to say, it is independent of s; it might
still be a function of t). The actual value of K depends on the whole
history of the function I(t); we’ll see some examples in Chapter 10.

Equation 7.29 has the peculiar implication that E blows up as s goes
to infinity. That can’t be true. ... What’s gone wrong? Answer: We have
overstepped the limits of the quasistatic approximation. As we shall
see in Chapter 9, electromagnetic “news” travels at the speed of light,
and at large distances B depends not on the current now, but on the
current as it was at some earlier time (indeed, a whole range of earlier
times, since different points on the wire are different distances away).
If τ is the time it takes I to change substantially, then the quasistatic
approximation should hold only for

$$ s \ll c\tau, \tag{7.30} $$

and hence Eq. 7.29 simply does not apply, at extremely large s.

18 This example is artificial, and not just in the obvious sense of involving infinite wires, but
in a more subtle respect. It assumes that the current is the same (at any given instant) all the
way down the line. This is a safe assumption for the short wires in typical electric circuits,
but not for long wires (transmission lines), unless you supply a distributed and synchronized
driving mechanism. But never mind: the problem doesn’t enquire how you would produce
such a current; it only asks what fields would result if you did. Variations on this problem
are discussed by M. A. Heald, Am. J. Phys. 54, 1142 (1986).
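
A short numerical sketch may help fix the two results of this example. Because K is unknown, only the difference E(s) − E(s₀) is computable from Eq. 7.29; the function names and the `margin` used to turn "much less than" in Eq. 7.30 into a yes/no test are assumptions made here for illustration.

```python
import math

MU0 = 4e-7 * math.pi   # permeability of free space in T*m/A (to excellent approximation)
C = 299_792_458.0      # speed of light in m/s

def E_difference(s, s0, dI_dt):
    """E_z(s) - E_z(s0) from Eq. 7.29; the unknown constant K drops out."""
    return MU0 / (2 * math.pi) * dI_dt * math.log(s / s0)

def quasistatic_ok(s, tau, margin=0.01):
    """Crude reading of Eq. 7.30: require s < margin * c * tau (margin is an assumption)."""
    return s < margin * C * tau

# The field difference grows only logarithmically with distance ...
print(E_difference(2.0, 1.0, 100.0), E_difference(4.0, 1.0, 100.0))
# ... and the formula should be trusted only well inside s << c*tau.
print(quasistatic_ok(1.0, 1e-3), quasistatic_ok(1e6, 1e-3))
```

The logarithmic growth is slow, but it never stops, which is why the formula must fail at sufficiently large s.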


Problem 7.16. A long solenoid with radius a and n turns per unit length carries a time-
dependent current I(t) in the $\hat{\boldsymbol{\phi}}$ direction. Find the electric field (magnitude and direction) at a
distance s from the axis (both inside and outside the solenoid), in the quasistatic approximation.


Problem 7.17. An alternating current $I = I_0\cos(\omega t)$ flows down a long straight wire, and
returns along a coaxial conducting tube of radius a.


(a) In what direction does the induced electric field point (radial, circumferential, or
longitudinal)?
(b) Assuming that the field goes to zero as $s \to \infty$, find $\mathbf{E}(s, t)$.¹⁹


19 This is not at all the way electric fields actually behave in coaxial cables, for reasons
suggested in footnote 18. See Section 9.5.3, or J. G. Cherveniak, Am. J. Phys. 54, 946 (1986),
for a more realistic treatment.

Problem 7.18. A long solenoid of radius a, carrying n turns per unit length, is looped by a wire
with resistance R, as shown in Fig. 7.28.


Fig. 7.28


(a) If the current in the solenoid is increasing at a constant rate (dI/dt = k), what current flows
in the loop, and which way (left or right) does it pass through the resistor?

(b) If the current I in the solenoid is constant but the solenoid is pulled out of the loop (toward
the left, to a place far from the loop), what total charge passes through the resistor?


Problem 7.19. A square loop, side a, resistance R, lies a distance s from an infinite straight
wire that carries current I (Fig. 7.29). Now someone cuts the wire, so I drops to zero. In what
direction does the induced current in the square loop flow, and what total charge passes a given
point in the loop during the time this current flows? If you don’t like the scissors model, turn the
current down gradually:

$$ I(t) = \begin{cases} (1 - \alpha t)\,I, & \text{for } 0 \le t \le 1/\alpha, \\ 0, & \text{for } t > 1/\alpha. \end{cases} $$


Fig. 7.29


Problem 7.20. A toroidal coil has a rectangular cross section, with inner radius a, outer radius
a + w, and height h. It carries a total of N tightly wound turns, and the current is increasing
at a constant rate (dI/dt = k). If w and h are both much less than a, find the electric field at
a point z above the center of the toroid. [Hint: Exploit the analogy between Faraday fields and
magnetostatic fields, and refer to Ex. 5.6.]


Problem 7.21. Where is $\partial\mathbf{B}/\partial t$ nonzero, in Fig. 7.21b? Exploit the analogy between Faraday’s
law and Ampère’s law to sketch (qualitatively) the electric field. [For a quantitative model, see
Prob. 7.46.]

Problem 7.22. Imagine a uniform magnetic field, pointing in the z direction and filling
all space ($\mathbf{B} = B_0\hat{\mathbf{z}}$). A positive charge is at rest, at the origin. Now somebody turns off
the magnetic field, thereby inducing an electric field. In what direction does the charge
move?²⁰


20 This paradox was suggested by Tom Colbert. Refer to Prob. 2.56.


7.2.3 Inductance


Suppose you have two loops of wire, at rest (Fig. 7.30). If you run a steady
current $I_1$ around loop 1, it produces a magnetic field $\mathbf{B}_1$. Some of the field
lines pass through loop 2; let $\Phi_2$ be the flux of $\mathbf{B}_1$ through 2. You might have
a tough time actually calculating $\mathbf{B}_1$, but a glance at the Biot-Savart law,

$$ \mathbf{B}_1 = \frac{\mu_0}{4\pi} I_1 \oint \frac{d\mathbf{l}_1 \times \hat{\boldsymbol{\mathscr{r}}}}{\mathscr{r}^2}, $$

reveals one significant fact about this field: it is proportional to the current
$I_1$. Therefore, so too is the flux through loop 2:

$$ \Phi_2 = \int \mathbf{B}_1 \cdot d\mathbf{a}_2. $$

Thus

$$ \Phi_2 = M_{21} I_1, \tag{7.31} $$

where $M_{21}$ is the constant of proportionality; it is known as the mutual
inductance of the two loops.

[Figs. 7.30 and 7.31: loops 1 and 2, showing the field $\mathbf{B}_1$ of loop 1 threading loop 2, and the line elements of the two loops]

There is a cute formula for the mutual inductance, which you can derive
by expressing the flux in terms of the vector potential, and invoking Stokes’
theorem:

$$ \Phi_2 = \int \mathbf{B}_1 \cdot d\mathbf{a}_2 = \int (\nabla \times \mathbf{A}_1) \cdot d\mathbf{a}_2 = \oint \mathbf{A}_1 \cdot d\mathbf{l}_2. $$

Now, according to Eq. 5.66,

$$ \mathbf{A}_1 = \frac{\mu_0 I_1}{4\pi} \oint \frac{d\mathbf{l}_1}{\mathscr{r}}, $$

and hence

$$ \Phi_2 = \frac{\mu_0 I_1}{4\pi} \oint \left( \oint \frac{d\mathbf{l}_1}{\mathscr{r}} \right) \cdot d\mathbf{l}_2. $$

Evidently,

$$ M_{21} = \frac{\mu_0}{4\pi} \oint\!\oint \frac{d\mathbf{l}_1 \cdot d\mathbf{l}_2}{\mathscr{r}}. \tag{7.32} $$

This is the Neumann formula; it involves a double line integral: one
integration around loop 1, the other around loop 2 (Fig. 7.31). It’s not very
useful for practical calculations, but it does reveal two important things
about mutual inductance:

(1) $M_{21}$ is a purely geometrical quantity, having to do with the sizes, shapes,
and relative positions of the two loops.

(2) The integral in Eq. 7.32 is unchanged if we switch the roles of loops 1
and 2; it follows that

$$ M_{21} = M_{12}. \tag{7.33} $$

This is an astonishing conclusion: Whatever the shapes and positions of
the loops, the flux through 2 when we run a current I around 1 is identical
to the flux through 1 when we send the same current I around 2. We may
as well drop the subscripts and call them both M.
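
Although the Neumann formula is rarely used by hand, it is easy to evaluate numerically. The following is a minimal sketch, assuming each loop is approximated by a closed polygon and each segment is represented by its midpoint; the helper names (`circle`, `segments`, `neumann`) and the loop dimensions are hypothetical. It compares two coaxial circular loops against the familiar estimate obtained by treating the field of the large loop as uniform over the small one.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space in H/m (to excellent approximation)

def circle(radius, z, n):
    """Vertices of an n-sided polygon approximating a circle in the plane at height z."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n), z) for k in range(n)]

def segments(points):
    """(midpoint, dl) for every straight segment of the closed polygon."""
    segs = []
    for k, p in enumerate(points):
        q = points[(k + 1) % len(points)]
        mid = tuple((pi + qi) / 2 for pi, qi in zip(p, q))
        dl = tuple(qi - pi for pi, qi in zip(p, q))
        segs.append((mid, dl))
    return segs

def neumann(loop_1, loop_2):
    """Discretized Eq. 7.32: (mu0/4pi) * sum of dl1.dl2 / separation (loops must not touch)."""
    segs_1, segs_2 = segments(loop_1), segments(loop_2)
    total = 0.0
    for m1, d1 in segs_1:
        for m2, d2 in segs_2:
            dot = d1[0] * d2[0] + d1[1] * d2[1] + d1[2] * d2[2]
            total += dot / math.dist(m1, m2)
    return MU0 / (4 * math.pi) * total

# Assumed geometry: big loop (radius b) at z = 0, small loop (radius a) a height z above it.
a, b, z = 0.05, 1.0, 0.5
big, small = circle(b, 0.0, 120), circle(a, z, 120)

M12 = neumann(big, small)
M21 = neumann(small, big)
# Uniform-field estimate: on-axis field of the big loop times the area of the small loop.
M_dipole = MU0 * math.pi * a**2 * b**2 / (2 * (b**2 + z**2) ** 1.5)
print(M12, M21, M_dipole)
```

The two orderings agree to rounding error, as Eq. 7.33 requires, and both agree with the uniform-field estimate to a fraction of a percent for this geometry.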


Example 7.11. A short solenoid (length l and radius a, with $n_1$ turns
per unit length) lies on the axis of a very long solenoid (radius b, $n_2$
turns per unit length) as shown in Fig. 7.32. Current I flows in the
short solenoid. What is the flux through the long solenoid?


Solution. Since the inner solenoid is short, it has a very complicated
field; moreover, it puts a different flux through each turn of the outer
solenoid. It would be a miserable task to compute the total flux this
way. However, if we exploit the equality of the mutual inductances,
the problem becomes very easy. Just look at the reverse situation: run
the current I through the outer solenoid, and calculate the flux through
the inner one. The field inside the long solenoid is constant:

$$ B = \mu_0 n_2 I $$

(Eq. 5.59), so the flux through a single loop of the short solenoid is

$$ B\pi a^2 = \mu_0 n_2 I \pi a^2. $$

There are $n_1 l$ turns in all, so the total flux through the inner solenoid
is

$$ \Phi = \mu_0 \pi a^2 n_1 n_2 l\, I. $$

This is also the flux a current I in the short solenoid would put through
the long one, which is what we set out to find. Incidentally, the mutual
inductance, in this case, is

$$ M = \mu_0 \pi a^2 n_1 n_2 l. $$


Suppose, now, that you vary the current in loop 1. The flux through
loop 2 will vary accordingly, and Faraday’s law says this changing flux will
induce an emf in loop 2:

$$ \mathcal{E}_2 = -\frac{d\Phi_2}{dt} = -M\frac{dI_1}{dt}. \tag{7.34} $$

(In quoting Eq. 7.31, which was based on the Biot-Savart law, I am
tacitly assuming that the currents change slowly enough for the system to
be considered quasistatic.) What a remarkable thing: every time you change
the current in loop 1, an induced current flows in loop 2, even though there
are no wires connecting them!

Come to think of it, a changing current not only induces an emf in any
nearby loops, it also induces an emf in the source loop itself (Fig. 7.33).
Once again, the field (and therefore also the flux) is proportional to the
current:

$$ \Phi = LI. \tag{7.35} $$

The constant of proportionality L is called the self-inductance (or simply the
inductance) of the loop. As with M, it depends on the geometry (size and
shape) of the loop. If the current changes, the emf induced in the loop is

$$ \mathcal{E} = -L\frac{dI}{dt}. \tag{7.36} $$

Inductance is measured in henries (H); a henry is a volt-second per ampere.

Example 7.12. Find the self-inductance of a toroidal coil with rectangular
cross section (inner radius a, outer radius b, height h) that carries
a total of N turns.


Solution. The magnetic field inside the toroid is (Eq. 5.60)

$$ B = \frac{\mu_0 N I}{2\pi s}. $$

The flux through a single turn (Fig. 7.34) is

$$ \int \mathbf{B} \cdot d\mathbf{a} = \frac{\mu_0 N I}{2\pi}\, h \int_a^b \frac{1}{s}\, ds = \frac{\mu_0 N I h}{2\pi} \ln\!\left(\frac{b}{a}\right). $$

The total flux is N times this, so the self-inductance (Eq. 7.35) is

$$ L = \frac{\mu_0 N^2 h}{2\pi} \ln\!\left(\frac{b}{a}\right). \tag{7.37} $$
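
As a quick check on Eq. 7.37, the flux integral can be done numerically instead of analytically. This is only a sketch: the function names are made up for the illustration, and the coil dimensions (1000 turns, inner radius 1 cm, outer radius 2 cm, height 1 cm) are sample values.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space in H/m (to excellent approximation)

def toroid_L_formula(N, a, b, h):
    """Eq. 7.37: L = mu0 N^2 h ln(b/a) / (2 pi)."""
    return MU0 * N**2 * h / (2 * math.pi) * math.log(b / a)

def toroid_L_numeric(N, a, b, h, I=1.0, steps=10000):
    """Midpoint-rule flux of B = mu0 N I / (2 pi s) through one turn, times N, divided by I."""
    ds = (b - a) / steps
    flux_one_turn = 0.0
    for i in range(steps):
        s = a + (i + 0.5) * ds
        flux_one_turn += MU0 * N * I / (2 * math.pi * s) * h * ds
    return N * flux_one_turn / I

L_formula = toroid_L_formula(1000, 0.01, 0.02, 0.01)
L_numeric = toroid_L_numeric(1000, 0.01, 0.02, 0.01)
print(L_formula, L_numeric)   # both close to 1.386e-3 H
```

Note that L scales as N², so doubling the number of turns quadruples the inductance.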


Inductance (like capacitance) is an intrinsically positive quantity. Lenz’s 
law, which is enforced by the minus sign in Eq. 7.36, dictates that the emf 
is in such a direction as to oppose any change in current. For this reason, it 
is called a back emf. Whenever you try to alter the current in a wire, you 
must fight against this back emf. Inductance plays somewhat the same role 
in electric circuits that mass plays in mechanical systems: the greater L is, 
the harder it is to change the current, just as the larger the mass, the harder 
it is to change an object’s velocity. 


Example 7.13. Suppose a current I is flowing around a loop, when
someone suddenly cuts the wire. The current drops “instantaneously”
to zero. This generates a whopping back emf, for although I may be
small, dI/dt is enormous. (That’s why you sometimes draw a spark
when you unplug an iron or toaster; electromagnetic induction is
desperately trying to keep the current going, even if it has to jump
the gap in the circuit.)

Nothing so dramatic occurs when you plug in a toaster or iron. In
this case induction opposes the sudden increase in current, prescribing
instead a smooth and continuous buildup. Suppose, for instance, that
a battery (which supplies a constant emf $\mathcal{E}_0$) is connected to a circuit
of resistance R and inductance L (Fig. 7.35). What current flows?

Solution. The total emf in this circuit is $\mathcal{E}_0$ from the battery plus
$-L(dI/dt)$ from the inductance. Ohm’s law, then, says²¹

$$ \mathcal{E}_0 - L\frac{dI}{dt} = IR. $$

This is a first-order differential equation for I as a function of time.
The general solution, as you can show for yourself, is

$$ I(t) = \frac{\mathcal{E}_0}{R} + k\,e^{-(R/L)t}, $$

where k is a constant to be determined by the initial conditions. In
particular, if you close the switch at time t = 0, so I(0) = 0, then
$k = -\mathcal{E}_0/R$, and

$$ I(t) = \frac{\mathcal{E}_0}{R}\left[1 - e^{-(R/L)t}\right]. \tag{7.38} $$

This function is plotted in Fig. 7.36. Had there been no inductance
in the circuit, the current would have jumped immediately to $\mathcal{E}_0/R$.
In practice, every circuit has some self-inductance, and the current
approaches $\mathcal{E}_0/R$ asymptotically. The quantity $\tau = L/R$ is the time
constant; it tells you how long the current takes to reach a substantial
fraction (roughly two-thirds) of its final value.

[Fig. 7.36: I(t) versus t, with the time axis marked in units of L/R]
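
The buildup of current can also be followed by stepping the circuit equation forward in time. The sketch below uses the simple forward-Euler method (a choice made here for brevity, not something prescribed by the text) and compares the result with Eq. 7.38; the battery emf, resistance, and inductance are assumed sample values.

```python
import math

def rl_current_euler(emf0, R, L, t_end, steps=10000):
    """Integrate L dI/dt = emf0 - I R from I(0) = 0 with forward Euler steps."""
    dt = t_end / steps
    I = 0.0
    for _ in range(steps):
        I += dt * (emf0 - I * R) / L
    return I

def rl_current_exact(emf0, R, L, t):
    """Eq. 7.38."""
    return emf0 / R * (1 - math.exp(-R * t / L))

EMF0, R, L = 12.0, 4.0, 0.2      # volts, ohms, henries (illustrative values)
tau = L / R                      # time constant, 0.05 s here

I_euler = rl_current_euler(EMF0, R, L, tau)
I_exact = rl_current_exact(EMF0, R, L, tau)
print(I_euler, I_exact, I_exact / (EMF0 / R))   # the ratio is 1 - 1/e, about 0.632
```

After one time constant the current has reached 1 − 1/e of its final value, which is the "roughly two-thirds" quoted above.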


Problem 7.23. A small loop of wire (radius a) is held a distance z above the center of a large
loop (radius b), as shown in Fig. 7.37. The planes of the two loops are parallel, and perpendicular
to the common axis.


(a) Suppose current I flows in the big loop. Find the flux through the little loop. (The little loop
is so small that you may consider the field of the big loop to be essentially constant.)
(b) Suppose current I flows in the little loop. Find the flux through the big loop. (The little loop
is so small that you may treat it as a magnetic dipole.)
(c) Find the mutual inductances, and confirm that $M_{12} = M_{21}$.


21 Notice that $-L(dI/dt)$ goes on the left-hand side of the equation: it is part of the emf that
establishes the voltage across the resistor.


Fig. 7.37 Fig. 7.38


Problem 7.24. A square loop of wire, of side a, lies midway between two long wires, 3a apart,
and in the same plane. (Actually, the long wires are sides of a large rectangular loop, but the
short ends are so far away that they can be ignored.) A clockwise current I in the square loop is
gradually increasing: dI/dt = k (a constant). Find the emf induced in the big loop. Which way
will the induced current flow?


Problem 7.25. Find the self-inductance per unit length of a long solenoid, of radius R,
carrying n turns per unit length.


Problem 7.26. Try computing the self-inductance of the “hairpin” loop shown in Fig. 7.38. 
(Neglect the contribution from the ends; most of the flux comes from the long straight section.) 
You'll run into a snag that is characteristic of many self-inductance calculations. To get a definite 
answer, assume the wire has a tiny radius ε, and ignore any flux through the wire itself.


Problem 7.27. An alternating current $I(t) = I_0\cos(\omega t)$ (amplitude 0.5 A, frequency
60 Hz) flows down a straight wire, which runs along the axis of a toroidal coil with rectangular
cross section (inner radius 1 cm, outer radius 2 cm, height 1 cm, 1000 turns). The coil is
connected to a 500 Ω resistor.


(a) In the quasistatic approximation, what emf is induced in the toroid? Find the current, $I_R(t)$,
in the resistor.

(b) Calculate the back emf in the coil, due to the current $I_R(t)$. What is the ratio of the
amplitudes of this back emf and the “direct” emf in (a)?


Problem 7.28. A capacitor C is charged up to a voltage V and connected to an inductor L,
as shown schematically in Fig. 7.39. At time t = 0, the switch S is closed. Find the current in the
circuit as a function of time. How does your answer change if a resistor R is included in series
with C and L?


Fig. 7.39

7.2.4 Energy in Magnetic Fields


It takes a certain amount of energy to start a current flowing in a circuit. 
I’m not talking about the energy delivered to the resistors and converted 
into heat — that is irretrievably lost, as far as the circuit is concerned, and 
can be large or small, depending on how long you let the current run. What 
I am concerned with, rather, is the work you must do against the back emf 
to get the current going. This is a fixed amount, and it is recoverable: you 
get it back when the current is turned off. In the meantime, it represents 
energy latent in the circuit; as we’ll see in a moment, it can be regarded as 
energy stored in the magnetic field. 

The work done on a unit charge, against the back emf, in one trip around
the circuit is $-\mathcal{E}$ (the minus sign records the fact that this is the work done by
you against the emf, not the work done by the emf). The amount of charge
per unit time passing down the wire is I. So the total work done per unit
time is

$$ \frac{dW}{dt} = -\mathcal{E} I = L I \frac{dI}{dt}. $$

If we start with zero current and build it up to a final value I, the work done
(integrating the last equation over time) is

$$ W = \frac{1}{2} L I^2. \tag{7.39} $$

It does not depend on how long we take to crank up the current, only on
the geometry of the loop (in the form of L) and the final current I.
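
The contrast between the recoverable work against the back emf and the heat lost in the resistance can be made concrete. In this sketch the current is ramped up linearly (an assumed profile), and both $\int LI\,(dI/dt)\,dt$ and $\int I^2R\,dt$ are accumulated; the circuit values are illustrative only.

```python
def ramp_energies(L, R, I_final, T, steps=10000):
    """Linear ramp I(t) = I_final * t / T. Returns (work against back emf, heat in R)."""
    dt = T / steps
    dI_dt = I_final / T
    work = heat = 0.0
    for i in range(steps):
        I = I_final * (i + 0.5) * dt / T   # current at the midpoint of the step
        work += L * I * dI_dt * dt          # dW/dt = L I dI/dt
        heat += I**2 * R * dt               # Joule heating, I^2 R
    return work, heat

L_IND, R, I_F = 0.2, 4.0, 3.0               # henries, ohms, amperes (assumed values)

work_fast, heat_fast = ramp_energies(L_IND, R, I_F, T=0.1)
work_slow, heat_slow = ramp_energies(L_IND, R, I_F, T=10.0)
print(work_fast, work_slow)   # both equal (1/2) L I^2 = 0.9 J
print(heat_fast, heat_slow)   # 1.2 J versus 120 J: the heat depends on the ramp time
```

The work against the back emf is the same for both ramps, while the heat grows in proportion to the time taken.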

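There is a nicer way to write W, which has the advantage that it is
readily generalized to surface and volume currents. Remember that the flux
Φ through the loop is equal to LI (Eq. 7.35). On the other hand,

$$ \Phi = \int \mathbf{B} \cdot d\mathbf{a} = \int (\nabla \times \mathbf{A}) \cdot d\mathbf{a} = \oint \mathbf{A} \cdot d\mathbf{l}, $$

where the line integral is around the perimeter of the loop. Thus

$$ LI = \oint \mathbf{A} \cdot d\mathbf{l}, $$

and therefore

$$ W = \frac{1}{2} I \oint \mathbf{A} \cdot d\mathbf{l} = \frac{1}{2} \oint (\mathbf{A} \cdot \mathbf{I})\, dl. \tag{7.40} $$

In this form, the generalization to volume currents is obvious:

$$ W = \frac{1}{2} \int_{\mathcal{V}} (\mathbf{A} \cdot \mathbf{J})\, d\tau. \tag{7.41} $$

But we can do even better, and express W entirely in terms of the
magnetic field: Ampère’s law, $\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$, lets us eliminate J:

$$ W = \frac{1}{2\mu_0} \int \mathbf{A} \cdot (\nabla \times \mathbf{B})\, d\tau. \tag{7.42} $$

Integration by parts transfers the derivative from B to A; specifically,
product rule 6 states that

$$ \nabla \cdot (\mathbf{A} \times \mathbf{B}) = \mathbf{B} \cdot (\nabla \times \mathbf{A}) - \mathbf{A} \cdot (\nabla \times \mathbf{B}), $$

so

$$ \mathbf{A} \cdot (\nabla \times \mathbf{B}) = \mathbf{B} \cdot \mathbf{B} - \nabla \cdot (\mathbf{A} \times \mathbf{B}). $$

Consequently,

$$ W = \frac{1}{2\mu_0} \left[ \int B^2\, d\tau - \int \nabla \cdot (\mathbf{A} \times \mathbf{B})\, d\tau \right] = \frac{1}{2\mu_0} \left[ \int_{\mathcal{V}} B^2\, d\tau - \oint_{\mathcal{S}} (\mathbf{A} \times \mathbf{B}) \cdot d\mathbf{a} \right], \tag{7.43} $$

where S is the surface bounding the volume V.

Now, the integration in Eq. 7.41 is to be taken over the entire volume
occupied by the current. But any region larger than this will do just as well,
since J is zero out there anyway. In Eq. 7.43, the larger the region we pick
the greater is the contribution from the volume integral, and therefore the
smaller is that of the surface integral (this makes sense: as the surface gets
farther from the current, both A and B decrease). In particular, if we agree
to integrate over all space, then the surface integral goes to zero, and we are
left with

$$ W = \frac{1}{2\mu_0} \int_{\text{all space}} B^2\, d\tau. \tag{7.44} $$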


In view of this result, we say the energy is “stored in the magnetic field,”
in the amount $(B^2/2\mu_0)$ per unit volume. This is a nice way to think of it,
though someone looking at Eq. 7.41 might prefer to say that the energy is
stored in the current distribution, in the amount $\frac{1}{2}(\mathbf{A} \cdot \mathbf{J})$ per unit volume.
The distinction is one of bookkeeping; the important quantity is the total
energy W, and we need not worry about where (if anywhere) the energy
is “located.” (The same issue arose in the case of electrostatic energy,
Section 2.4.4.)

You might find it strange that it takes energy to set up a magnetic field;
after all, magnetic fields themselves do no work. The point is that producing
a magnetic field, where previously there was none, requires changing the
field, and a changing B-field, according to Faraday, induces an electric field.
The latter, of course, can do work. In the beginning, there is no E, and at the
end there is no E; but in between, while B is building up, there is an E, and it
is against this that the work is done. (You see why I could not calculate the
energy stored in a magnetostatic field back in Chapter 5.) In light of this,
it is extraordinary how similar the magnetic energy formulas are to their
electrostatic counterparts:²²

$$ W_{\text{elec}} = \frac{1}{2} \int (V\rho)\, d\tau = \frac{\epsilon_0}{2} \int E^2\, d\tau, \qquad \text{(2.43 and 2.45)} $$

$$ W_{\text{mag}} = \frac{1}{2} \int (\mathbf{A} \cdot \mathbf{J})\, d\tau = \frac{1}{2\mu_0} \int B^2\, d\tau. \qquad \text{(7.41 and 7.44)} $$

22 For an illuminating confirmation of Eq. 7.44, using the method of Prob. 2.45, see T. H.
Boyer, Am. J. Phys. 69, 1 (2001).


Example 7.14. A long coaxial cable carries current I (the current flows
down the surface of the inner cylinder, radius a, and back along the
outer cylinder, radius b) as shown in Fig. 7.40. Find the magnetic
energy stored in a section of length l.

[Fig. 7.40: coaxial cable, section of length l]

Solution. According to Ampère’s law, the field between the cylinders is

$$ \mathbf{B} = \frac{\mu_0 I}{2\pi s}\,\hat{\boldsymbol{\phi}}. $$

Elsewhere, the field is zero. Thus, the energy per unit volume is

$$ \frac{1}{2\mu_0} \left( \frac{\mu_0 I}{2\pi s} \right)^2 = \frac{\mu_0 I^2}{8\pi^2 s^2}. $$

The energy in a cylindrical shell of length l, radius s, and thickness ds,
then, is

$$ \left( \frac{\mu_0 I^2}{8\pi^2 s^2} \right) 2\pi l s\, ds = \frac{\mu_0 I^2 l}{4\pi} \left( \frac{ds}{s} \right). $$

Integrating from a to b, we have

$$ W = \frac{\mu_0 I^2 l}{4\pi} \ln\!\left(\frac{b}{a}\right). $$

By the way, this suggests a very simple way to calculate the self-
inductance of the cable. According to Eq. 7.39, the energy can also be
written as $\frac{1}{2}LI^2$. Comparing the two expressions,²³

$$ L = \frac{\mu_0 l}{2\pi} \ln\!\left(\frac{b}{a}\right). $$

This method of calculating self-inductance is especially useful when
the current is not confined to a single path, but spreads over some
surface or volume, so that different parts of the current enclose different
amounts of flux. In such cases, it can be very tricky to get
the inductance directly from Eq. 7.35, and it is best to let Eq. 7.39
define L.
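
The energy route to the inductance lends itself to a numerical check. The sketch below sums the energy density $B^2/2\mu_0$ over thin cylindrical shells between the conductors and then extracts L from $W = \frac{1}{2}LI^2$; the cable dimensions and current are assumed sample values.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space in H/m (to excellent approximation)

def coax_energy_numeric(I, a, b, length, steps=20000):
    """Sum u = B^2/(2 mu0) over cylindrical shells of radius s, thickness ds, a < s < b."""
    ds = (b - a) / steps
    W = 0.0
    for i in range(steps):
        s = a + (i + 0.5) * ds
        B = MU0 * I / (2 * math.pi * s)
        W += B**2 / (2 * MU0) * 2 * math.pi * s * length * ds
    return W

def coax_energy_formula(I, a, b, length):
    """W = mu0 I^2 l ln(b/a) / (4 pi), the result of the example."""
    return MU0 * I**2 * length / (4 * math.pi) * math.log(b / a)

# Assumed values: 2 A through a 1 m section, inner radius 0.5 mm, outer radius 2 mm.
I, A_IN, B_OUT, LENGTH = 2.0, 0.5e-3, 2.0e-3, 1.0

W_numeric = coax_energy_numeric(I, A_IN, B_OUT, LENGTH)
W_formula = coax_energy_formula(I, A_IN, B_OUT, LENGTH)
L_from_energy = 2 * W_numeric / I**2                       # from W = (1/2) L I^2
L_formula = MU0 * LENGTH / (2 * math.pi) * math.log(B_OUT / A_IN)
print(W_numeric, W_formula, L_from_energy, L_formula)
```

The same shell-by-shell sum works unchanged for current distributions where the flux linkage is ambiguous; only the expression for B inside the loop needs to change.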


23 Notice the similarity to Eq. 7.37: in a sense, the rectangular toroid is a short coaxial cable,
turned on its side.

Problem 7.29. Find the energy stored in a section of length l of a long solenoid (radius R,
current I, n turns per unit length): (a) using Eq. 7.39 (you found L in Prob. 7.25), (b) using
Eq. 7.40 (we worked out A in Ex. 5.12), (c) using Eq. 7.44, and (d) using Eq. 7.43 (take as your
volume the cylindrical tube from radius a < R out to radius b > R).


Problem 7.30. Calculate the energy stored in the toroidal coil of Ex. 7.12, by applying Eq. 7.44.
Use the answer to check Eq. 7.37.


Problem 7.31. A long cable carries current in one direction uniformly distributed over its
(circular) cross section. The current returns along the surface (there is a very thin insulating
sheath separating the currents). Find the self-inductance per unit length.


Fig. 7.41


Problem 7.32. Suppose the circuit in Fig. 7.41 has been connected for a long time when
suddenly, at time t = 0, switch S is thrown from A to B, bypassing the battery.


(a) What is the current at any subsequent time t?
(b) What is the total energy delivered to the resistor?
(c) Show that this is equal to the energy originally stored in the inductor.


Problem 7.33. Two tiny wire loops, with areas $\mathbf{a}_1$ and $\mathbf{a}_2$, are situated a displacement $\boldsymbol{\mathscr{r}}$ apart
(Fig. 7.42).


Fig. 7.42


(a) Find their mutual inductance. [Hint: Treat them as magnetic dipoles, and use Eq. 5.88.] Is
your formula consistent with Eq. 7.33?

(b) Suppose a current $I_1$ is flowing in loop 1, and we propose to turn on a current $I_2$ in loop 2.
How much work must be done, against the mutually induced emf, to keep the current $I_1$
flowing in loop 1? In light of this result, comment on Eq. 6.35.


Problem 7.34. An infinite cylinder of radius R carries a uniform surface charge σ. We propose
to set it spinning about its axis, at a final angular velocity $\omega_f$. How much work will this take,
per unit length? Do it two ways, and compare your answers:

(a) Find the magnetic field and the induced electric field (in the quasistatic approximation),
inside and outside the cylinder, in terms of ω, $\dot{\omega}$, and s (the distance from the axis).
Calculate the torque you must exert, and from that obtain the work done per unit length
($W = \int N\, d\phi$).

(b) Use Eq. 7.44 to determine the energy stored in the resulting magnetic field.

7.3 Maxwell’s Equations 


7.3.1 Electrodynamics before Maxwell 


So far, we have encountered the following laws, specifying the divergence
and curl of electric and magnetic fields:

(i) $\nabla \cdot \mathbf{E} = \rho/\epsilon_0$ (Gauss’s law),
(ii) $\nabla \cdot \mathbf{B} = 0$ (no name),
(iii) $\nabla \times \mathbf{E} = -\partial\mathbf{B}/\partial t$ (Faraday’s law),
(iv) $\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$ (Ampère’s law).

These equations represent the state of electromagnetic theory in the mid-
nineteenth century, when Maxwell began his work. They were not written
in so compact a form, in those days, but their physical content was familiar.
Now, it happens that there is a fatal inconsistency in these formulas. It has
to do with the old rule that divergence of curl is always zero. If you apply
the divergence to number (iii), everything works out:

$$ \nabla \cdot (\nabla \times \mathbf{E}) = \nabla \cdot \left( -\frac{\partial \mathbf{B}}{\partial t} \right) = -\frac{\partial}{\partial t} (\nabla \cdot \mathbf{B}). $$

The left-hand side is zero because divergence of curl is zero; the right-hand
side is zero by virtue of equation (ii). But when you do the same thing to
number (iv), you get into trouble:

$$ \nabla \cdot (\nabla \times \mathbf{B}) = \mu_0 (\nabla \cdot \mathbf{J}); \tag{7.45} $$

the left-hand side must be zero, but the right-hand side, in general, is not.
For steady currents, the divergence of J is zero, but when we go beyond
magnetostatics Ampère’s law cannot be right.
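
The asymmetry in Eq. 7.45 can be seen numerically: the divergence of a curl vanishes for any smooth field, whereas the divergence of a generic field does not. The sketch below uses central finite differences; the test field `F`, the step `H`, and the sample point are arbitrary choices made purely for illustration.

```python
import math

H = 1e-3  # finite-difference step (an arbitrary choice)

def F(x, y, z):
    """An arbitrary smooth vector field, standing in for a generic J or B."""
    return (y * y * z, math.sin(x) * z, x * y * math.exp(z))

def partial(f, axis, p):
    """Central-difference derivative of scalar function f along the given axis at point p."""
    lo, hi = list(p), list(p)
    lo[axis] -= H
    hi[axis] += H
    return (f(*hi) - f(*lo)) / (2 * H)

def component(field, i):
    """Scalar function returning the i-th component of a vector field."""
    return lambda x, y, z: field(x, y, z)[i]

def curl(field):
    """Return a new vector field approximating the curl of `field`."""
    fx, fy, fz = (component(field, i) for i in range(3))
    def curl_at(x, y, z):
        p = (x, y, z)
        return (partial(fz, 1, p) - partial(fy, 2, p),
                partial(fx, 2, p) - partial(fz, 0, p),
                partial(fy, 0, p) - partial(fx, 1, p))
    return curl_at

def div(field, p):
    """Central-difference divergence of a vector field at point p."""
    return sum(partial(component(field, i), i, p) for i in range(3))

point = (0.3, -0.7, 1.2)
div_of_curl = div(curl(F), point)   # zero up to rounding error
div_of_F = div(F, point)            # generally nonzero (here x*y*exp(z))
print(div_of_curl, div_of_F)
```

So the left-hand side of Eq. 7.45 is forced to vanish, while nothing forces the right-hand side to do so unless the current happens to be steady.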

There’s another way to see that Ampère’s law is bound to fail for
nonsteady currents. Suppose we’re in the process of charging up a capacitor
(Fig. 7.43). In integral form, Ampère’s law reads

$$ \oint \mathbf{B} \cdot d\mathbf{l} = \mu_0 I_{\text{enc}}. $$

I want to apply it to the Amperian loop shown in the diagram. How
do I determine $I_{\text{enc}}$? Well, it’s the total current passing through the loop,
or, more precisely, the current piercing a surface that has the loop for
its boundary. In this case, the simplest surface lies in the plane of the
loop; the wire punctures this surface, so $I_{\text{enc}} = I$. Fine, but what if I draw
instead the bubble-shaped surface in Fig. 7.43? No current passes through
this surface, and I conclude that $I_{\text{enc}} = 0$! We never had this problem in
magnetostatics because the conflict arises only when charge is piling up
somewhere (in this case, on the capacitor plates). But for nonsteady currents
(such as this one) “the current enclosed by the loop” is an ill-defined notion;
it depends entirely on what surface you use. (If this seems pedantic to
you, on the grounds that “obviously one should use the plane surface,”
remember that the Amperian loop could be some contorted shape that
doesn’t even lie in a plane.)

[Fig. 7.43: a battery charging a capacitor, with an Amperian loop around the wire]

Of course, we had no right to expect Ampère’s law to hold outside of
magnetostatics; after all, we derived it from the Biot-Savart law. However,
in Maxwell’s time there was no experimental reason to doubt that Ampère’s
law was of wider validity. The flaw was a purely theoretical one, and
Maxwell fixed it by purely theoretical arguments.


7.3.2 How Maxwell Fixed Ampère’s Law


The problem is on the right-hand side of Eq. 7.45, which should be zero,
but isn’t. Applying the continuity equation (Eq. 5.29) and Gauss’s law, the
offending term can be rewritten:

$$ \nabla \cdot \mathbf{J} = -\frac{\partial \rho}{\partial t} = -\frac{\partial}{\partial t} (\epsilon_0 \nabla \cdot \mathbf{E}) = -\nabla \cdot \left( \epsilon_0 \frac{\partial \mathbf{E}}{\partial t} \right). $$

If we were to combine $\epsilon_0(\partial\mathbf{E}/\partial t)$ with J, in Ampère’s law, it would be just
right to kill off the extra divergence:

$$ \nabla \times \mathbf{B} = \mu_0 \mathbf{J} + \mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}. \tag{7.46} $$

(Maxwell himself had other reasons for wanting to add this quantity
to Ampère’s law. To him, the rescue of the continuity equation was a
happy dividend rather than a primary motive. But today we recognize this
argument as a far more compelling one than Maxwell’s, which was based
on a now-discredited model of the ether.²⁴)

Such a modification changes nothing, as far as magnetostatics is concerned:
when E is constant, we still have $\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$. In fact, Maxwell’s
term is hard to detect in ordinary electromagnetic experiments, where it
must compete for attention with J; that’s why Faraday and the others
never discovered it in the laboratory. However, it plays a crucial role in the
propagation of electromagnetic waves, as we’ll see in Chapter 9.

Apart from curing the defect in Ampère’s law, Maxwell’s term has a
certain aesthetic appeal: just as a changing magnetic field induces an electric
field (Faraday’s law), so²⁵


a changing electric field induces a magnetic field.


Of course, theoretical convenience and aesthetic consistency are only suggestive;
there might, after all, be other ways to doctor up Ampère’s law.
The real confirmation of Maxwell’s theory came in 1888 with Hertz’s
experiments on electromagnetic waves.

Maxwell called his extra term the displacement current:

$$ \mathbf{J}_d \equiv \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}. \tag{7.47} $$

(It’s a misleading name; $\epsilon_0(\partial\mathbf{E}/\partial t)$ has nothing to do with current, except
that it adds to J in Ampère’s law.) Let’s see now how displacement current
resolves the paradox of the charging capacitor (Fig. 7.43). If the capacitor
plates are very close together (I didn’t draw them that way, but the calculation
is simpler if you assume this), then the electric field between them is

$$ E = \frac{1}{\epsilon_0} \sigma = \frac{1}{\epsilon_0} \frac{Q}{A}, $$

where Q is the charge on the plate and A is its area. Thus, between the plates,

$$ \frac{\partial E}{\partial t} = \frac{1}{\epsilon_0 A} \frac{dQ}{dt} = \frac{1}{\epsilon_0 A} I. $$

Now, the “new, improved” Ampère’s law (Eq. 7.46) reads, in integral form,

$$ \oint \mathbf{B} \cdot d\mathbf{l} = \mu_0 I_{\text{enc}} + \mu_0 \epsilon_0 \int \left( \frac{\partial \mathbf{E}}{\partial t} \right) \cdot d\mathbf{a}. \tag{7.48} $$

If we choose the flat surface, then E = 0 and $I_{\text{enc}} = I$. If, on the other hand,
we use the balloon-shaped surface, then $I_{\text{enc}} = 0$, but $\int (\partial\mathbf{E}/\partial t) \cdot d\mathbf{a} = I/\epsilon_0$.
So we get the same answer for either surface, though in the first case it comes
from the conduction current, and in the second from the displacement
current.
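
The agreement between the two surfaces is easy to verify with numbers. This sketch evaluates the right-hand side of Eq. 7.48 both ways, assuming a constant charging current (so that Q = It with the plates initially uncharged) and closely spaced plates; the current, plate area, and time are illustrative values.

```python
import math

MU0 = 4e-7 * math.pi     # permeability of free space in T*m/A (to excellent approximation)
EPS0 = 8.8541878128e-12  # permittivity of free space in F/m

def rhs_flat_surface(I):
    """Flat surface: the wire pierces it, and E is negligible there."""
    return MU0 * I

def rhs_balloon_surface(I, area, t, dt=1e-6):
    """Balloon surface: no conduction current, only the flux of dE/dt between the plates."""
    def E(time):
        return (I * time) / (EPS0 * area)   # E = Q/(eps0 A), assuming Q = I t
    dE_dt = (E(t + dt) - E(t - dt)) / (2 * dt)
    return MU0 * EPS0 * dE_dt * area

I, AREA, T = 2.0, 1e-2, 1e-3   # amperes, square meters, seconds (assumed values)

flat = rhs_flat_surface(I)
balloon = rhs_balloon_surface(I, AREA, T)
print(flat, balloon)   # the two surfaces give the same circulation of B
```

The plate area cancels out of the balloon-surface result, as it must if the answer is to depend only on the loop.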


24 For the history of this subject, see A. M. Bork, Am. J. Phys. 31, 854 (1963). 

25 See footnote 11 for commentary on the word “induce.” The same issue arises here: Should 
a changing electric field be regarded as an independent source of magnetic field (along with 
current)? In a proximate sense it does function as a source, but since the electric field itself 
was produced by charges and currents, they alone are the “ultimate” sources of E and B. 
See S. E. Hill, Phys. Teach. 49, 343 (2011); for a contrary view, see C. Savage, Phys. Teach. 
50, 226 (2012). 




Example 7.15. Imagine two concentric metal spherical shells
(Fig. 7.44). The inner one (radius a) carries a charge Q(t), and the
outer one (radius b) an opposite charge −Q(t). The space between
them is filled with ohmic material of conductivity σ, so a radial current
flows:

$$ \mathbf{J} = \sigma \mathbf{E} = \sigma \frac{1}{4\pi\epsilon_0} \frac{Q}{r^2}\, \hat{\mathbf{r}}; \qquad I = -\dot{Q} = \int \mathbf{J} \cdot d\mathbf{a} = \frac{\sigma Q}{\epsilon_0}. $$

This configuration is spherically symmetrical, so the magnetic field
has to be zero (the only direction it could possibly point is radial,
and $\nabla \cdot \mathbf{B} = 0 \Rightarrow \oint \mathbf{B} \cdot d\mathbf{a} = B(4\pi r^2) = 0$, so B = 0). What? I
thought currents produce magnetic fields! Isn’t that what Biot-Savart
and Ampère taught us? How can there be a J with no accompanying
B?


Solution. This is not a static configuration: Q, E, and J are all functions
of time; Ampère and Biot-Savart do not apply. The displacement
current

$$ \mathbf{J}_d = \epsilon_0 \frac{\partial \mathbf{E}}{\partial t} = \frac{1}{4\pi} \frac{\dot{Q}}{r^2}\, \hat{\mathbf{r}} = -\sigma \frac{Q}{4\pi\epsilon_0 r^2}\, \hat{\mathbf{r}} $$

exactly cancels the conduction current (in Eq. 7.46), and the magnetic
field (determined by $\nabla \cdot \mathbf{B} = 0$, $\nabla \times \mathbf{B} = 0$) is indeed zero.
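
The cancellation can be checked numerically. Since $-\dot{Q} = \sigma Q/\epsilon_0$, the charge decays as $Q(t) = Q_0 e^{-\sigma t/\epsilon_0}$; the sketch below uses that decay law, with assumed values for the initial charge, conductivity, radius, and time, and differentiates E numerically to get the displacement current.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space in F/m

def Q(t, Q0, sigma):
    """Charge on the inner shell; follows from -dQ/dt = sigma*Q/eps0."""
    return Q0 * math.exp(-sigma * t / EPS0)

def E_radial(r, t, Q0, sigma):
    """Radial field between the shells."""
    return Q(t, Q0, sigma) / (4 * math.pi * EPS0 * r**2)

def J_conduction(r, t, Q0, sigma):
    """Radial conduction current density, J = sigma * E."""
    return sigma * E_radial(r, t, Q0, sigma)

def J_displacement(r, t, Q0, sigma):
    """Radial displacement current density, eps0 * dE/dt, by central difference."""
    h = 1e-5 * EPS0 / sigma   # small compared with the decay time eps0/sigma
    dE_dt = (E_radial(r, t + h, Q0, sigma) - E_radial(r, t - h, Q0, sigma)) / (2 * h)
    return EPS0 * dE_dt

# Assumed values: 1 nC initial charge, sigma = 1e-3 S/m, r = 0.1 m, half a decay time.
Q0, SIGMA, R_MID = 1e-9, 1e-3, 0.1
t = 0.5 * EPS0 / SIGMA

Jc = J_conduction(R_MID, t, Q0, SIGMA)
Jd = J_displacement(R_MID, t, Q0, SIGMA)
print(Jc, Jd, Jc + Jd)   # the sum vanishes to within the finite-difference error
```

With the total $\mathbf{J} + \mathbf{J}_d$ equal to zero everywhere between the shells, there is nothing left on the right-hand side of Eq. 7.46 to produce a magnetic field.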


Problem 7.35. A fat wire, radius a, carries a constant current I, uniformly distributed over its
cross section. A narrow gap in the wire, of width $w \ll a$, forms a parallel-plate capacitor, as
shown in Fig. 7.45. Find the magnetic field in the gap, at a distance s < a from the axis.


Fig. 7.45


Problem 7.36. The preceding problem was an artificial model for the charging capacitor, 
designed to avoid complications associated with the current spreading out over the surface of 
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the plates. For amore realistic model, imagine thin wires that connect to the centers of the plates 
(Fig. 7.46a). Again, the current Z is constant, the radius of the capacitor is a, and the separation 
of the plates is w < a. Assume that the current flows out over the plates in such a way that 
the surface charge is uniform, at any given time, and is zero at ¢ = 0. 


(a) Find the electric field between the plates, as a function of t. 

(b) Find the displacement current through a circle of radius s in the plane midway between the
plates. Using this circle as your “Amperian loop,” and the flat surface that spans it, find the
magnetic field at a distance s from the axis.

(c) Repeat part (b), but this time use the cylindrical surface in Fig. 7.46b, which is open at
the right-hand end and extends to the left through the plate and terminates outside the
capacitor. Notice that the displacement current through this surface is zero, and there are
two contributions to $I_{\text{enc}}$.²⁶


Fig. 7.46 (panels a and b)


Problem 7.37. Refer to Prob. 7.17, to which the correct answer was

\[
\mathbf{E}(s,t) = \frac{\mu_0 I_0\omega}{2\pi}\,\sin(\omega t)\,\ln\!\left(\frac{a}{s}\right)\hat{\mathbf{z}}.
\]

(a) Find the displacement current density $\mathbf{J}_d$.
(b) Integrate it to get the total displacement current,

\[
I_d = \int \mathbf{J}_d\cdot d\mathbf{a}.
\]

(c) Compare $I_d$ and $I$. (What's their ratio?) If the outer cylinder were, say, 2 mm in diameter,
how high would the frequency have to be, for $I_d$ to be 1 percent of $I$? [This problem is
designed to indicate why Faraday never discovered displacement currents, and why it is
ordinarily safe to ignore them unless the frequency is extremely high.]


Problem 7.38. A conducting spherical shell (radius R, centered at the origin) is being charged
by a steady current I flowing up the z-axis and connected to the “south pole,” where it
spreads out uniformly over the surface (see Fig. 7.47). The total charge on the sphere, at time
t, is Q(t) = It. Find the magnetic field at a point $(r, \theta, \phi)$ exterior to the sphere. First
decide what direction you think B points; then apply the Ampère/Maxwell law in integral form


26 This problem raises an interesting quasi-philosophical question: If you measure B in the
laboratory, have you detected the effects of displacement current (as (b) would suggest), or
merely confirmed the effects of ordinary currents (as (c) implies)? See D. F. Bartlett, Am. J.
Phys. 58, 1168 (1990).

(Eq. 7.48) to an appropriate Amperian loop. In fact, do it three different ways: using (as the
surface bounded by the Amperian loop) (a) a spherical cap north of the loop, (b) a spherical
bowl south of the loop, and (c) a cone with vertex at the origin. (d) What does your result say
for $\theta$ very near to $\pi$?²⁷


7.3.3 Maxwell's Equations 


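In the last section we put the finishing touches on Maxwell’s equations:²⁸

\[
\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho && \text{(Gauss's law)},\\
\text{(ii)}\quad & \nabla\cdot\mathbf{B} = 0 && \text{(no name)},\\
\text{(iii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t} && \text{(Faraday's law)},\\
\text{(iv)}\quad & \nabla\times\mathbf{B} = \mu_0\mathbf{J} + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t} && \text{(Ampère's law with Maxwell's correction)}.
\end{aligned}
\tag{7.49}
\]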


Together with the Lorentz force law,

\[
\mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}), \tag{7.50}
\]

they summarize the entire theoretical content of classical electrodynamics²⁹
(save for some special properties of matter, which we encountered in
Chapters 4 and 6). Even the continuity equation,

\[
\nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t}, \tag{7.51}
\]


27 T. H. Boyer, Am. J. Phys. 87, 729 (2019). 

28 Maxwell didn’t have access to modern vector notation (in particular, the del operator), and 
in his formulation there were actually 20 equations — including the potentials as well as the 
fields. It was Oliver Heaviside who distilled them into the tidy package we now know as 
“Maxwell’s equations.” See B. J. Hunt, Physics Today November 2012, p. 48. 

29 Like any differential equations, Maxwell’s must be supplemented by suitable boundary 
conditions. Because these are typically “obvious” from the context (for example, E and B 
go to zero at large distances from a localized charge distribution), it is easy to forget that 
they play an essential role.
which is the mathematical expression of conservation of charge, can be
derived from Maxwell’s equations by applying the divergence to number (iv).
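
That derivation takes only two lines. As a sketch: the divergence of a curl vanishes identically, and Gauss's law (i) replaces $\nabla\cdot\mathbf{E}$ by $\rho/\epsilon_0$:

```latex
\begin{aligned}
0 = \nabla\cdot(\nabla\times\mathbf{B})
  &= \mu_0\,\nabla\cdot\mathbf{J}
     + \mu_0\epsilon_0\,\frac{\partial}{\partial t}(\nabla\cdot\mathbf{E}) \\
  &= \mu_0\,\nabla\cdot\mathbf{J} + \mu_0\,\frac{\partial\rho}{\partial t}
  \quad\Longrightarrow\quad
  \nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t}.
\end{aligned}
```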

I have written Maxwell’s equations in the traditional way, which emphasizes
that they specify the divergence and curl of E and B. In this form, they
reinforce the notion that electric fields can be produced either by charges ($\rho$)
or by changing magnetic fields ($\partial\mathbf{B}/\partial t$), and magnetic fields can be produced
either by currents (J) or by changing electric fields ($\partial\mathbf{E}/\partial t$). Actually, this
is misleading, because $\partial\mathbf{B}/\partial t$ and $\partial\mathbf{E}/\partial t$ are themselves due to charges and
currents. I think it is logically preferable to write

\[
\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho, & \text{(iii)}\quad & \nabla\times\mathbf{E} + \frac{\partial\mathbf{B}}{\partial t} = 0,\\
\text{(ii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{B} - \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t} = \mu_0\mathbf{J},
\end{aligned}
\tag{7.52}
\]

with the fields (E and B) on the left and the sources ($\rho$ and J) on the
right. This notation emphasizes that all electromagnetic fields are ultimately
attributable to charges and currents. Maxwell’s equations tell you how
charges produce fields; reciprocally, the force law tells you how fields affect
charges.


Problem 7.39. Suppose

\[
\mathbf{E}(\mathbf{r},t) = -\frac{1}{4\pi\epsilon_0}\frac{q}{r^2}\,\theta(vt - r)\,\hat{\mathbf{r}}, \qquad \mathbf{B}(\mathbf{r},t) = 0.
\]

(The theta function is defined in Prob. 1.46b.) Show that these fields satisfy all of Maxwell's
equations, and determine $\rho$ and J. Describe the physical situation that gives rise to these fields.


7.3.4 Magnetic Charge 


There is a pleasing symmetry to Maxwell’s equations; it is particularly 
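striking in free space, where $\rho$ and J vanish:

\[
\begin{aligned}
\nabla\cdot\mathbf{E} &= 0, & \nabla\times\mathbf{E} &= -\frac{\partial\mathbf{B}}{\partial t},\\
\nabla\cdot\mathbf{B} &= 0, & \nabla\times\mathbf{B} &= \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t}.
\end{aligned}
\]

If you replace E by B and B by $-\mu_0\epsilon_0\mathbf{E}$, the first pair of equations turns
into the second, and vice versa. This symmetry³⁰ between E and B is
spoiled, though, by the charge term in Gauss’s law and the current term in
Ampère’s law. You can’t help wondering why the corresponding quantities
are “missing” from $\nabla\cdot\mathbf{B}=0$ and $\nabla\times\mathbf{E}=-\partial\mathbf{B}/\partial t$. What if we had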


30 Don’t be distracted by the pesky constants $\mu_0$ and $\epsilon_0$; these are present only because the
SI system measures E and B in different units, and would not occur, for instance, in the
Gaussian system.

\[
\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho_e, & \text{(iii)}\quad & \nabla\times\mathbf{E} = -\mu_0\mathbf{J}_m - \frac{\partial\mathbf{B}}{\partial t},\\
\text{(ii)}\quad & \nabla\cdot\mathbf{B} = \mu_0\rho_m, & \text{(iv)}\quad & \nabla\times\mathbf{B} = \mu_0\mathbf{J}_e + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t}.
\end{aligned}
\tag{7.53}
\]

Then $\rho_m$ would represent the density of magnetic “charge,” and $\rho_e$ the
density of electric charge; $\mathbf{J}_m$ would be the current of magnetic charge, and
$\mathbf{J}_e$ the current of electric charge. Both charges would be conserved:

\[
\nabla\cdot\mathbf{J}_m = -\frac{\partial\rho_m}{\partial t}, \quad\text{and}\quad \nabla\cdot\mathbf{J}_e = -\frac{\partial\rho_e}{\partial t}. \tag{7.54}
\]

The former follows by application of the divergence to (iii), the latter by
taking the divergence of (iv).

In a sense, Maxwell’s equations beg for magnetic charge to exist; it
would fit in so nicely. And yet, in spite of a diligent search, no one has
ever found any.³¹ As far as we know, $\rho_m$ is zero everywhere, and so is
$\mathbf{J}_m$; B is not on equal footing with E: there exist stationary sources for E
(electric charges) but none for B. (This is reflected in the fact that magnetic
multipole expansions have no monopole term, and magnetic dipoles consist
of current loops, not separated north and south “poles.”) Apparently God
just didn’t make any magnetic charge. (In quantum electrodynamics, by the
way, it’s a more than merely aesthetic shame that magnetic charge does not
seem to exist: Dirac showed that the existence of magnetic charge would
explain why electric charge is quantized. See Prob. 8.21.)
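
The two conservation laws in Eq. 7.54 can be checked explicitly. As a sketch, take the divergence of (iii) and of (iv) in Eq. 7.53, use $\nabla\cdot(\nabla\times\mathbf{V})=0$, and substitute (ii) and (i) for the divergences of B and E:

```latex
\begin{aligned}
0 = \nabla\cdot(\nabla\times\mathbf{E})
  &= -\mu_0\,\nabla\cdot\mathbf{J}_m - \frac{\partial}{\partial t}(\nabla\cdot\mathbf{B})
   = -\mu_0\left(\nabla\cdot\mathbf{J}_m + \frac{\partial\rho_m}{\partial t}\right), \\
0 = \nabla\cdot(\nabla\times\mathbf{B})
  &= \mu_0\,\nabla\cdot\mathbf{J}_e + \mu_0\epsilon_0\,\frac{\partial}{\partial t}(\nabla\cdot\mathbf{E})
   = \mu_0\left(\nabla\cdot\mathbf{J}_e + \frac{\partial\rho_e}{\partial t}\right).
\end{aligned}
```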


Problem 7.40. Assuming that “Coulomb's law” for magnetic charges ($q_m$) reads

\[
\mathbf{F} = \frac{\mu_0}{4\pi}\frac{q_{m_1} q_{m_2}}{r^2}\,\hat{\mathbf{r}}, \tag{7.55}
\]

work out the force law for a monopole $q_m$ moving with velocity v through electric and magnetic
fields E and B.³²


Problem 7.41. Suppose a magnetic monopole $q_m$ passes through a resistanceless loop of wire
with self-inductance L. What current is induced in the loop?³³


7.3.5 Maxwell's Equations in Matter 


Maxwell’s equations in the form 7.49 are complete and correct as they 
stand. However, when you are working with materials that are subject to 
electric and magnetic polarization there is a more convenient way to write 


31 For an extensive bibliography, see A. S. Goldhaber and W. P. Trower, Am. J. Phys. 58, 429
(1990). 

32 For interesting commentary, see W. Rindler, Am. J. Phys. 57, 993 (1989). 

33 This is one of the methods used to search for monopoles in the laboratory; see B. Cabrera, 
Phys. Rev. Lett. 48, 1378 (1982).


them. For inside polarized matter there will be accumulations of “bound” 
charge and current, over which you exert no direct control. It would be nice 
to reformulate Maxwell’s equations so as to make explicit reference only to 
the “free” charges and currents. 

We have already learned, from the static case, that an electric polarization
P produces a bound charge density

\[
\rho_b = -\nabla\cdot\mathbf{P} \tag{7.56}
\]

(Eq. 4.12). Likewise, a magnetic polarization (or “magnetization”) M
results in a bound current

\[
\mathbf{J}_b = \nabla\times\mathbf{M} \tag{7.57}
\]

(Eq. 6.13). There’s just one new feature to consider in the nonstatic case:
any change in the electric polarization involves a flow of (bound) charge,
which must be included in the total current. For suppose we examine a
tiny chunk of polarized material (Fig. 7.48). The polarization introduces
a charge density $\sigma_b = P$ at one end and $-\sigma_b$ at the other (Eq. 4.11). If P
now increases a bit, the charge on each end increases accordingly, giving a
net current

\[
dI = \frac{\partial\sigma_b}{\partial t}\,da_\perp = \frac{\partial P}{\partial t}\,da_\perp.
\]

The current density, therefore, is

\[
\mathbf{J}_p = \frac{\partial\mathbf{P}}{\partial t}. \tag{7.58}
\]

This polarization current has nothing to do with the bound current $\mathbf{J}_b$. The
latter is associated with magnetization of the material and involves the spin
and orbital motion of electrons; $\mathbf{J}_p$, by contrast, is the result of the linear
motion of charge when the electric polarization changes. If P points to
the right, and is increasing, then each plus charge moves a bit to the right
and each minus charge to the left; the cumulative effect is the polarization
current $\mathbf{J}_p$.

We ought to check that Eq. 7.58 is consistent with the continuity equation:

\[
\nabla\cdot\mathbf{J}_p = \nabla\cdot\frac{\partial\mathbf{P}}{\partial t} = \frac{\partial}{\partial t}(\nabla\cdot\mathbf{P}) = -\frac{\partial\rho_b}{\partial t}.
\]

Yes: The continuity equation is satisfied; in fact, $\mathbf{J}_p$ is essential to ensure the
conservation of bound charge. (Incidentally, a changing magnetization does
not lead to any analogous accumulation of charge or current. The bound
current $\mathbf{J}_b = \nabla\times\mathbf{M}$ varies in response to changes in M, to be sure, but that’s
about it.)
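
The same consistency check can be run numerically. The sketch below assumes a hypothetical one-dimensional polarization $P(x,t) = P_0\sin(kx)\cos(\omega t)$ pointing along x (the amplitude, wavenumber, and frequency are made up for illustration), builds $\rho_b = -\partial P/\partial x$ and $J_p = \partial P/\partial t$ by central differences, and confirms that $\partial\rho_b/\partial t + \partial J_p/\partial x$ vanishes to rounding error.

```python
import math

# Hypothetical parameters: amplitude (C/m^2), wavenumber (1/m), angular frequency (rad/s).
P0, K, OMEGA = 1.0e-6, 2.0, 3.0
H = 1.0e-4  # finite-difference step used for both x and t


def polarization(x, t):
    """Assumed polarization P(x, t), directed along x."""
    return P0 * math.sin(K * x) * math.cos(OMEGA * t)


def bound_charge(x, t):
    """rho_b = -dP/dx (Eq. 7.56 in one dimension), by central difference."""
    return -(polarization(x + H, t) - polarization(x - H, t)) / (2.0 * H)


def polarization_current(x, t):
    """J_p = dP/dt (Eq. 7.58), by central difference."""
    return (polarization(x, t + H) - polarization(x, t - H)) / (2.0 * H)


def continuity_residual(x, t):
    """d(rho_b)/dt + d(J_p)/dx, which should vanish."""
    drho_dt = (bound_charge(x, t + H) - bound_charge(x, t - H)) / (2.0 * H)
    djp_dx = (polarization_current(x + H, t) - polarization_current(x - H, t)) / (2.0 * H)
    return drho_dt + djp_dx


print(polarization_current(0.3, 0.2), continuity_residual(0.3, 0.2))
```

The residual is tiny compared with the natural scale $P_0 k\omega$ of either term, reflecting the fact that the space and time derivatives commute.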
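In view of all this, the total charge density can be separated into two
parts:

\[
\rho = \rho_f + \rho_b = \rho_f - \nabla\cdot\mathbf{P}, \tag{7.59}
\]

and the current density into three parts:

\[
\mathbf{J} = \mathbf{J}_f + \mathbf{J}_b + \mathbf{J}_p = \mathbf{J}_f + \nabla\times\mathbf{M} + \frac{\partial\mathbf{P}}{\partial t}. \tag{7.60}
\]

Gauss’s law can now be written as

\[
\nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}(\rho_f - \nabla\cdot\mathbf{P}),
\]

or

\[
\nabla\cdot\mathbf{D} = \rho_f, \tag{7.61}
\]

where, as in the static case,

\[
\mathbf{D} = \epsilon_0\mathbf{E} + \mathbf{P}. \tag{7.62}
\]

Meanwhile, Ampère’s law (with Maxwell’s term) becomes

\[
\nabla\times\mathbf{B} = \mu_0\left(\mathbf{J}_f + \nabla\times\mathbf{M} + \frac{\partial\mathbf{P}}{\partial t}\right) + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t},
\]

or

\[
\nabla\times\mathbf{H} = \mathbf{J}_f + \frac{\partial\mathbf{D}}{\partial t}, \tag{7.63}
\]

where, as before,

\[
\mathbf{H} = \frac{1}{\mu_0}\mathbf{B} - \mathbf{M}. \tag{7.64}
\]

Faraday’s law and $\nabla\cdot\mathbf{B}=0$ are not affected by our separation of charge
and current into free and bound parts, since they do not involve $\rho$ or J.

In terms of free charges and currents, then, Maxwell’s equations read

\[
\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{D} = \rho_f, & \text{(iii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
\text{(ii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{H} = \mathbf{J}_f + \frac{\partial\mathbf{D}}{\partial t}.
\end{aligned}
\tag{7.65}
\]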

Some people regard these as the “true” Maxwell’s equations, but please 
understand that they are in no way more “general” than Eq. 7.49; they 
simply reflect a convenient division of charge and current into free and 
nonfree parts. And they have the disadvantage of hybrid notation, since 
they contain both E and D, both B and H. They must be supplemented, 
therefore, by appropriate constitutive relations, giving D and H in terms of 
E and B. These depend on the nature of the material; for linear media, 


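\[
\mathbf{P} = \epsilon_0\chi_e\mathbf{E}, \quad\text{and}\quad \mathbf{M} = \chi_m\mathbf{H}, \tag{7.66}
\]

so

\[
\mathbf{D} = \epsilon\mathbf{E}, \quad\text{and}\quad \mathbf{H} = \frac{1}{\mu}\mathbf{B}, \tag{7.67}
\]

where $\epsilon = \epsilon_0(1+\chi_e)$ and $\mu = \mu_0(1+\chi_m)$. You may remember that D is
called the electric “displacement”; that’s why the second term in the
Ampère/Maxwell equation (iv) came to be called the displacement current.
In this context,

\[
\mathbf{J}_d = \frac{\partial\mathbf{D}}{\partial t}. \tag{7.68}
\]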
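
To get a feel for the size of $\mathbf{J}_d$, it helps to compare it with the conduction current in the same material. The sketch below assumes a linear Ohmic medium, so that $\mathbf{J}_f = \sigma\mathbf{E}$ and $\mathbf{D} = \epsilon\mathbf{E}$, driven by a sinusoidal field $E_0\cos(\omega t)$; the amplitudes are then $\sigma E_0$ and $\omega\epsilon E_0$, and their ratio is $\sigma/\omega\epsilon$. The function name and the material values are hypothetical, chosen only for illustration.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def conduction_to_displacement_ratio(conductivity, rel_permittivity, frequency_hz):
    """Amplitude ratio |J_f| / |J_d| = sigma / (omega * epsilon).

    Assumes J_f = sigma * E and D = epsilon * E with E = E0 * cos(omega * t),
    so the field amplitude E0 cancels out of the ratio.
    """
    omega = 2.0 * math.pi * frequency_hz
    epsilon = rel_permittivity * EPS0
    return conductivity / (omega * epsilon)


# Hypothetical weakly conducting dielectric: sigma = 1e-4 S/m, eps_r = 4.
for f in (1.0e3, 1.0e6, 1.0e9):
    print(f, conduction_to_displacement_ratio(1.0e-4, 4.0, f))
```

The ratio falls off as 1/frequency: conduction dominates at low frequency, and the displacement current takes over only when $\omega\epsilon$ exceeds $\sigma$.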


Problem 7.42. Sea water at frequency $\nu = 4\times 10^8$ Hz has permittivity $\epsilon = 81\,\epsilon_0$, permeability
$\mu = \mu_0$, and resistivity $\rho = 0.23\ \Omega\cdot\text{m}$. What is the (amplitude) ratio of conduction
current to displacement current in sea water, at this frequency? [Hint: Consider a parallel-plate
capacitor immersed in sea water and driven by a voltage $V_0\cos(2\pi\nu t)$.]


7.3.6 Boundary Conditions 


In general, the fields E, B, D, and H will be discontinuous at a boundary 
between two different media, or at a surface that carries a charge density 
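$\sigma$ or a current density K. The explicit form of these discontinuities can be
deduced from Maxwell’s equations (Eq. 7.65), in their integral form

\[
\left.
\begin{aligned}
\text{(i)}\quad & \oint_{\mathcal{S}} \mathbf{D}\cdot d\mathbf{a} = Q_{f_{\text{enc}}},\\
\text{(ii)}\quad & \oint_{\mathcal{S}} \mathbf{B}\cdot d\mathbf{a} = 0,
\end{aligned}
\right\}
\quad\text{over any closed surface } \mathcal{S}.
\]

\[
\left.
\begin{aligned}
\text{(iii)}\quad & \oint_{\mathcal{P}} \mathbf{E}\cdot d\mathbf{l} = -\frac{d}{dt}\int_{\mathcal{S}} \mathbf{B}\cdot d\mathbf{a},\\
\text{(iv)}\quad & \oint_{\mathcal{P}} \mathbf{H}\cdot d\mathbf{l} = I_{f_{\text{enc}}} + \frac{d}{dt}\int_{\mathcal{S}} \mathbf{D}\cdot d\mathbf{a},
\end{aligned}
\right\}
\quad\text{for any surface } \mathcal{S} \text{ bounded by the closed loop } \mathcal{P}.
\]

Applying (i) to a tiny, wafer-thin Gaussian pillbox extending just slightly
into the material on either side of the boundary (Fig. 7.49), we obtain

\[
\mathbf{D}_{\text{above}}\cdot\mathbf{a} - \mathbf{D}_{\text{below}}\cdot\mathbf{a} = \sigma_f\,a.
\]

(The positive direction for a is “upward.” The edge of the wafer contributes
nothing in the limit as the thickness goes to zero; nor does any volume charge
density.) Thus, the component of D that is perpendicular to the interface is
discontinuous in the amount

\[
D^{\perp}_{\text{above}} - D^{\perp}_{\text{below}} = \sigma_f. \tag{7.69}
\]

Identical reasoning, applied to equation (ii), yields

\[
B^{\perp}_{\text{above}} - B^{\perp}_{\text{below}} = 0. \tag{7.70}
\]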


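Turning to (iii), a very thin Amperian loop straddling the surface gives

\[
\mathbf{E}_{\text{above}}\cdot\mathbf{l} - \mathbf{E}_{\text{below}}\cdot\mathbf{l} = -\frac{d}{dt}\int_{\mathcal{S}} \mathbf{B}\cdot d\mathbf{a}.
\]

But in the limit as the width of the loop goes to zero, the flux vanishes.
(I have already dropped the contribution of the two ends to $\oint\mathbf{E}\cdot d\mathbf{l}$, on the
same grounds.) Therefore,

\[
\mathbf{E}^{\parallel}_{\text{above}} - \mathbf{E}^{\parallel}_{\text{below}} = 0. \tag{7.71}
\]

That is, the components of E parallel to the interface are continuous across
the boundary. By the same token, (iv) implies

\[
\mathbf{H}_{\text{above}}\cdot\mathbf{l} - \mathbf{H}_{\text{below}}\cdot\mathbf{l} = I_{f_{\text{enc}}},
\]

where $I_{f_{\text{enc}}}$ is the free current passing through the Amperian loop. No volume
current density will contribute (in the limit of infinitesimal width), but
a surface current can. In fact, if $\hat{\mathbf{n}}$ is a unit vector perpendicular to the
interface (pointing “upward”), so that $(\hat{\mathbf{n}}\times\mathbf{l})$ is normal to the Amperian
loop (Fig. 7.50), then

\[
I_{f_{\text{enc}}} = \mathbf{K}_f\cdot(\hat{\mathbf{n}}\times\mathbf{l}) = (\mathbf{K}_f\times\hat{\mathbf{n}})\cdot\mathbf{l},
\]

and hence

\[
\mathbf{H}^{\parallel}_{\text{above}} - \mathbf{H}^{\parallel}_{\text{below}} = \mathbf{K}_f\times\hat{\mathbf{n}}. \tag{7.72}
\]

So the parallel components of H are discontinuous by an amount proportional
to the free surface current density.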

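Equations 7.69-7.72 are the general boundary conditions for electrodynamics.
In the case of linear media, they can be expressed in terms of E and
B alone:

\[
\begin{aligned}
\text{(i)}\quad & \epsilon_{\text{above}}E^{\perp}_{\text{above}} - \epsilon_{\text{below}}E^{\perp}_{\text{below}} = \sigma_f,\\
\text{(ii)}\quad & B^{\perp}_{\text{above}} - B^{\perp}_{\text{below}} = 0,\\
\text{(iii)}\quad & \mathbf{E}^{\parallel}_{\text{above}} - \mathbf{E}^{\parallel}_{\text{below}} = 0,\\
\text{(iv)}\quad & \frac{1}{\mu_{\text{above}}}\mathbf{B}^{\parallel}_{\text{above}} - \frac{1}{\mu_{\text{below}}}\mathbf{B}^{\parallel}_{\text{below}} = \mathbf{K}_f\times\hat{\mathbf{n}}.
\end{aligned}
\tag{7.73}
\]

In particular, if there is no free charge or free current at the interface, then

\[
\begin{aligned}
\text{(i)}\quad & \epsilon_{\text{above}}E^{\perp}_{\text{above}} - \epsilon_{\text{below}}E^{\perp}_{\text{below}} = 0,\\
\text{(ii)}\quad & B^{\perp}_{\text{above}} - B^{\perp}_{\text{below}} = 0,\\
\text{(iii)}\quad & \mathbf{E}^{\parallel}_{\text{above}} - \mathbf{E}^{\parallel}_{\text{below}} = 0,\\
\text{(iv)}\quad & \frac{1}{\mu_{\text{above}}}\mathbf{B}^{\parallel}_{\text{above}} - \frac{1}{\mu_{\text{below}}}\mathbf{B}^{\parallel}_{\text{below}} = 0.
\end{aligned}
\tag{7.74}
\]

As we shall see in Chapter 9, these equations are the basis for the theory of
reflection and refraction.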
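
As an illustration of how Eq. 7.74 is used in practice, the sketch below takes the fields just above a charge-free, current-free interface between two linear media and returns the fields just below it. The function names, the choice of the z-axis as the default normal, and the sample values are all hypothetical; only ratios of the permittivities and permeabilities enter, so relative values are enough.

```python
def dot(u, v):
    """Dot product of two 3-vectors given as tuples."""
    return sum(a * b for a, b in zip(u, v))


def split(field, n_hat):
    """Split a vector into parts perpendicular and parallel to the interface."""
    perp_mag = dot(field, n_hat)
    perp = tuple(perp_mag * c for c in n_hat)
    par = tuple(f - p for f, p in zip(field, perp))
    return perp, par


def fields_below(e_above, b_above, eps_above, eps_below, mu_above, mu_below,
                 n_hat=(0.0, 0.0, 1.0)):
    """Apply Eq. 7.74 (no free surface charge or current, linear media)."""
    e_perp, e_par = split(e_above, n_hat)
    b_perp, b_par = split(b_above, n_hat)
    # (i) eps * E_perp is continuous; (iii) E_parallel is continuous.
    e_below = tuple((eps_above / eps_below) * p + q for p, q in zip(e_perp, e_par))
    # (ii) B_perp is continuous; (iv) B_parallel / mu is continuous.
    b_below = tuple(p + (mu_below / mu_above) * q for p, q in zip(b_perp, b_par))
    return e_below, b_below


e_below, b_below = fields_below(
    e_above=(1.0, 0.0, 2.0), b_above=(0.0, 3.0, 1.0),
    eps_above=1.0, eps_below=2.0, mu_above=1.0, mu_below=4.0,
)
print(e_below, b_below)
```

Crossing into the medium with the larger permittivity shrinks the normal component of E, while crossing into the medium with the larger permeability stretches the tangential component of B; the other components pass through unchanged.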


7.4 The Field of a Rotating Magnet 


Imagine an axially symmetric magnet rotating at constant angular velocity 
about its axis — a cylindrical bar magnet, if you like, or for that matter the 
Earth itself (pretending that the magnetic axis coincides with the rotation 
axis). It is natural to wonder whether its magnetic field is stationary, or 
rotates with the magnet (or, as Arlo Guthrie would say, “Maybe it doesn’t 
even matter”). Are the field lines “dragged along” by the rotation of 
the source? Faraday did a series of experiments (variations on the disk 
experiment of Ex. 7.4) that appeared to settle the issue. Mounting the 
magnet along the axis of the disk he first held the magnet still while he 
rotated the disk (that’s the original version), and a current was generated 
in the circuit (Fig. 7.51). Then he allowed the magnet to rotate at the same 
rate as the disk; again a current was generated. Finally, he held the disk 
at rest and rotated the magnet. This time no current flowed. He concluded


that the field of the rotating magnet is the same as that of the stationary 
magnet: that’s why the first two experiments give the same result, and the 
third is just what you’d get if the magnet were at rest. Apparently a magnet 
rotating about its axis of symmetry does not drag the field around as it spins; 
the field lines are stationary.³⁴

And yet, if the magnet rotates about an axis perpendicular to its axis of 
symmetry, then it certainly does drag the field along with it. The field lines 
loop around from the north pole to the south pole, and that direction keeps 
changing. So not everyone accepted Faraday’s conclusion, until 1912 when 
experiments by S. J. Barnett finally led to a consensus that Faraday was 
right — in particular, the Earth’s magnetic field (or more precisely the part 
of it associated with rotation about the geographic north-south axis) does 
not rotate with the Earth.³⁵

But what exactly does it mean to say that the field lines do (or do not) 
rotate with their source? Faraday had a very concrete conception of field 
lines as a network of “wires” (or cobwebs) existing “out there” in space. 
In that case, to say that they are moving (or not) makes perfect sense. But 
in Maxwell’s theory it is fields that have real physical existence, and field 
lines are just constructs we devise (by joining up the arrows that represent 
the fields themselves) to help us visualize (and draw) the fields. And we 
have already learned that even static magnetic field lines can be problematic 
(Section 5.3.4). 

Consider for a moment the electrical analog. Imagine a point charge at 
rest at the origin. Its (electric) field “radiates” out in all directions, and it is 
(obviously) constant in time (Fig. 7.52). The field lines are like spokes on a


Fig. 7.52 (panels: t = 0, t = 1, t = 2)


34 If the magnetic field lines were moving, they would presumably have induced an electric 
field, just as they did in Fig. 7.21b. 
35 For the full history, see C. F. Chyba and K. P. Hand, Phys. Rev. Appl. 6, 014017 (2016).
wheel (or rather pins in a pin cushion, in three dimensions). A moment later
the picture looks exactly the same. But suppose I ask which field line at t = 1 
corresponds to the field line I labeled a at t = 0? It’s a’, obviously (I hear you 
say). Not so fast! Why not a”? That would still be a perfectly good field-line 
pattern, representing exactly the same field. The labeling of the field lines
is completely arbitrary, with no physical content at all. If you prefer to say 
that the field line called a (at t = 0) “corresponds” to the field line a” at 
t = 1, and then to a’” at t = 2, and so on, more power to you! For you, the 
field lines rotate, even though the source does not. It’s not wrong, exactly (it 
describes the field correctly at every moment), it’s just dumb — a complicated 
and confusing way to think about something that is intrinsically simple and 
straightforward. 

But suppose instead of a point charge we have a uniformly charged 
sphere, and it’s (uniformly) rotating. The electric field is exactly the same (for 
points outside), and it’s still static (because the charge density is constant in 
time). It is now accompanied by a (static) magnetic field, but that doesn’t 
alter the electric field. Now, however, it is not quite so dumb to wonder 
whether the electric field lines rotate with the sphere. Well, let ’em rotate, if 
it pleases you (you can even have them rotate at some completely different 
rate, about some other axis, if you want to be really perverse); this has 
nothing to do with physics — they represent the same electric field — it 
just has to do with how you choose to direct the field-line movie. Arlo 
was right: it doesn’t even matter. Field lines tell you two things: their 
density tells you the magnitude of the field, and the arrow tells you its 
direction. As long as they get those two things right, you can’t ask for more, 
because that’s all there is to know about the fields. Which field line at one 
moment corresponds to which field line at some other moment... that’s 
a metaphysical question; it has no “right” answer, because it is devoid of 
physical content. And for electric fields that’s pretty much the end of the 
story. 

But the magnetic case is more pernicious, because of the v in the 
Lorentz force law. That’s the velocity of the charge ... but relative to what? 
Nowadays we would answer “relative to whatever inertial reference frame 
you choose to work in — the frame in which the magnetic field is B.” But if 
you conceive of magnetic field lines as “wires” out there in space, moving 
around in time, it is easy to persuade yourself (at any rate, it was in the 
nineteenth century) that v must be the velocity of the charge relative to 
the field line at that location. (Remember those electrical engineers talking 
about “cutting” field lines.) And if that is the case (which, I hasten to say, it 
is not³⁶) then it matters critically whether the field lines are dragged along
with the rotation of the magnet. If they are dragged along, then Faraday 
should have found the same current when the magnet rotated and the disk 
was at rest as he did when the disk rotated and the magnet was at rest, and 


36 Footnote 34 was just plain wrong: moving magnetic field lines don’t induce electric fields — 
time-dependent fields do.
he should have seen no current when the two rotated together. But that’s
the opposite of what he actually observed, so he concluded that the field 
lines are not dragged along by the rotation of the magnet. And similarly 
Barnett concluded that the Earth does not drag its magnetic field around as 
it rotates. It’s not an unreasonable inference, and in practice it is sometimes 
an illuminating way to picture how the field evolves in time. That’s why 
geophysicists, plasma physicists, and astrophysicists use this language. But 
it is a choice, not something forced by the laws of electrodynamics, so 
when you read about field lines being “dragged” around, ask yourself 
what precisely the author means by the statement, and remind yourself 
that field lines don’t come in different colors (or carry serial numbers) 
so you could keep track of individual lines from frame to frame in the 
movie. 


More Problems on Chapter 7 


Problem 7.43. Two long, straight copper pipes, each of radius a, are held a distance 2d apart 
(see Fig. 7.53). One is at potential $V_0$, the other at $-V_0$. The space surrounding the pipes is
filled with weakly conducting material of conductivity $\sigma$. Find the current per unit length that
flows from one pipe to the other. [Hint: Refer to Prob. 3.14.]


Fig. 7.53


Problem 7.44. A rare case in which the electrostatic field E for a circuit can actually be 
calculated is the following: Imagine an infinitely long cylindrical sheet, of uniform resistivity 
and radius a. A slot (corresponding to the battery) is maintained at $\pm V_0/2$, at $\phi = \pm\pi$,
and a steady current flows over the surface, as indicated in Fig. 7.54. According to Ohm's
law, then,

\[
V(a,\phi) = \frac{V_0\phi}{2\pi} \quad (-\pi < \phi < +\pi).
\]

(a) Use separation of variables in cylindrical coordinates to determine $V(s,\phi)$ inside and
outside the cylinder. [Answer: $(V_0/\pi)\tan^{-1}[(s\sin\phi)/(a+s\cos\phi)]$ (for $s<a$);
$(V_0/\pi)\tan^{-1}[(a\sin\phi)/(s+a\cos\phi)]$ (for $s>a$).]

(b) Find the surface charge density on the cylinder. [Answer: $(\epsilon_0 V_0/\pi a)\tan(\phi/2)$.]


37 See M. A. Heald, Am. J. Phys. 52, 522 (1984). See also J. A. Hernandes and A. K. T. Assis, 
Phys. Rev. E 68, 046611 (2003).
Fig. 7.54


Problem 7.45. The magnetic field outside a long straight wire carrying a steady current I is

\[
\mathbf{B} = \frac{\mu_0 I}{2\pi s}\,\hat{\boldsymbol{\phi}}.
\]

The electric field inside the wire is uniform:

\[
\mathbf{E} = \frac{I\rho}{\pi a^2}\,\hat{\mathbf{z}},
\]

where $\rho$ is the resistivity and a is the radius (see Exs. 7.1 and 7.3). Question: What is the electric
field outside the wire?³⁸ The answer depends on how you complete the circuit. Suppose the
current returns along a perfectly conducting grounded coaxial cylinder of radius b (Fig. 7.55).
In the region a < s < b, the potential V(s, z) satisfies Laplace's equation, with the boundary
conditions

\[
\text{(i)}\quad V(a,z) = -\frac{I\rho z}{\pi a^2}; \qquad \text{(ii)}\quad V(b,z) = 0.
\]


Fig. 7.55


This does not suffice to determine the answer — we still need to specify boundary conditions at 
the two ends (though for a long wire it shouldn't matter much). In the literature, it is customary
to sweep this ambiguity under the rug by simply stipulating that V (s, z) is proportional to 
z: V(s,z) = zf (s). On this assumption: 


(a) determine f(s); 
(b) find E(s, z); 
(c) calculate the surface charge density $\sigma(z)$ on the wire.


38 This is a famous problem, first analyzed by Sommerfeld and others, and known in its 
most recent incarnation as Merzbacher’s puzzle. See A. Sommerfeld, Electrodynamics, 
Academic Press, New York (1952), p. 125; E. Merzbacher, Am. J. Phys. 48, 178 (1980); 
further references in R. N. Varnay and L. H. Fisher, Am. J. Phys. 52, 1097 (1984).
[Answer: $V = (-Iz\rho/\pi a^2)[\ln(s/b)/\ln(a/b)]$. This is a peculiar result, since $E_s$ and $\sigma(z)$
are not independent of z (as one would certainly expect them to be for a truly infinite wire).]


Problem 7.46. Refer to Prob. 7.21 and Fig. 7.21b. As a quantitative model,³⁹ let the magnetic
field be uniform ($-B_0\hat{\mathbf{z}}$) in the shaded region (width w), but taper off linearly in a slab of width
d at the left- and right-hand surfaces (Fig. 7.56). Choose axes with x horizontal (to the right),
y vertical (upward), and z coming out of the page, with the origin at the center of the left end
of the wire loop. Set the clock so the left edge of the magnet passes the origin at t = 0.


Fig. 7.56


(a) What is B(x, t) in each of the five regions? (Let the magnet extend indefinitely in the y
and z directions.)

(b) Differentiate to get $\partial\mathbf{B}/\partial t$, and specify the regions in which it is nonzero.

(c) Find the induced electric field in region 3, and the resulting emf in the loop. Is it what you 
expected? 


Problem 7.47. In a perfect conductor, the conductivity is infinite, so E = 0 (Eq. 7.3), 
and any net charge resides on the surface (just as it does for an imperfect conductor, in 
electrostatics). 


(a) Show that the magnetic field is constant ($\partial\mathbf{B}/\partial t = 0$), inside a perfect conductor.
(b) Show that the magnetic flux through a (stationary) perfectly conducting loop is constant. 


A superconductor is a perfect conductor with the additional property that (the constant) B 
inside is in fact zero. (This “flux exclusion” is known as the Meissner effect.) 


(c) Show that the current in a superconductor is confined to the surface. 

(d) Superconductivity is lost above a certain critical temperature ($T_c$), which varies from one
material to another. Suppose you had a sphere (radius a) above its critical temperature, and
you held it in a uniform magnetic field $B_0\hat{\mathbf{z}}$ while cooling it below $T_c$. Find the induced
surface current density K, as a function of the polar angle $\theta$.


39 This problem was suggested by Martin Ligare. 
40 The Meissner effect is sometimes referred to as “perfect diamagnetism,” in the sense that
the field inside is not merely reduced, but canceled entirely. However, the surface currents
responsible for this are free, not bound, so the actual mechanism is quite different.

Problem 7.48. A familiar demonstration of superconductivity (Prob. 7.47) is the levitation of
a magnet over a piece of superconducting material. This phenomenon can be analyzed using
the method of images.⁴¹ Treat the magnet as a perfect dipole m, a height z above the origin
(and constrained to point in the z direction), and pretend that the superconductor occupies the
entire half-space below the xy plane. Because of the Meissner effect, B = 0 for z < 0, and
since B is divergenceless, the normal (z) component is continuous, so $B_z = 0$ just above the
surface. This boundary condition is met by the image configuration in which an identical dipole
is placed at −z, as a stand-in for the superconductor; the two arrangements therefore produce
the same magnetic field in the region z > 0.


(a) Which way should the image dipole point (+z or —z)? 

(b) Find the force on the magnet due to the induced currents in the superconductor (which 
is to say, the force due to the image dipole). Set it equal to Mg (where M is the mass 
of the magnet) to determine the height h at which the magnet will “float.” [Hint: Refer to
Prob. 6.3.] 

(c) The induced current on the surface of the superconductor (the xy plane) can be
determined from the boundary condition on the tangential component of B (Eq. 5.76):
$\mathbf{B} = \mu_0(\mathbf{K}\times\hat{\mathbf{z}})$. Using the field you get from the image configuration, show that

\[
\mathbf{K} = -\frac{3mrh}{2\pi(r^2+h^2)^{5/2}}\,\hat{\boldsymbol{\phi}},
\]

where r is the distance from the origin.


Problem 7.49. If a magnetic dipole levitating above an infinite superconducting plane 
(Prob. 7.48) is free to rotate, what orientation will it adopt, and how high above the surface 
will it float? 


Problem 7.50. A perfectly conducting spherical shell of radius a rotates about the z-axis
with angular velocity $\omega$, in a uniform magnetic field $\mathbf{B} = B_0\hat{\mathbf{z}}$. Calculate the emf developed
between the “north pole” and the equator. (Imagine that there are sliding contacts, as in
Fig. 7.15.) [Answer: $\frac{1}{2}B_0\omega a^2$]


Problem 7.51. Refer to Prob. 7.11 (and use the result of Prob. 5.45): How long does it take a 
falling circular ring (radius a, mass m, resistance R) to cross the bottom of the magnetic field
B, at its (changing) terminal velocity? 


Problem 7.52. 


(a) Referring to Prob. 5.55a and Eq. 7.27, show that

\[
\mathbf{E} = -\frac{\partial\mathbf{A}}{\partial t}, \tag{7.75}
\]

for Faraday-induced electric fields. Check this result by taking the divergence and curl of
both sides.


41 W. M. Saslow, Am. J. Phys. 59, 16 (1991).
(b) A spherical shell of radius R carries a uniform surface charge $\sigma$. It spins about a fixed axis at
an angular velocity $\omega(t)$ that changes slowly with time. Find the electric field inside and
outside the sphere. [Hint: There are two contributions here: the Coulomb field due to the
charge, and the Faraday field due to the changing B. Refer to Ex. 5.11.]


Problem 7.53. Electrons undergoing cyclotron motion can be sped up by increasing the 
magnetic field; the accompanying electric field will impart tangential acceleration. This is the 
principle of the betatron. One would like to keep the radius of the orbit constant during 
the process. Show that this can be achieved by designing a magnet such that the average 
field over the area of the orbit is twice the field at the circumference (Fig. 7.57). Assume the 
electrons start from rest in zero field, and that the apparatus is symmetric about the center 
of the orbit. (Assume also that the electron velocity remains well below the speed of light, so 
that nonrelativistic mechanics applies.) [Hint: Differentiate Eq. 5.3 with respect to time, and use
$F = ma = qE$.]


Fig. 7.57    Fig. 7.58


Problem 7.54. An infinite wire carrying a constant current I in the z direction is moving in
the y direction at a constant speed v. Find the electric field, in the quasistatic approximation, at
the instant the wire coincides with the z-axis (Fig. 7.58). [Answer: $-(\mu_0 I v/2\pi s)\sin\phi\,\hat{\mathbf{z}}$.]


Problem 7.55. An atomic electron (charge q) circles about the nucleus (charge Q) in an orbit
of radius r; the centripetal acceleration is provided, of course, by the Coulomb attraction of
opposite charges. Now a small magnetic field dB is slowly turned on, perpendicular to the
plane of the orbit. Show that the increase in kinetic energy, dT, imparted by the induced
electric field, is just right to sustain circular motion at the same radius r. (That's why, in my
discussion of diamagnetism, I assumed the radius is fixed. See Section 6.1.3 and the references
cited there.)


Problem 7.56. The current in a long solenoid is increasing linearly with time, so the flux is
proportional to $t$: $\Phi = \alpha t$. Two voltmeters are connected to diametrically opposite points ($A$
and $B$), together with resistors ($R_1$ and $R_2$), as shown in Fig. 7.59. What is the reading on
each voltmeter? Assume that these are ideal voltmeters that draw negligible current (they have
huge internal resistance), and that a voltmeter registers $-\int \mathbf{E}\cdot d\mathbf{l}$ between the terminals
and through the meter. [Answer: $V_1 = \alpha R_1/(R_1 + R_2)$; $V_2 = -\alpha R_2/(R_1 + R_2)$. Notice
that $V_1 \neq V_2$, even though they are connected to the same points!⁴²]


Fig. 7.59 (solenoid carrying current I)


Problem 7.57. A circular wire loop (radius $r$, resistance $R$) encloses a region of uniform
magnetic field, $\mathbf{B}$, perpendicular to its plane. The field (occupying the shaded region in Fig. 7.60)
increases linearly with time ($B = \alpha t$). An ideal voltmeter (infinite internal resistance) is
connected between points $P$ and $Q$.


Fig. 7.60 


(a) What is the current in the loop? 
(b) What does the voltmeter read? [Answer: $\alpha r^2/2$.]


Problem 7.58. In the discussion of motional emf (Section 7.1.3) I assumed that the wire loop
(Fig. 7.10) has a resistance $R$; the current generated is then $I = vBh/R$. But what if the wire is
made out of perfectly conducting material, so that $R$ is zero? In that case, the current is limited
only by the back emf associated with the self-inductance of the loop (which would ordinarily be
negligible in comparison with $IR$). As a model, let's simply replace the resistor by an inductor,
$L$. Show that in this régime the loop (mass $m$) executes simple harmonic motion, and find its
frequency.⁴³ [Answer: $\omega = Bh/\sqrt{mL}$.]


Problem 7.59. 


(a) Use the Neumann formula (Eq. 7.32) to calculate the mutual inductance of the configuration
in Fig. 7.37, assuming $a$ is very small ($a \ll b$, $a \ll z$). Compare your answer to Prob. 7.23.


42 See R. H. Romer, Am. J. Phys. 50, 1089 (1982). See also H. W. Nicholson, Am. J. Phys. 73, 
1194 (2005); B. H. McGuyer, Am. J. Phys. 80, 101 (2012). 

43 For a collection of related problems, see W. M. Saslow, Am. J. Phys. 55, 986 (1987); R. H.
Romer, Eur. J. Phys. 11, 103 (1990).

(b) For the general case (not assuming $a$ is small), show that

$$M = \frac{\mu_0 \pi \beta}{2}\sqrt{ab\beta}\left(1 + \frac{15}{8}\beta^2 + \cdots\right),$$

where

$$\beta \equiv \frac{ab}{z^2 + a^2 + b^2}.$$


Problem 7.60. Two coils are wrapped around a cylindrical form in such a way that the same
flux passes through every turn of both coils. (In practice this is achieved by inserting an iron core
through the cylinder; this has the effect of concentrating the flux.) The primary coil has $N_1$
turns and the secondary has $N_2$ (Fig. 7.61). If the current $I$ in the primary is changing, show
that the emf in the secondary is given by

$$\frac{\mathcal{E}_2}{\mathcal{E}_1} = \frac{N_2}{N_1}, \tag{7.76}$$

where $\mathcal{E}_1$ is the (back) emf of the primary. [This is a primitive transformer: a device for raising
or lowering the emf of an alternating current source. By choosing the appropriate number of
turns, any desired secondary emf can be obtained. If you think this violates the conservation of
energy, study Prob. 7.61.]

Fig. 7.61 (primary coil with $N_1$ turns; secondary coil with $N_2$ turns)


Problem 7.61. A transformer (Prob. 7.60) takes an input AC voltage of amplitude $V_1$, and
delivers an output voltage of amplitude $V_2$, which is determined by the turns ratio ($V_2/V_1 =
N_2/N_1$). If $N_2 > N_1$, the output voltage is greater than the input voltage. Why doesn't
this violate conservation of energy? [Answer: Power is the product of voltage and current; if the
voltage goes up, the current must come down. The purpose of this problem is to see exactly how
this works out, in a simplified model.]


(a) In an ideal transformer, the same flux passes through all turns of the primary and of the
secondary. Show that in this case $M^2 = L_1 L_2$, where $M$ is the mutual inductance of
the coils, and $L_1$, $L_2$ are their individual self-inductances.

(b) Suppose the primary is driven with AC voltage $V_{\text{in}} = V_1 \cos(\omega t)$, and the secondary is
connected to a resistor, $R$. Show that the two currents satisfy the relations

$$L_1\frac{dI_1}{dt} + M\frac{dI_2}{dt} = V_1\cos(\omega t); \qquad L_2\frac{dI_2}{dt} + M\frac{dI_1}{dt} = -I_2 R.$$

(c) Using the result in (a), solve these equations for $I_1(t)$ and $I_2(t)$. (Assume $I_1$ has no DC
component.)

(d) Show that the output voltage ($V_{\text{out}} = I_2 R$) divided by the input voltage ($V_{\text{in}}$) is equal
to the turns ratio: $V_{\text{out}}/V_{\text{in}} = N_2/N_1$.

(e) Calculate the input power ($P_{\text{in}} = V_{\text{in}} I_1$) and the output power ($P_{\text{out}} = V_{\text{out}} I_2$), and
show that their averages over a full cycle are equal.


Problem 7.62. An infinite wire runs along the $z$-axis; it carries a current $I(z)$ that is a function
of $z$ (but not of $t$), and a charge density $\lambda(t)$ that is a function of $t$ (but not of $z$).


(a) By examining the charge flowing into a segment $dz$ in a time $dt$, show that $d\lambda/dt =
-dI/dz$. If we stipulate that $\lambda(0) = 0$ and $I(0) = 0$, show that $\lambda(t) = kt$, $I(z) =
-kz$, where $k$ is a constant.

(b) Assume for a moment that the process is quasistatic, so the fields are given by Eqs. 2.9 and
5.38. Show that these are in fact the exact fields, by confirming that all four of Maxwell's
equations are satisfied. (First do it in differential form, for the region $s > 0$, then in integral
form for the appropriate Gaussian cylinder/Ampèrian loop straddling the axis.)


Problem 7.63. Suppose $\mathbf{J}(\mathbf{r})$ is constant in time but $\rho(\mathbf{r}, t)$ is not (conditions that might
prevail, for instance, during the charging of a capacitor).


(a) Show that the charge density at any particular point is a linear function of time:

$$\rho(\mathbf{r}, t) = \rho(\mathbf{r}, 0) + \dot{\rho}(\mathbf{r}, 0)\,t,$$

where $\dot{\rho}(\mathbf{r}, 0)$ is the time derivative of $\rho$ at $t = 0$. [Hint: Use the continuity equation.]


This is not an electrostatic or magnetostatic configuration;⁴⁴ nevertheless, rather surprisingly,
both Coulomb's law (Eq. 2.8) and the Biot-Savart law (Eq. 5.42) hold, as you can confirm by
showing that they satisfy Maxwell's equations.


(b) Show that

$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int \frac{\mathbf{J}(\mathbf{r}')\times\hat{\mathscr{r}}}{\mathscr{r}^2}\,d\tau'$$

obeys Ampère's law with Maxwell's displacement current term (though $\mathbf{J}$, here, is the
ordinary (conduction) current, and does not include the displacement current).


44 Some authors would regard this as magnetostatic, since $\mathbf{B}$ is independent of $t$. For them,
the Biot-Savart law is a general rule of magnetostatics, but $\nabla\cdot\mathbf{J} = 0$ and $\nabla\times\mathbf{B} = \mu_0\mathbf{J}$ apply
only under the additional assumption that $\rho$ is constant. In such a formulation, Maxwell's
displacement term can (in this very special case) be derived from the Biot-Savart law, by
the method of part (b). See D. F. Bartlett, Am. J. Phys. 58, 1168 (1990); D. J. Griffiths and
M. A. Heald, Am. J. Phys. 59, 111 (1991).


Problem 7.64. The magnetic field of an infinite straight wire carrying a steady current $I$ can
be obtained from the displacement current term in the Ampère/Maxwell law, as follows: Picture
the current as consisting of a uniform line charge $\lambda$ moving along the $z$-axis at speed $v$ (so that
$I = \lambda v$), with a tiny gap of length $\epsilon$, which reaches the origin at time $t = 0$. In the next
instant (up to $t = \epsilon/v$) there is no real current passing through a circular Ampèrian loop in the
$xy$ plane, but there is a displacement current, due to the “missing” charge in the gap.


(a) Use Coulomb's law to calculate the $z$ component of the electric field, for points in the
$xy$ plane a distance $s$ from the origin, due to a segment of wire with uniform density $-\lambda$
extending from $z_1 = vt - \epsilon$ to $z_2 = vt$.

(b) Determine the flux of this electric field through a circle of radius $a$ in the $xy$ plane.

(c) Find the displacement current through this circle. Show that $I_d$ is equal to $I$, in the limit as
the gap width ($\epsilon$) goes to zero.


Problem 7.65. A certain transmission line is constructed from two thin metal “ribbons,” of
width $w$, a very small distance $h \ll w$ apart. The current travels down one strip and back
along the other. In each case, it spreads out uniformly over the surface of the ribbon.


(a) Find the capacitance per unit length, $\mathcal{C}$.

(b) Find the inductance per unit length, $\mathcal{L}$.

(c) What is the product $\mathcal{L}\mathcal{C}$, numerically? [The quantities $\mathcal{L}$ and $\mathcal{C}$ will, of course, vary from
one kind of transmission line to another, but their product is a universal constant (check,
for example, the cable in Ex. 7.14), provided the space between the conductors is a vacuum.
In the theory of transmission lines, this product is related to the speed with which a pulse
propagates down the line: $v = 1/\sqrt{\mathcal{L}\mathcal{C}}$.]

(d) If the strips are insulated from one another by a nonconducting material of permittivity $\epsilon$
and permeability $\mu$, what then is the product $\mathcal{L}\mathcal{C}$? What is the propagation speed? [Hint:
See Ex. 4.6; by what factor does $L$ change when an inductor is immersed in linear material
of permeability $\mu$?]


Problem 7.66. Two very long coaxial circular solenoids (radii $a$ and $2a$) have $4n$ and $n$ turns
per unit length, respectively. They both carry a steady current $I$, but in opposite directions.


(a) What is the magnetic flux through a plane, coaxial, circular loop of radius $5a$?

(b) Now suppose they are driven by an alternating current, $I = I_0 \cos\omega t$. In the quasistatic
approximation, what is the induced electric field in each region, $s < a$, $a < s < 2a$,
$s > 2a$?

(c) This changing electric field, in turn, induces a secondary magnetic field, $\mathbf{B}'$. Find this field,
for a point on the axis.

(d) Compare $B'$ and $B$, for points on the axis: what is their ratio? Calculate the actual number,
if the frequency is 60 Hz (ordinary household current) and $a = 2$ cm.


Problem 7.67. Alfvén’s theorem states that in a perfectly conducting fluid (say, a gas of free 
electrons), the magnetic flux through any closed loop moving with the fluid is constant in time. 
(The magnetic field lines are, as it were, “frozen” into the fluid.) Prove it, as follows: 


45 For a slightly different approach to the same problem, see W. K. Terry, Am. J. Phys. 50, 742
(1982).


(a) Use Ohm's law, in the form of Eq. 7.2, together with Faraday's law, to prove that if $\sigma = \infty$
and $\mathbf{J}$ is finite, then

$$\frac{\partial \mathbf{B}}{\partial t} = \nabla\times(\mathbf{v}\times\mathbf{B}).$$


Fig. 7.62


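(b) Let $\mathcal{S}$ be the surface bounded by the loop ($\mathcal{P}$) at time $t$, and $\mathcal{S}'$ a surface bounded by the
loop in its new position ($\mathcal{P}'$) at time $t + dt$ (see Fig. 7.62). The change in flux is

$$d\Phi = \int_{\mathcal{S}'} \mathbf{B}(t + dt)\cdot d\mathbf{a} - \int_{\mathcal{S}} \mathbf{B}(t)\cdot d\mathbf{a}.$$

Use $\nabla\cdot\mathbf{B} = 0$ to show that

$$\int_{\mathcal{S}'} \mathbf{B}(t + dt)\cdot d\mathbf{a} + \int_{\mathcal{R}} \mathbf{B}(t + dt)\cdot d\mathbf{a} = \int_{\mathcal{S}} \mathbf{B}(t + dt)\cdot d\mathbf{a}$$

(where $\mathcal{R}$ is the “ribbon” joining $\mathcal{P}$ and $\mathcal{P}'$), and hence that

$$d\Phi = dt\int_{\mathcal{S}} \frac{\partial\mathbf{B}}{\partial t}\cdot d\mathbf{a} - \int_{\mathcal{R}} \mathbf{B}(t + dt)\cdot d\mathbf{a}$$

(for infinitesimal $dt$). Use the method of Section 7.1.3 to rewrite the second integral as

$$dt\oint_{\mathcal{P}} (\mathbf{B}\times\mathbf{v})\cdot d\mathbf{l},$$

and invoke Stokes' theorem to conclude that

$$\frac{d\Phi}{dt} = \int_{\mathcal{S}} \left(\frac{\partial\mathbf{B}}{\partial t} - \nabla\times(\mathbf{v}\times\mathbf{B})\right)\cdot d\mathbf{a}.$$

Together with the result in (a), this proves the theorem.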


Problem 7.68. 


(a) Show that Maxwell's equations with magnetic charge (Eq. 7.53) are invariant under the
duality transformation

$$\begin{aligned}
\mathbf{E}' &= \mathbf{E}\cos\alpha + c\mathbf{B}\sin\alpha,\\
c\mathbf{B}' &= c\mathbf{B}\cos\alpha - \mathbf{E}\sin\alpha,\\
cq_e' &= cq_e\cos\alpha + q_m\sin\alpha,\\
q_m' &= q_m\cos\alpha - cq_e\sin\alpha,
\end{aligned} \tag{7.77}$$

where $c \equiv 1/\sqrt{\epsilon_0\mu_0}$ and $\alpha$ is an arbitrary rotation angle in “E/B-space.” Charge and
current densities transform in the same way as $q_e$ and $q_m$. [This means, in particular, that
if you know the fields produced by a configuration of electric charge, you can immediately
(using $\alpha = 90^\circ$) write down the fields produced by the corresponding arrangement of
magnetic charge.]

Fig. 7.63

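(b) Show that the force law (Prob. 7.40)

$$\mathbf{F} = q_e(\mathbf{E} + \mathbf{v}\times\mathbf{B}) + q_m\left(\mathbf{B} - \frac{1}{c^2}\mathbf{v}\times\mathbf{E}\right) \tag{7.78}$$

is also invariant under the duality transformation.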


Problem 7.69. Two coaxial circular rings carry steady currents $I_1$ and $I_2$. They are moving at
constant speed $v$ in the direction of the axis (Fig. 7.63). Assume the motion is quasistatic, so the
magnetic fields of the two loops are the same (though moving) as they would be at rest.


(a) What is the emf induced in loop 2, due to loop 1?

(b) Is there a motional emf in loop 2, due to loop 1?

(c) Is there a Faraday emf in loop 2, due to loop 1?

(d) Are your answers to (b) and (c) consistent with (a)? Explain.⁴⁶


46 This problem was suggested by an exchange with Hermann Batelaan. 


Intermission 


All of our cards are now on the table, and in a sense my job is done. In the 
first seven chapters we assembled electrodynamics piece by piece, and now, 
with Maxwell’s equations in their final form, the theory is complete. There 
are no more laws to be learned, no further generalizations to be considered, 
and (with perhaps one exception) no lurking inconsistencies to be resolved. 
If yours is a one-semester course, this would be a reasonable place to stop. 

But in another sense we have just arrived at the starting point. We are at 
last in possession of a full deck — it’s time to deal. This is the fun part, in 
which one comes to appreciate the extraordinary power and richness of 
electrodynamics. In a full-year course there should be plenty of time to 
cover the remaining chapters, and perhaps to supplement them with a unit 
on plasma physics, say, or AC circuit theory, or even a little general relativity. 
But if you have room for only one topic, I’d recommend Chapter 9, on elec- 
tromagnetic waves (you'll probably want to skim Chapter 8 as preparation). 
This is the segue to optics, and is historically the most important application 
of Maxwell’s theory. 


Conservation Laws 


8.1 Charge and Energy 


8.1.1 The Continuity Equation 


In this chapter we study conservation of energy, momentum, and angular 
momentum, in electrodynamics. But I want to begin by reviewing the 
conservation of charge, because it is the paradigm for all conservation laws. 
What precisely does conservation of charge tell us? That the total charge in 
the Universe is constant? Well, sure — that’s global conservation of charge. 
But local conservation of charge is a much stronger statement: if the charge 
in some region changes, then exactly that amount of charge must have 
passed in or out through the surface. The tiger can’t simply rematerialize 
outside the cage; if it got from inside to outside it must have slipped through 
a hole in the fence. 
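Formally, the charge in a volume $\mathcal{V}$ is

$$Q(t) = \int_{\mathcal{V}} \rho(\mathbf{r}, t)\,d\tau, \tag{8.1}$$

and the current flowing out through the boundary $\mathcal{S}$ is $\oint_{\mathcal{S}} \mathbf{J}\cdot d\mathbf{a}$, so local
conservation of charge says

$$\frac{dQ}{dt} = -\oint_{\mathcal{S}} \mathbf{J}\cdot d\mathbf{a}. \tag{8.2}$$

Using Eq. 8.1 to rewrite the left-hand side, and invoking the divergence
theorem on the right, we have

$$\int_{\mathcal{V}} \frac{\partial\rho}{\partial t}\,d\tau = -\int_{\mathcal{V}} \nabla\cdot\mathbf{J}\,d\tau, \tag{8.3}$$

and since this is true for any volume, it follows that

$$\frac{\partial\rho}{\partial t} = -\nabla\cdot\mathbf{J}. \tag{8.4}$$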

This is the continuity equation, the precise mathematical statement of local
conservation of charge. It can be derived from Maxwell's equations (take a
moment to remind yourself how to do that): conservation of charge is
not an independent assumption; it is built into the laws of electrodynamics.
It serves as a constraint on the sources ($\rho$ and $\mathbf{J}$). They can't be just any old
functions; they have to respect conservation of charge.¹

The purpose of this chapter is to develop the corresponding equations 
for local conservation of energy and momentum. In the process (and 
perhaps more important) we will learn how to express the energy density 
and the momentum density (the analogs to $\rho$), as well as the energy
“current” and the momentum “current” (analogous to J). 
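To make the content of Eq. 8.4 concrete, here is a minimal numerical sketch in Python. It assumes a one-dimensional toy model that is not in the text: a Gaussian blob of charge drifting rigidly at speed `V_DRIFT`, so that $J = \rho v$. The function names and the finite-difference step are illustrative choices. The sketch simply checks that $\partial\rho/\partial t + \partial J/\partial x$ vanishes, up to discretization error.

```python
import math

V_DRIFT = 2.0  # drift speed of the charge blob (arbitrary illustrative value)

def rho(x, t):
    """Charge density: a Gaussian blob moving rigidly at speed V_DRIFT."""
    return math.exp(-(x - V_DRIFT * t) ** 2)

def current(x, t):
    """Current density of rigidly drifting charge, J = rho * v (one dimension)."""
    return V_DRIFT * rho(x, t)

def continuity_residual(x, t, h=1e-5):
    """Central-difference estimate of d(rho)/dt + dJ/dx, which Eq. 8.4 says is zero."""
    drho_dt = (rho(x, t + h) - rho(x, t - h)) / (2 * h)
    dj_dx = (current(x + h, t) - current(x - h, t)) / (2 * h)
    return drho_dt + dj_dx

residuals = [continuity_residual(x, 0.3) for x in (-1.0, 0.0, 0.5, 2.0)]
print(residuals)  # each entry should be zero up to finite-difference error
```

Neither term is small on its own here; it is only their sum that cancels, which is what local conservation demands.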


8.1.2 Poynting’s Theorem 


In Chapter 2, we found that the work necessary to assemble a static charge
distribution (against the Coulomb repulsion of like charges) is (Eq. 2.45)

$$W_e = \frac{\epsilon_0}{2}\int E^2\,d\tau,$$

where $\mathbf{E}$ is the resulting electric field. Likewise, the work required to get
currents going (against the back emf) is (Eq. 7.44)

$$W_m = \frac{1}{2\mu_0}\int B^2\,d\tau,$$

where $\mathbf{B}$ is the resulting magnetic field. This suggests that the total energy
stored in electromagnetic fields, per unit volume, is

$$u = \frac{1}{2}\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right). \tag{8.5}$$

In this section I will confirm Eq. 8.5, and develop the energy conservation
law for electrodynamics.

Suppose we have some charge and current configuration which, at time 
t, produces fields E and B. In the next instant, dt, the charges move around 
a bit. Question: How much work, dW, is done by the electromagnetic forces 
acting on these charges, in the interval dt? According to the Lorentz force 
law, the work done on a charge q is 


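$$dW = \mathbf{F}\cdot d\mathbf{l} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B})\cdot\mathbf{v}\,dt = q\mathbf{E}\cdot\mathbf{v}\,dt.$$

In terms of the charge and current densities, $q \to \rho\,d\tau$ and $\rho\mathbf{v} \to \mathbf{J}$,² so the
rate at which work is done on all the charges in a volume $\mathcal{V}$ is

$$\frac{dW}{dt} = \int_{\mathcal{V}} (\mathbf{E}\cdot\mathbf{J})\,d\tau. \tag{8.6}$$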


1 The continuity equation is the only such constraint. Any functions $\rho(\mathbf{r}, t)$ and $\mathbf{J}(\mathbf{r}, t)$
consistent with Eq. 8.4 constitute possible charge and current densities, in the sense of
admitting solutions to Maxwell's equations.

2 Of course, if charges of both signs are present, the net charge density can be zero even when
the current is not; in fact, this is the case for ordinary current-carrying wires. We should
really treat the positive and negative charges separately, and combine the two to get Eq. 8.6,
with $\mathbf{J} = \rho_+\mathbf{v}_+ + \rho_-\mathbf{v}_-$. See Chapter 5, footnote 8.

Evidently, $\mathbf{E}\cdot\mathbf{J}$ is the work done per unit time, per unit volume, which
is to say, the power delivered per unit volume. We can express this quantity
in terms of the fields alone, using the Ampère-Maxwell law to eliminate $\mathbf{J}$:³

$$\mathbf{E}\cdot\mathbf{J} = \frac{1}{\mu_0}\mathbf{E}\cdot(\nabla\times\mathbf{B}) - \epsilon_0\mathbf{E}\cdot\frac{\partial\mathbf{E}}{\partial t}.$$


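Product rule 6 says

$$\nabla\cdot(\mathbf{E}\times\mathbf{B}) = \mathbf{B}\cdot(\nabla\times\mathbf{E}) - \mathbf{E}\cdot(\nabla\times\mathbf{B}).$$

Invoking Faraday's law ($\nabla\times\mathbf{E} = -\partial\mathbf{B}/\partial t$), it follows that

$$\mathbf{E}\cdot(\nabla\times\mathbf{B}) = -\mathbf{B}\cdot\frac{\partial\mathbf{B}}{\partial t} - \nabla\cdot(\mathbf{E}\times\mathbf{B}).$$

Meanwhile,

$$\mathbf{B}\cdot\frac{\partial\mathbf{B}}{\partial t} = \frac{1}{2}\frac{\partial}{\partial t}(B^2), \quad\text{and}\quad \mathbf{E}\cdot\frac{\partial\mathbf{E}}{\partial t} = \frac{1}{2}\frac{\partial}{\partial t}(E^2), \tag{8.7}$$

so

$$\mathbf{E}\cdot\mathbf{J} = -\frac{1}{2}\frac{\partial}{\partial t}\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right) - \frac{1}{\mu_0}\nabla\cdot(\mathbf{E}\times\mathbf{B}). \tag{8.8}$$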


Putting this into Eq. 8.6, and applying the divergence theorem to the
second term, we have

$$\frac{dW}{dt} = -\frac{d}{dt}\int_{\mathcal{V}} \frac{1}{2}\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right)d\tau - \frac{1}{\mu_0}\oint_{\mathcal{S}} (\mathbf{E}\times\mathbf{B})\cdot d\mathbf{a}, \tag{8.9}$$

where $\mathcal{S}$ is the surface bounding $\mathcal{V}$. This is Poynting's theorem; it is the
“work-energy theorem” of electrodynamics. The first integral on the right
is the total energy stored in the fields, $\int u\,d\tau$ (Eq. 8.5). The second term
evidently represents the rate at which energy is transported out of $\mathcal{V}$, across
its boundary surface, by the electromagnetic fields. Poynting's theorem says,
then, that the work done on the charges by the electromagnetic force is equal
to the decrease in energy remaining in the fields, less the energy that flowed
out through the surface.

The energy per unit time, per unit area, transported by the fields is called
the Poynting vector:

$$\mathbf{S} \equiv \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}). \tag{8.10}$$


Specifically, $\mathbf{S}\cdot d\mathbf{a}$ is the energy per unit time crossing the infinitesimal
surface $d\mathbf{a}$: the energy flux (so $\mathbf{S}$ is the energy flux density).⁴ We will see
many applications of the Poynting vector in Chapters 9 and 11, but for the
moment I am mainly interested in using it to express Poynting's theorem
more compactly:

$$\frac{dW}{dt} = -\frac{d}{dt}\int_{\mathcal{V}} u\,d\tau - \oint_{\mathcal{S}} \mathbf{S}\cdot d\mathbf{a}. \tag{8.11}$$


3 As you will discover, this is a standard move in field theory: get rid of the sources ($\rho$ and $\mathbf{J}$)
in favor of the fields ($\mathbf{E}$ and $\mathbf{B}$), using the field equations (Gauss and Ampère-Maxwell).

4 If you're very fastidious, you'll notice a small gap in the logic here: we know from Eq. 8.9
that $\oint \mathbf{S}\cdot d\mathbf{a}$ is the total power passing through a closed surface, but this does not prove that
$\int \mathbf{S}\cdot d\mathbf{a}$ is the power passing through any open surface (there could be an extra term that
integrates to zero over all closed surfaces). This is, however, the obvious and natural
interpretation; as always, the precise location of energy is not really determined in electrodynamics
(see Section 2.4.4).


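What if no work is done on the charges in $\mathcal{V}$; what if, for example, $\mathcal{V}$ is in
a region of empty space, where there is no charge? In that case $dW/dt = 0$, so

$$\int \frac{\partial u}{\partial t}\,d\tau = -\oint \mathbf{S}\cdot d\mathbf{a} = -\int (\nabla\cdot\mathbf{S})\,d\tau,$$

and hence

$$\frac{\partial u}{\partial t} = -\nabla\cdot\mathbf{S}. \tag{8.12}$$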


This is the “continuity equation” for energy: $u$ (energy density) plays the
role of $\rho$ (charge density), and $\mathbf{S}$ takes the part of $\mathbf{J}$ (current density). It
expresses local conservation of electromagnetic energy.

In general, though, electromagnetic energy by itself is not conserved (nor 
is the mechanical energy of the charges). Of course not! The fields do work 
on the charges, and the charges create fields — energy is tossed back and 
forth between them. In the overall energy economy, you must include the 
contributions of both the matter and the fields. 
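As a hedged illustration of Eq. 8.12, the Python sketch below evaluates $u$ (Eq. 8.5) and $\mathbf{S}$ (Eq. 8.10) for a linearly polarized plane wave in empty space and checks numerically that $\partial u/\partial t + \nabla\cdot\mathbf{S} = 0$. Two assumptions are mine, not the text's: the plane-wave form of the fields (which is only derived in Chapter 9), and units in which $\epsilon_0 = \mu_0 = 1$, so $c = 1$. All function names are illustrative.

```python
import math

EPS0 = 1.0  # natural units chosen for this sketch (assumption), so c = 1
MU0 = 1.0

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def energy_density(E, B):
    """Eq. 8.5: u = (eps0 E^2 + B^2 / mu0) / 2."""
    return 0.5 * (EPS0 * dot(E, E) + dot(B, B) / MU0)

def poynting(E, B):
    """Eq. 8.10: S = (E x B) / mu0."""
    return tuple(c / MU0 for c in cross(E, B))

def plane_wave(z, t, amplitude=1.0, k=2.0):
    """Assumed vacuum wave travelling in +z: E along x, B along y, |B| = |E|/c."""
    phase = k * (z - t)
    E = (amplitude * math.cos(phase), 0.0, 0.0)
    B = (0.0, amplitude * math.cos(phase), 0.0)
    return E, B

def energy_residual(z, t, h=1e-5):
    """Finite-difference estimate of du/dt + dS_z/dz (S has only a z component here)."""
    du_dt = (energy_density(*plane_wave(z, t + h))
             - energy_density(*plane_wave(z, t - h))) / (2 * h)
    dsz_dz = (poynting(*plane_wave(z + h, t))[2]
              - poynting(*plane_wave(z - h, t))[2]) / (2 * h)
    return du_dt + dsz_dz

print(energy_residual(0.3, 0.1))  # should vanish up to finite-difference error
```

Because there are no charges in the region, no work is done and the field energy is conserved by itself; with sources present the residual would instead equal $-\mathbf{E}\cdot\mathbf{J}$.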


Example 8.1. When current flows down a resistive wire, work is done,
which shows up as Joule heating of the wire (Eq. 7.7). Though there
are certainly easier ways to do it, the energy per unit time delivered
to the wire can be calculated using the Poynting vector. Assuming it's
uniform, the electric field inside the wire is

$$E = \frac{V}{L},$$

where $V$ is the potential difference between the ends and $L$ is the length
of the wire (Fig. 8.1). The magnetic field is “circumferential”; at the
surface (radius $a$) it has the value

$$B = \frac{\mu_0 I}{2\pi a}.$$

Accordingly, the magnitude of the Poynting vector is

$$S = \frac{1}{\mu_0}\frac{V}{L}\frac{\mu_0 I}{2\pi a} = \frac{VI}{2\pi a L},$$

and it points radially inward. The energy per unit time passing in
through the surface of the wire is therefore

$$\int \mathbf{S}\cdot d\mathbf{a} = S(2\pi a L) = VI,$$

which is exactly what we concluded, on much more direct grounds, in
Section 7.1.1.⁵

Fig. 8.1 (wire of radius a, showing E, B, and S)
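The arithmetic of Ex. 8.1 can be checked with a few lines of Python. This is only a sketch: the numerical values of $V$, $I$, $L$, and $a$ are arbitrary, the wire is taken to lie along the $z$-axis with the current in the $+z$ direction, and the fields are evaluated at the single surface point $(a, 0, 0)$, where $\hat{\boldsymbol{\phi}} = \hat{\mathbf{y}}$. The function name `wire_power` is hypothetical.

```python
import math

MU0 = 4e-7 * math.pi  # permeability of free space (conventional SI value)

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def wire_power(V, I, L, a):
    """Poynting vector at the surface point (a, 0, 0) and total power flowing in."""
    E = (0.0, 0.0, V / L)                        # uniform field along the wire
    B = (0.0, MU0 * I / (2 * math.pi * a), 0.0)  # circumferential; phi-hat = y-hat here
    S = tuple(c / MU0 for c in cross(E, B))      # Eq. 8.10
    inward_flux = -S[0]                          # radial direction is x-hat at this point
    return S, inward_flux * 2 * math.pi * a * L  # S times the area of the curved surface

S, power = wire_power(V=12.0, I=2.0, L=1.0, a=1e-3)
print(S, power)  # S points in the -x (inward) direction; power should equal V*I
```

The radius $a$ drops out of the final result, as it must: the power delivered depends only on $V$ and $I$.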


Problem 8.1. Calculate the power (energy per unit time) transported down the cables of
Ex. 7.14 and Prob. 7.65, assuming the two conductors are held at potential difference $V$, and
carry current $I$ (down one and back up the other).


Problem 8.2. Imagine two infinite parallel nonconducting planes, a distance $d$ apart, carrying
equal and opposite uniform surface charges $\pm\sigma$. A point charge $q$ is at rest in the space
between them, a distance $z_0$ above the $-\sigma$ surface. The electric field is $\mathbf{E} = \mathbf{E}_\sigma + \mathbf{E}_q$. The
total field energy is infinite, but the interaction energy (subtracting off the portions due to $\sigma$
alone and $q$ alone) is finite.


(a) Calculate the interaction energy of this configuration.

(b) Compare the interaction energy to the potential energy of $q$ (take it to be zero at the center
between the planes).

(c) From the gradient of the interaction energy determine the force on $q$ (no surprise).⁶


Problem 8.3. Consider the charging capacitor in Prob. 7.35. 


(a) Find the electric and magnetic fields in the gap, as functions of the distance $s$ from the axis
and the time $t$. (Assume the charge is zero at $t = 0$.)

(b) Find the energy density $u_{\text{em}}$ and the Poynting vector $\mathbf{S}$ in the gap. Note especially the
direction of $\mathbf{S}$. Check that Eq. 8.12 is satisfied.

(c) Determine the total energy in the gap, as a function of time. Calculate the total power
flowing into the gap, by integrating the Poynting vector over the appropriate surface. Check
that the power input is equal to the rate of increase of energy in the gap (Eq. 8.9; in this
case $W = 0$, because there is no charge in the gap). [If you're worried about the fringing
fields, do it for a volume of radius $b < a$ well inside the gap.]


5 What about energy flow down the wire? For a discussion, see M. K. Harbola, Am. J. Phys. 
78, 1203 (2010). For a more sophisticated geometry, see B. S. Davis and L. Kaplan, Am. J. 
Phys. 79, 1155 (2011). 

6 For commentary, see R. C. Hilborn, Am. J. Phys. 84, 531 (2016).


Problem 8.4. For the charging sphere in Prob. 7.38:


(a) find the Poynting vector;

(b) find the energy density in the fields;

(c) check that the electromagnetic energy is locally conserved (Eq. 8.12), except on the negative
$z$-axis;

(d) on the negative $z$-axis, however, external work is being done on the current, pushing the
charges in against the electric field. Calculate the external power delivered, and show that
it is equal to the rate of increase of the energy in the fields.⁷


8.2 Momentum 


8.2.1 Newton’s Third Law in Electrodynamics 


Imagine a point charge q traveling in along the x-axis at a constant speed 
v. Because it is moving, its electric field is not given by Coulomb’s law; 
nevertheless, E still points radially outward from the instantaneous position 
of the charge (Fig. 8.2a), as we'll see in Chapter 10. Since, moreover, a 
moving point charge does not constitute a steady current, its magnetic field 
is not given by the Biot—Savart law. Nevertheless, it’s a fact that B still circles 
around the axis in a manner suggested by the right-hand rule (Fig. 8.2b); 
again, the proof will come in Chapter 10. 

Now suppose this charge encounters an identical one, proceeding in
at the same speed along the $y$-axis. Of course, the electromagnetic force
between them would tend to drive them off the axes, but let's assume that
they're mounted on tracks, or something, so they're obliged to maintain the
same direction and the same speed (Fig. 8.3). The electric force between
them is repulsive, but how about the magnetic force? Well, the magnetic
field of $q_1$ points into the page (at the position of $q_2$), so the magnetic force
on $q_2$ is toward the right, whereas the magnetic field of $q_2$ is out of the
page (at the position of $q_1$), and the magnetic force on $q_1$ is upward. The net
electromagnetic force of $q_1$ on $q_2$ is equal but not opposite to the force of $q_2$
on $q_1$, in violation of Newton's third law. In electrostatics and magnetostatics
the third law holds, but in electrodynamics it does not.

Fig. 8.2 (panels (a) and (b))    Fig. 8.3

7 T. H. Boyer, Am. J. Phys. 87, 729 (2019).
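The failure of the third law can be seen in a rough numerical sketch. The Python below is an approximation under stated assumptions, not the exact calculation: it uses the low-speed ($v \ll c$) expressions for the fields of a moving point charge (Coulomb's $\mathbf{E}$, and $\mathbf{B} = (\mu_0/4\pi)\,q\,\mathbf{v}\times\boldsymbol{\mathscr{r}}/\mathscr{r}^3$), whereas the exact fields are only derived in Chapter 10. The charge, speed, and distance are arbitrary, and all helper names are illustrative. Charge 1 sits on the $x$-axis moving toward the origin; charge 2 sits on the $y$-axis moving toward the origin.

```python
import math

K_E = 8.9875517923e9  # 1/(4 pi eps0) in SI units
K_M = 1.0e-7          # mu0/(4 pi) in SI units (conventional value)

def sub(a, b): return tuple(x - y for x, y in zip(a, b))
def add(a, b): return tuple(x + y for x, y in zip(a, b))
def scale(s, a): return tuple(s * x for x in a)
def norm(a): return math.sqrt(sum(x * x for x in a))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def low_speed_fields(q, r_src, v_src, r_obs):
    """Approximate E and B of a slowly moving point charge (assumes v << c)."""
    sep = sub(r_obs, r_src)
    d = norm(sep)
    E = scale(K_E * q / d**3, sep)
    B = scale(K_M * q / d**3, cross(v_src, sep))
    return E, B

def force_on(q, v, E, B):
    """Lorentz force F = q (E + v x B)."""
    return scale(q, add(E, cross(v, B)))

q, speed, dist = 1e-6, 1e5, 1.0
r1, v1 = (dist, 0.0, 0.0), (-speed, 0.0, 0.0)  # charge 1: on x-axis, moving inward
r2, v2 = (0.0, dist, 0.0), (0.0, -speed, 0.0)  # charge 2: on y-axis, moving inward

F_on_2 = force_on(q, v2, *low_speed_fields(q, r1, v1, r2))
F_on_1 = force_on(q, v1, *low_speed_fields(q, r2, v2, r1))
net = add(F_on_1, F_on_2)

print(norm(F_on_1), norm(F_on_2))  # equal magnitudes
print(net)                         # nonzero: the two forces are not opposite
```

The electric parts cancel in the sum, as the third law would require; what survives is the pair of magnetic forces (one to the right, one upward), so the mechanical momentum of the two charges alone is not conserved.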

Well, that’s an interesting curiosity, but then, how often does one actually 
use the third law, in practice? Answer: All the time! For the proof of 
conservation of momentum rests on the cancelation of internal forces, 
which follows from the third law. When you tamper with the third law, you 
are placing conservation of momentum in jeopardy, and there is hardly any 
principle in physics more sacred than that. 

Momentum conservation is rescued, in electrodynamics, by the real- 
ization that the fields themselves carry momentum. This is not so sur- 
prising when you consider that we have already attributed energy to the 
fields. Whatever momentum is lost to the particles is gained by the fields. 
Only when the field momentum is added to the mechanical momentum is 
momentum conservation restored. 


8.2.2 Maxwell's Stress Tensor 


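Let's calculate the total electromagnetic force on the charges in volume $\mathcal{V}$:

$$\mathbf{F} = \int_{\mathcal{V}} (\mathbf{E} + \mathbf{v}\times\mathbf{B})\rho\,d\tau = \int_{\mathcal{V}} (\rho\mathbf{E} + \mathbf{J}\times\mathbf{B})\,d\tau. \tag{8.13}$$

The force per unit volume is

$$\mathbf{f} = \rho\mathbf{E} + \mathbf{J}\times\mathbf{B}. \tag{8.14}$$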

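As before, I propose to express this in terms of fields alone, eliminating
$\rho$ and $\mathbf{J}$ by using Maxwell's equations (i) and (iv):

$$\mathbf{f} = \epsilon_0(\nabla\cdot\mathbf{E})\mathbf{E} + \left(\frac{1}{\mu_0}\nabla\times\mathbf{B} - \epsilon_0\frac{\partial\mathbf{E}}{\partial t}\right)\times\mathbf{B}.$$

Now

$$\frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B}) = \left(\frac{\partial\mathbf{E}}{\partial t}\times\mathbf{B}\right) + \left(\mathbf{E}\times\frac{\partial\mathbf{B}}{\partial t}\right),$$

and Faraday's law says

$$\frac{\partial\mathbf{B}}{\partial t} = -\nabla\times\mathbf{E},$$

so

$$\frac{\partial\mathbf{E}}{\partial t}\times\mathbf{B} = \frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B}) + \mathbf{E}\times(\nabla\times\mathbf{E}).$$

Thus

$$\mathbf{f} = \epsilon_0[(\nabla\cdot\mathbf{E})\mathbf{E} - \mathbf{E}\times(\nabla\times\mathbf{E})] - \frac{1}{\mu_0}[\mathbf{B}\times(\nabla\times\mathbf{B})] - \epsilon_0\frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B}). \tag{8.15}$$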


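Just to make things look more symmetrical, let's throw in a term
$(\nabla\cdot\mathbf{B})\mathbf{B}$; since $\nabla\cdot\mathbf{B} = 0$, this costs us nothing. Meanwhile, product rule
4 says

$$\nabla(E^2) = 2(\mathbf{E}\cdot\nabla)\mathbf{E} + 2\mathbf{E}\times(\nabla\times\mathbf{E}),$$

so

$$\mathbf{E}\times(\nabla\times\mathbf{E}) = \frac{1}{2}\nabla(E^2) - (\mathbf{E}\cdot\nabla)\mathbf{E},$$

and the same goes for $\mathbf{B}$. Therefore,

$$\mathbf{f} = \epsilon_0[(\nabla\cdot\mathbf{E})\mathbf{E} + (\mathbf{E}\cdot\nabla)\mathbf{E}] + \frac{1}{\mu_0}[(\nabla\cdot\mathbf{B})\mathbf{B} + (\mathbf{B}\cdot\nabla)\mathbf{B}] - \frac{1}{2}\nabla\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right) - \epsilon_0\frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B}). \tag{8.16}$$

Ugly! But it can be simplified by introducing the Maxwell stress tensor,

$$T_{ij} \equiv \epsilon_0\left(E_i E_j - \frac{1}{2}\delta_{ij}E^2\right) + \frac{1}{\mu_0}\left(B_i B_j - \frac{1}{2}\delta_{ij}B^2\right). \tag{8.17}$$

The indices $i$ and $j$ refer to the coordinates $x$, $y$, and $z$, so the stress tensor has
a total of nine components ($T_{xx}$, $T_{yy}$, $T_{xz}$, $T_{yx}$, and so on). The Kronecker
delta, $\delta_{ij}$, is 1 if the indices are the same ($\delta_{xx} = \delta_{yy} = \delta_{zz} = 1$) and 0 otherwise
($\delta_{xy} = \delta_{xz} = \delta_{yz} = 0$). Thus

$$T_{xx} = \frac{1}{2}\epsilon_0\left(E_x^2 - E_y^2 - E_z^2\right) + \frac{1}{2\mu_0}\left(B_x^2 - B_y^2 - B_z^2\right),$$

$$T_{xy} = \epsilon_0(E_x E_y) + \frac{1}{\mu_0}(B_x B_y),$$

and so on.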
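Because it carries two indices, where a vector has only one, $T_{ij}$ is sometimes
written with a double arrow: $\overleftrightarrow{\mathbf{T}}$. One can form the dot product of $\overleftrightarrow{\mathbf{T}}$
with a vector $\mathbf{a}$, in two ways (on the left, and on the right):

$$(\mathbf{a}\cdot\overleftrightarrow{\mathbf{T}})_j = \sum_{i=x,y,z} a_i T_{ij}; \qquad (\overleftrightarrow{\mathbf{T}}\cdot\mathbf{a})_j = \sum_{i=x,y,z} T_{ji}\,a_i. \tag{8.18}$$

The resulting object, which has one remaining index, is itself a vector. In
particular, the divergence of $\overleftrightarrow{\mathbf{T}}$ has, as its $j$th component,

$$(\nabla\cdot\overleftrightarrow{\mathbf{T}})_j = \epsilon_0\left[(\nabla\cdot\mathbf{E})E_j + (\mathbf{E}\cdot\nabla)E_j - \frac{1}{2}\nabla_j E^2\right] + \frac{1}{\mu_0}\left[(\nabla\cdot\mathbf{B})B_j + (\mathbf{B}\cdot\nabla)B_j - \frac{1}{2}\nabla_j B^2\right].$$

Thus the force per unit volume (Eq. 8.16) can be written in the much tidier
form

$$\mathbf{f} = \nabla\cdot\overleftrightarrow{\mathbf{T}} - \epsilon_0\mu_0\frac{\partial\mathbf{S}}{\partial t}, \tag{8.19}$$

where $\mathbf{S}$ is the Poynting vector (Eq. 8.10).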
The total electromagnetic force on the charges in $\mathcal{V}$ (Eq. 8.13) is

$$\mathbf{F} = \oint_{\mathcal{S}} \overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a} - \epsilon_0\mu_0\frac{d}{dt}\int_{\mathcal{V}} \mathbf{S}\,d\tau. \tag{8.20}$$

(I used the divergence theorem to convert the first term to a surface integral.)
In the static case the second term drops out, and the electromagnetic
force on the charge configuration can be expressed entirely in terms of the
stress tensor at the boundary:

$$\mathbf{F} = \oint_{\mathcal{S}} \overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a} \quad \text{(static)}. \tag{8.21}$$

Physically, $\overleftrightarrow{\mathbf{T}}$ is the force per unit area (or stress) acting on the surface.
More precisely, $T_{ij}$ is the force (per unit area) in the $i$th direction acting on
an element of surface oriented in the $j$th direction: “diagonal” elements
($T_{xx}$, $T_{yy}$, $T_{zz}$) represent pressure, and “off-diagonal” elements ($T_{xy}$, $T_{xz}$,
etc.) are shears.
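For readers who like to see the index gymnastics spelled out, here is a small Python sketch of Eq. 8.17. The function name `stress_tensor`, the default SI constants, and the sample field values are my own illustrative choices. It builds the nine components from given $\mathbf{E}$ and $\mathbf{B}$; two quick sanity checks are that the tensor is symmetric ($T_{ij} = T_{ji}$) and that its trace is minus the energy density of Eq. 8.5.

```python
import math

EPS0_SI = 8.8541878128e-12  # permittivity of free space (F/m)
MU0_SI = 4e-7 * math.pi     # permeability of free space (conventional value)

def stress_tensor(E, B, eps0=EPS0_SI, mu0=MU0_SI):
    """Maxwell stress tensor T[i][j] of Eq. 8.17; indices 0, 1, 2 stand for x, y, z."""
    E2 = sum(c * c for c in E)
    B2 = sum(c * c for c in B)
    T = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            delta = 1.0 if i == j else 0.0  # Kronecker delta
            T[i][j] = (eps0 * (E[i] * E[j] - 0.5 * delta * E2)
                       + (B[i] * B[j] - 0.5 * delta * B2) / mu0)
    return T

# Sample fields in units with eps0 = mu0 = 1 (an assumption made for readability).
E = (1.0, 2.0, 3.0)
B = (0.5, -1.0, 2.0)
T = stress_tensor(E, B, eps0=1.0, mu0=1.0)

trace = T[0][0] + T[1][1] + T[2][2]
u = 0.5 * (sum(c * c for c in E) + sum(c * c for c in B))
print(T[0][1], trace, -u)  # T_xy = E_x E_y + B_x B_y; the trace equals -u
```

In practice one rarely needs all nine components; symmetry and the geometry of the surface usually reduce the work to one or two of them.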


Example 8.2. Determine the net force on the “northern” hemisphere 
of a (stationary) uniformly charged solid sphere of radius R and charge 
Q (the same as Prob. 2.48, only this time we’ll use the Maxwell stress 
tensor and Eq. 8.21). 


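Solution. The boundary surface consists of two parts: a hemispherical
“bowl” at radius $R$, and a circular disk at $\theta = \pi/2$ (Fig. 8.4). For
the bowl,

$$d\mathbf{a} = R^2\sin\theta\,d\theta\,d\phi\,\hat{\mathbf{r}}$$

and

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{Q}{R^2}\hat{\mathbf{r}}.$$

In Cartesian components,

$$\hat{\mathbf{r}} = \sin\theta\cos\phi\,\hat{\mathbf{x}} + \sin\theta\sin\phi\,\hat{\mathbf{y}} + \cos\theta\,\hat{\mathbf{z}},$$

so

$$\begin{aligned}
T_{zx} &= \epsilon_0 E_z E_x = \epsilon_0\left(\frac{Q}{4\pi\epsilon_0 R^2}\right)^2 \sin\theta\cos\theta\cos\phi,\\
T_{zy} &= \epsilon_0 E_z E_y = \epsilon_0\left(\frac{Q}{4\pi\epsilon_0 R^2}\right)^2 \sin\theta\cos\theta\sin\phi,\\
T_{zz} &= \frac{\epsilon_0}{2}\left(E_z^2 - E_x^2 - E_y^2\right) = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R^2}\right)^2 (\cos^2\theta - \sin^2\theta).
\end{aligned} \tag{8.22}$$

The net force is obviously in the $z$ direction, so it suffices to calculate

$$(\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a})_z = T_{zx}\,da_x + T_{zy}\,da_y + T_{zz}\,da_z = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R}\right)^2 \sin\theta\cos\theta\,d\theta\,d\phi.$$

The force on the “bowl” is therefore

$$F_{\text{bowl}} = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R}\right)^2 2\pi\int_0^{\pi/2} \sin\theta\cos\theta\,d\theta = \frac{1}{4\pi\epsilon_0}\frac{Q^2}{8R^2}. \tag{8.23}$$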

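Meanwhile, for the equatorial disk,

$$d\mathbf{a} = -r\,dr\,d\phi\,\hat{\mathbf{z}}, \tag{8.24}$$

and (since we are now inside the sphere)

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{Q}{R^3}\mathbf{r} = \frac{1}{4\pi\epsilon_0}\frac{Q}{R^3}\,r(\cos\phi\,\hat{\mathbf{x}} + \sin\phi\,\hat{\mathbf{y}}).$$

Thus

$$T_{zz} = \frac{\epsilon_0}{2}\left(E_z^2 - E_x^2 - E_y^2\right) = -\frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R^3}\right)^2 r^2,$$

and hence

$$(\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a})_z = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R^3}\right)^2 r^3\,dr\,d\phi.$$

The force on the disk is therefore

$$F_{\text{disk}} = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0 R^3}\right)^2 2\pi\int_0^R r^3\,dr = \frac{1}{4\pi\epsilon_0}\frac{Q^2}{16R^2}. \tag{8.25}$$

Combining Eqs. 8.23 and 8.25, I conclude that the net force on the
northern hemisphere is

$$F = \frac{1}{4\pi\epsilon_0}\frac{3Q^2}{16R^2}. \tag{8.26}$$

Incidentally, in applying Eq. 8.21, any volume that encloses all of
the charge in question (and no other charge) will do the job. For
example, in the present case we could use the whole region $z > 0$. In
that case the boundary surface consists of the entire $xy$ plane (plus a
hemisphere at $r = \infty$, but $E = 0$ out there, so it contributes nothing).
In place of the “bowl,” we now have the outer portion of the plane
($r > R$). Here,

$$T_{zz} = -\frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0}\right)^2 \frac{1}{r^4}$$

(Eq. 8.22 with $\theta = \pi/2$ and $R \to r$), and $d\mathbf{a}$ is given by Eq. 8.24, so

$$(\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a})_z = \frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0}\right)^2 \frac{1}{r^3}\,dr\,d\phi,$$

and the contribution from the plane for $r > R$ is

$$\frac{\epsilon_0}{2}\left(\frac{Q}{4\pi\epsilon_0}\right)^2 2\pi\int_R^\infty \frac{1}{r^3}\,dr = \frac{1}{4\pi\epsilon_0}\frac{Q^2}{8R^2},$$

the same as for the bowl (Eq. 8.23).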

I hope you didn’t get too bogged down in the details of Ex. 8.2. If so, take 
a moment to appreciate what happened. We were calculating the force on 
a solid object, but instead of doing a volume integral, as you might expect, 
Eq. 8.21 allowed us to set it up as a surface integral; somehow the stress 
tensor sniffs out what is going on inside. 
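The surface integrals of Ex. 8.2 are easy to check numerically. The Python sketch below is one hedged way to do it: it builds the electric part of the stress tensor (Eq. 8.17 with $\mathbf{B} = 0$) at each surface point, dots it with $d\mathbf{a}$, and sums the $z$ components using a midpoint rule. The units ($\epsilon_0 = Q = R = 1$), the grid sizes, and all function names are assumptions made for illustration; in these units the analytic answer, Eq. 8.26, is $3/(64\pi)$.

```python
import math

EPS0, Q, R = 1.0, 1.0, 1.0          # convenient units (assumption for this sketch)
K = 1.0 / (4.0 * math.pi * EPS0)    # Coulomb constant in these units

def electric_stress(E):
    """Electric part of the Maxwell stress tensor (Eq. 8.17 with B = 0)."""
    E2 = sum(c * c for c in E)
    return [[EPS0 * (E[i] * E[j] - (0.5 * E2 if i == j else 0.0))
             for j in range(3)] for i in range(3)]

def z_force_element(E, da):
    """z component of T . da for one patch of surface."""
    T = electric_stress(E)
    return sum(T[2][j] * da[j] for j in range(3))

def bowl_force(n_theta=400, n_phi=8):
    """Midpoint-rule sum over the hemispherical bowl at radius R."""
    d_theta, d_phi = (math.pi / 2) / n_theta, 2 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            r_hat = (math.sin(theta) * math.cos(phi),
                     math.sin(theta) * math.sin(phi),
                     math.cos(theta))
            E = [K * Q / R**2 * c for c in r_hat]
            da = [R**2 * math.sin(theta) * d_theta * d_phi * c for c in r_hat]
            total += z_force_element(E, da)
    return total

def disk_force(n_r=400, n_phi=8):
    """Midpoint-rule sum over the equatorial disk, with da = -r dr dphi z-hat."""
    d_r, d_phi = R / n_r, 2 * math.pi / n_phi
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * d_r
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            E = [K * Q * r / R**3 * math.cos(phi), K * Q * r / R**3 * math.sin(phi), 0.0]
            da = [0.0, 0.0, -r * d_r * d_phi]
            total += z_force_element(E, da)
    return total

F_bowl, F_disk = bowl_force(), disk_force()
print(F_bowl, F_disk, F_bowl + F_disk)  # compare the sum with 3 / (64 * math.pi)
```

The bowl contributes twice as much as the disk, in agreement with Eqs. 8.23 and 8.25.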


! Problem 8.5. Calculate the force of magnetic attraction between the northern and southern
hemispheres of a uniformly charged spinning spherical shell, with radius $R$, angular velocity
$\omega$, and surface charge density $\sigma$. [This is the same as Prob. 5.47, but this time use the Maxwell
stress tensor and Eq. 8.21.]


Problem 8.6. 


(a) Consider two equal point charges q, separated by a distance 2a. Construct the plane 
equidistant from the two charges. By integrating Maxwell's stress tensor over this plane, 
determine the force of one charge on the other. 

(b) Do the same for charges that are opposite in sign. 


8.2.3 Conservation of Momentum 


According to Newton’s second law, the force on an object is equal to the 
rate of change of its momentum: 


$$\mathbf{F} = \frac{d\mathbf{p}_{\text{mech}}}{dt}.$$

Equation 8.20 can therefore be written in the form⁸


8 Let’s assume the only forces acting are electromagnetic. You can include other forces if you 
like — both here and in the discussion of energy conservation — but they are just a distraction 
from the essential story.

$$\frac{d\mathbf{p}_{\text{mech}}}{dt} = -\epsilon_0\mu_0\frac{d}{dt}\int_{\mathcal{V}}\mathbf{S}\,d\tau + \oint_{\mathcal{S}}\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a}, \tag{8.27}$$

where $\mathbf{p}_{\text{mech}}$ is the (mechanical) momentum ($\sum m_i\mathbf{v}_i$) of the particles in
volume V.⁹ This expression is similar in structure to Poynting’s theorem
(Eq. 8.11), and it invites an analogous interpretation: the first integral
represents momentum stored in the fields,

$$\mathbf{p} = \mu_0\epsilon_0\int_{\mathcal{V}}\mathbf{S}\,d\tau, \tag{8.28}$$


while the second integral is the momentum per unit time flowing in through 
the surface. 

Equation 8.27 is the statement of conservation of momentum in electro- 
dynamics: if the mechanical momentum increases, either the field momen- 
tum decreases, or else the fields are carrying momentum into the volume 
through the surface. The momentum density in the fields is evidently 


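$$\mathbf{g} = \mu_0\epsilon_0\mathbf{S} = \epsilon_0(\mathbf{E}\times\mathbf{B}), \tag{8.29}$$

and the momentum flux transported by the fields is $-\overleftrightarrow{\mathbf{T}}$ (more precisely,
$-\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a}$ is the electromagnetic momentum per unit time passing through
the area da).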

If the mechanical momentum in V is not changing (for example, if we
are talking about a region of empty space), then

$$\int\frac{\partial\mathbf{g}}{\partial t}\,d\tau = \oint\overleftrightarrow{\mathbf{T}}\cdot d\mathbf{a} = \int\nabla\cdot\overleftrightarrow{\mathbf{T}}\,d\tau,$$

and hence

$$\frac{\partial\mathbf{g}}{\partial t} = \nabla\cdot\overleftrightarrow{\mathbf{T}}. \tag{8.30}$$

This is the “continuity equation” for electromagnetic momentum, with g
(momentum density) in the role of ρ (charge density) and $-\overleftrightarrow{\mathbf{T}}$ playing the
part of J. Equation 8.30 expresses the local conservation of field momentum.
But in general (when there are charges around) the field momentum by
itself, and the mechanical momentum by itself, are not conserved: charges
and fields exchange momentum, and only the total is conserved.

Notice that the Poynting vector has appeared in two quite different
roles: S itself is the energy per unit area, per unit time, transported by
the electromagnetic fields, while $\mu_0\epsilon_0\mathbf{S}$ is the momentum per unit volume
stored in those fields.¹⁰ Similarly, $\overleftrightarrow{\mathbf{T}}$ plays a dual role: $\overleftrightarrow{\mathbf{T}}$ itself is the
electromagnetic stress (force per unit area) acting on a surface, and $-\overleftrightarrow{\mathbf{T}}$
describes the flow of momentum (it is the momentum current density)
carried by the fields.
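
To make the dual role of the Poynting vector concrete, here is a small Python sketch (my own illustration, with assumed field values and hypothetical helper names `cross`, `poynting`, and `momentum_density`). It evaluates S and g for one pair of uniform fields and confirms that g equals μ0ε0 S component by component, as Eq. 8.29 requires.

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, F/m
MU0 = 1.25663706212e-6    # vacuum permeability, N/A^2


def cross(a, b):
    """Cross product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def poynting(E, B):
    """Energy flux S = (E x B) / mu0, in W/m^2."""
    return tuple(c / MU0 for c in cross(E, B))


def momentum_density(E, B):
    """Field momentum density g = eps0 (E x B), in kg/(m^2 s)."""
    return tuple(EPS0 * c for c in cross(E, B))


E = (1.0e5, 0.0, 0.0)   # assumed sample field: 10^5 V/m along x
B = (0.0, 0.2, 0.0)     # assumed sample field: 0.2 T along y
S = poynting(E, B)
g = momentum_density(E, B)
print("S =", S)
print("g =", g)
print("mu0*eps0*S =", tuple(MU0 * EPS0 * s for s in S))
```

Both vectors point along z here; they differ only by the factor μ0ε0 = 1/c², which is why field momentum is so small in everyday situations.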


9 Again, to avoid distracting complications I assume no particles enter or leave V.
10 This is no coincidence; see R. P. Feynman, R. B. Leighton, and M. Sands, The Feynman
Lectures on Physics, Vol. 2, Addison-Wesley, Reading, MA (1964), Sect. 27-6.

Example 8.3. A long resistanceless coaxial cable, of length l, consists
of an inner conductor (radius a) and an outer conductor (radius b). It is
connected to a battery at one end and a resistor at the other (Fig. 8.5).
The inner conductor carries a uniform charge per unit length λ, and
a steady current I to the right; the outer conductor has the opposite
charge and current. What is the electromagnetic momentum stored in
the fields?


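Solution. The fields are

$$\mathbf{E} = \frac{1}{2\pi\epsilon_0}\frac{\lambda}{s}\,\hat{\mathbf{s}}, \qquad \mathbf{B} = \frac{\mu_0}{2\pi}\frac{I}{s}\,\hat{\boldsymbol{\phi}}.$$

The Poynting vector is therefore

$$\mathbf{S} = \frac{\lambda I}{4\pi^2\epsilon_0 s^2}\,\hat{\mathbf{z}}.$$

So energy is flowing down the line, from the battery to the resistor. In
fact, the power transported is

$$P = \int\mathbf{S}\cdot d\mathbf{a} = \frac{\lambda I}{4\pi^2\epsilon_0}\int_a^b\frac{1}{s^2}\,2\pi s\,ds = \frac{\lambda I}{2\pi\epsilon_0}\ln(b/a) = IV,$$

as it should be.
The momentum in the fields is

$$\mathbf{p} = \mu_0\epsilon_0\int\mathbf{S}\,d\tau = \frac{\mu_0\lambda I}{4\pi^2}\,\hat{\mathbf{z}}\int_a^b\frac{1}{s^2}\,l\,2\pi s\,ds = \frac{\mu_0\lambda I l}{2\pi}\ln(b/a)\,\hat{\mathbf{z}} = \frac{IVl}{c^2}\,\hat{\mathbf{z}}.$$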
This is an astonishing result. The cable is not moving, E and B are 
static, and yet we are asked to believe that there is momentum in the 
fields. If something tells you this cannot be the whole story, you have 
sound intuitions. But the resolution of this paradox will have to await 
Chapter 12 (Ex. 12.12). 

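Suppose now that we turn up the resistance, so the current decreases.
The changing magnetic field will induce an electric field (Eq. 7.29):

$$\mathbf{E} = \left[\frac{\mu_0}{2\pi}\frac{dI}{dt}\ln s + K\right]\hat{\mathbf{z}}.$$

This field exerts a force on ±λ:

$$\mathbf{F} = \lambda l\left[\frac{\mu_0}{2\pi}\frac{dI}{dt}\ln a + K\right]\hat{\mathbf{z}} - \lambda l\left[\frac{\mu_0}{2\pi}\frac{dI}{dt}\ln b + K\right]\hat{\mathbf{z}} = -\frac{\mu_0}{2\pi}\lambda l\frac{dI}{dt}\ln(b/a)\,\hat{\mathbf{z}}.$$

The total momentum imparted to the cable, as the current drops from
I to 0, is therefore

$$\mathbf{p}_{\text{mech}} = \int\mathbf{F}\,dt = \frac{\mu_0\lambda I l}{2\pi}\ln(b/a)\,\hat{\mathbf{z}},$$

which is precisely the momentum originally stored in the fields.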
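
The bookkeeping in Ex. 8.3 is easy to verify numerically. The Python sketch below is an illustration only: the function names, the cable dimensions, and the linear shape of the current ramp are all assumptions of mine. It computes the field momentum from the closed form, accumulates the impulse F dt delivered while the current falls to zero, and also checks the equivalent expression IVl/c².

```python
import math

EPS0 = 8.8541878128e-12   # vacuum permittivity, F/m
MU0 = 1.25663706212e-6    # vacuum permeability, N/A^2


def field_momentum(lam, current, length, a, b):
    """z component of the field momentum, mu0*lam*I*l*ln(b/a)/(2*pi)."""
    return MU0 * lam * current * length * math.log(b / a) / (2 * math.pi)


def impulse_during_ramp(lam, current, length, a, b, ramp_time=1.0, steps=1000):
    """Sum F dt while the current falls linearly from `current` to zero."""
    dt = ramp_time / steps
    dI_dt = -current / ramp_time  # constant slope (the ramp shape is an assumption)
    force = -(MU0 / (2 * math.pi)) * lam * length * dI_dt * math.log(b / a)
    return sum(force * dt for _ in range(steps))


# Illustrative values (assumed): 1 nC/m, 2 A, 10 m of cable, radii 1 mm and 5 mm.
lam, current, length, a, b = 1.0e-9, 2.0, 10.0, 1.0e-3, 5.0e-3
p_fields = field_momentum(lam, current, length, a, b)
p_mech = impulse_during_ramp(lam, current, length, a, b)
voltage = lam * math.log(b / a) / (2 * math.pi * EPS0)   # potential difference V
p_from_power = current * voltage * length * MU0 * EPS0   # I V l / c^2
print(p_fields, p_mech, p_from_power)
```

The result does not depend on how quickly the current is ramped down, since only the total change in I enters the impulse.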


Problem 8.7. Imagine two parallel infinite sheets, carrying uniform surface charge +σ (on
the sheet at z = d) and −σ (at z = 0). They are moving in the y direction at constant speed
v (as in Prob. 5.18).


(a) What is the electromagnetic momentum in a region of area A?

(b) Now suppose the top sheet moves slowly down (speed u) until it reaches the bottom sheet,
so the fields disappear. By calculating the total force on the charge (q = σA), show that
the impulse delivered to the sheet is equal to the momentum originally stored in the fields.
[Hint: As the upper plate passes by, the magnetic field drops to zero, inducing an electric
field that delivers an impulse to the lower plate.]


Problem 8.8. A charged parallel-plate capacitor (with uniform electric field $\mathbf{E} = E\,\hat{\mathbf{z}}$) is
placed in a uniform magnetic field $\mathbf{B} = B\,\hat{\mathbf{x}}$, as shown in Fig. 8.6.


(a) Find the electromagnetic momentum in the space between the plates.

(b) Now a resistive wire is connected between the plates, along the z-axis, so that the capacitor
slowly discharges. The current through the wire will experience a magnetic force; what is
the total impulse delivered to the system, during the discharge?¹¹


Fig. 8.6 


Problem 8.9. Consider an infinite parallel-plate capacitor, with the lower plate (at z =
−d/2) carrying surface charge density −σ, and the upper plate (at z = +d/2) carrying charge
density +σ.


11 There is much more to be said about this problem, so don’t get too excited if your answers
to (a) and (b) appear to be consistent. See D. Babson et al., Am. J. Phys. 77, 826 (2009).

(a) Determine all nine elements of the stress tensor, in the region between the plates. Display
your answer as a 3 × 3 matrix:

$$\begin{pmatrix} T_{xx} & T_{xy} & T_{xz} \\ T_{yx} & T_{yy} & T_{yz} \\ T_{zx} & T_{zy} & T_{zz} \end{pmatrix}$$

(b) Use Eq. 8.21 to determine the electromagnetic force per unit area on the top plate. Compare
Eq. 2.51.

(c) What is the electromagnetic momentum per unit area, per unit time, crossing the xy plane
(or any other plane parallel to that one, between the plates)?

(d) Of course, there must be mechanical forces holding the plates apart — perhaps the capacitor 
is filled with insulating material under pressure. Suppose we suddenly remove the insulator; 
the momentum flux (c) is now absorbed by the plates, and they begin to move. Find the 
momentum per unit time delivered to the top plate (which is to say, the force acting on 
it) and compare your answer to (b). [Note: This is not an additional force, but rather an 
alternative way of calculating the same force — in (b) we got it from the force law, and in 
(d) we do it by conservation of momentum.] 


8.2.4 Angular Momentum 


By now, the electromagnetic fields (which started out as mediators of forces 
between charges) have taken on a life of their own. They carry energy 
(Eq. 8.5) 


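$$u = \frac{1}{2}\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right), \tag{8.31}$$

and momentum (Eq. 8.29)

$$\mathbf{g} = \epsilon_0(\mathbf{E}\times\mathbf{B}), \tag{8.32}$$

and, for that matter, angular momentum

$$\boldsymbol{\ell} = \mathbf{r}\times\mathbf{g} = \epsilon_0[\mathbf{r}\times(\mathbf{E}\times\mathbf{B})]. \tag{8.33}$$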


Even perfectly static fields can harbor momentum and angular momentum, 
as long as E x B is nonzero, and it is only when these field contributions are 
included that the conservation laws are sustained. 


Example 8.4. Imagine a very long solenoid with radius R, n turns per
unit length, and current I. Coaxial with the solenoid are two long
cylindrical (nonconducting) shells of length l: one, inside the solenoid
at radius a, carries a charge +Q, uniformly distributed over its surface;
the other, outside the solenoid at radius b, carries charge −Q (see
Fig. 8.7; l is supposed to be much greater than b). When the current
in the solenoid is gradually reduced, the cylinders begin to rotate, as
we found in Ex. 7.9. Question: Where does the angular momentum
come from?¹²


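Solution. It was initially stored in the fields. Before the current was
switched off, there was an electric field,

$$\mathbf{E} = \frac{Q}{2\pi\epsilon_0 l}\frac{1}{s}\,\hat{\mathbf{s}} \quad (a < s < b),$$

in the region between the cylinders, and a magnetic field,

$$\mathbf{B} = \mu_0 nI\,\hat{\mathbf{z}} \quad (s < R),$$

inside the solenoid. The momentum density (Eq. 8.29) was therefore

$$\mathbf{g} = -\frac{\mu_0 nIQ}{2\pi l s}\,\hat{\boldsymbol{\phi}},$$

in the region a < s < R. The z component of the angular momentum
density was

$$(\mathbf{r}\times\mathbf{g})_z = -\frac{\mu_0 nIQ}{2\pi l},$$

which is constant (independent of s). To get the total angular momentum
in the fields, we simply multiply by the volume, π(R² − a²)l:¹³

$$\mathbf{L}_{\text{em}} = -\frac{1}{2}\mu_0 nIQ(R^2 - a^2)\,\hat{\mathbf{z}}. \tag{8.34}$$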


When the current is turned off, the changing magnetic field induces 
a circumferential electric field, given by Faraday’s law: 


12 This is a variation on the “Feynman disk paradox” (R. P. Feynman, R. B. Leighton, and
M. Sands, The Feynman Lectures on Physics, Vol. 2, Addison-Wesley, Reading, MA (1964),
pp. 17-5) suggested by F. L. Boos, Jr., Am. J. Phys. 52, 756 (1984). A similar model was
proposed earlier by R. H. Romer, Am. J. Phys. 34, 772 (1966). For further references, see
T.-C. E. Ma, Am. J. Phys. 54, 949 (1986).
13 The radial component integrates to zero, by symmetry.

$$\mathbf{E} = \begin{cases} -\dfrac{1}{2}\mu_0 n\dfrac{dI}{dt}\dfrac{R^2}{s}\,\hat{\boldsymbol{\phi}} & (s > R), \\[2ex] -\dfrac{1}{2}\mu_0 n\dfrac{dI}{dt}\,s\,\hat{\boldsymbol{\phi}} & (s < R). \end{cases}$$

Thus the torque on the outer cylinder is

$$\mathbf{N}_b = \mathbf{r}\times(-Q\mathbf{E}) = \frac{1}{2}\mu_0 nQR^2\frac{dI}{dt}\,\hat{\mathbf{z}},$$

and it picks up an angular momentum

$$\mathbf{L}_b = \frac{1}{2}\mu_0 nQR^2\,\hat{\mathbf{z}}\int_I^0\frac{dI}{dt}\,dt = -\frac{1}{2}\mu_0 nIQR^2\,\hat{\mathbf{z}}.$$

Similarly, the torque on the inner cylinder is

$$\mathbf{N}_a = -\frac{1}{2}\mu_0 nQa^2\frac{dI}{dt}\,\hat{\mathbf{z}},$$

and its angular momentum increase is

$$\mathbf{L}_a = \frac{1}{2}\mu_0 nIQa^2\,\hat{\mathbf{z}}.$$

So it all works out: $\mathbf{L}_{\text{em}} = \mathbf{L}_a + \mathbf{L}_b$. The angular momentum lost
by the fields is precisely equal to the angular momentum gained by
the cylinders, and the total angular momentum (fields plus matter) is
conserved.
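
As a check on Ex. 8.4, the following Python sketch integrates the (constant) angular momentum density over the region a < s < R and compares the total with the sum of the angular momenta picked up by the two cylinders. The function names and the numerical values are assumptions made only for illustration; only z components are tracked.

```python
import math

MU0 = 1.25663706212e-6  # vacuum permeability, N/A^2


def field_angular_momentum(n, current, Q, R, a, length, steps=10000):
    """Integrate the z component of r x g over a < s < R (midpoint rule)."""
    density = -MU0 * n * current * Q / (2 * math.pi * length)  # independent of s
    ds = (R - a) / steps
    total = 0.0
    for i in range(steps):
        s = a + (i + 0.5) * ds
        total += density * 2 * math.pi * s * length * ds  # thin cylindrical shell
    return total


def cylinder_angular_momenta(n, current, Q, R, a):
    """Return (L_a, L_b), the z components gained by the inner and outer cylinders."""
    L_a = 0.5 * MU0 * n * current * Q * a ** 2
    L_b = -0.5 * MU0 * n * current * Q * R ** 2
    return L_a, L_b


# Illustrative values (assumed): 1000 turns/m, 5 A, 1 microcoulomb, R = 5 cm, a = 2 cm, l = 1 m.
n, current, Q, R, a, length = 1000.0, 5.0, 1.0e-6, 0.05, 0.02, 1.0
L_em = field_angular_momentum(n, current, Q, R, a, length)
L_a, L_b = cylinder_angular_momenta(n, current, Q, R, a)
print("L_em =", L_em, " L_a + L_b =", L_a + L_b)
```

Notice that the outer radius b never enters: the field angular momentum lives only where both E and B are nonzero, between the inner cylinder and the solenoid.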


Caution: Field energy does not obey the superposition principle — the 
energy of a composite system is not the sum of the energies of its constituent 
parts. That’s because energy (Eq. 8.5) is quadratic in E and B. The fields 
themselves do satisfy the superposition principle, so, for instance, 


$$\mathbf{E} = \mathbf{E}_1 + \mathbf{E}_2,$$

and hence the electric part of the energy density is

$$u = \frac{1}{2}\epsilon_0(\mathbf{E}\cdot\mathbf{E}) = \frac{1}{2}\epsilon_0\left(E_1^2 + E_2^2 + 2\mathbf{E}_1\cdot\mathbf{E}_2\right) = u_1 + u_2 + \epsilon_0\,\mathbf{E}_1\cdot\mathbf{E}_2.$$

The cross term spoils it. To take a trivial example, imagine a sphere of
charge Q; the same sphere with double the charge would have twice the
field, but four times the energy. And the same goes for the Poynting vector,
momentum, angular momentum, and the stress tensor, since these are all
quadratic in the fields.
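
A few lines of Python make the failure of superposition for energy explicit. This is only an illustrative sketch: the two field vectors are arbitrary assumed values, and `dot` and `energy_density` are hypothetical helper names.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def dot(a, b):
    """Dot product of two 3-component tuples."""
    return sum(x * y for x, y in zip(a, b))


def energy_density(E):
    """Electric energy density u = (eps0/2) E.E, in J/m^3."""
    return 0.5 * EPS0 * dot(E, E)


E1 = (100.0, 0.0, 0.0)    # assumed sample field, V/m
E2 = (50.0, 50.0, 0.0)    # assumed sample field, V/m
E_total = tuple(x + y for x, y in zip(E1, E2))

u1, u2 = energy_density(E1), energy_density(E2)
u_total = energy_density(E_total)
cross_term = EPS0 * dot(E1, E2)

print("u1 + u2              =", u1 + u2)
print("u_total              =", u_total)
print("u1 + u2 + cross term =", u1 + u2 + cross_term)
print("doubling E1 scales u1 by", energy_density(tuple(2 * x for x in E1)) / u1)
```

The cross term vanishes only when the two fields happen to be perpendicular at the point in question.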


Problem 8.10. In Ex. 8.4, suppose that instead of turning off the magnetic field (by reducing
I) we turn off the electric field, by connecting a weakly¹⁴ conducting radial spoke between
the cylinders. (We'll have to cut a slot in the solenoid, so the cylinders can still rotate freely.)
From the magnetic force on the current in the spoke, determine the total angular momentum


14 In Ex. 8.4 we turned the current off slowly, to keep things quasistatic; here we reduce the
electric field slowly to keep the displacement current negligible.

delivered to the cylinders, as they discharge (they are now rigidly connected, so they rotate
together). Compare the initial angular momentum stored in the fields (Eq. 8.34). (Notice that the
mechanism by which angular momentum is transferred from the fields to the cylinders is entirely
different in the two cases: in Ex. 8.4 it was Faraday’s law, but here it is the Lorentz force law.)


Problem 8.11. Two concentric spherical shells carry uniformly distributed charges +Q (at
radius a) and −Q (at radius b > a). They are immersed in a uniform magnetic field $\mathbf{B} = B_0\,\hat{\mathbf{z}}$.


(a) Find the angular momentum of the fields (with respect to the center). 
(b) Now the magnetic field is gradually turned off. Find the torque on each sphere, and the 
resulting angular momentum of the system. 


Problem 8.12.¹⁵ Imagine an iron sphere of radius R that carries a charge Q and a uniform
magnetization $\mathbf{M} = M\,\hat{\mathbf{z}}$. The sphere is initially at rest.


(a) Compute the angular momentum stored in the electromagnetic fields. 

(b) Suppose the sphere is gradually (and uniformly) demagnetized (perhaps by heating it up 
past the Curie point). Use Faraday’s law to determine the induced electric field, find the 
torque this field exerts on the sphere, and calculate the total angular momentum imparted 
to the sphere in the course of the demagnetization. 

(c) Suppose instead of demagnetizing the sphere we discharge it, by connecting a grounding 
wire to the north pole. Assume the current flows over the surface in such a way that the 
(surface) charge density remains uniform. Use the Lorentz force law to determine the torque 
on the sphere, and calculate the total angular momentum imparted to the sphere in the 
course of the discharge. (The magnetic field is discontinuous at the surface ... does this 
matter?) [Answer: (2/9) μ0 M Q R²]


8.3 Magnetic Forces Do No Work 


This¹⁶ is perhaps a good place to revisit the old paradox that magnetic forces
do no work (Eq. 5.11). What about that magnetic crane lifting the carcass 
of a junked car? Somebody is doing work on the car, and if it’s not the 
magnetic field, who is it? The car is ferromagnetic; in the presence of the 
magnetic field, it contains a lot of microscopic magnetic dipoles (spinning 
electrons, actually), all lined up. The resulting magnetization is equivalent 
to a bound current running around the surface, so let’s model the car as a 
circular current loop. 


15 This version of the Feynman disk paradox was proposed by N. L. Sharma, Am. J. Phys. 
56, 420 (1988); similar models were analyzed by E. M. Pugh and G. E. Pugh, Am. J. Phys. 
35, 153 (1967) and by R. H. Romer, Am. J. Phys. 35, 445 (1967). 

16 This section can be skipped without loss of continuity. I include it for those readers who
are disturbed by the notion that magnetic forces do no work.

Example 8.5. Picture the current loop as a nonconducting ring, of
line charge λ, rotating at angular velocity ω (Fig. 8.8). The upward
magnetic force on the loop is (Eq. 6.2)

$$F = 2\pi I a B_s, \tag{8.35}$$

where $B_s$ is the radial component of the magnet’s field,¹⁷ and I = λωa.
If the ring rises a distance dz (while the magnet itself stays put), the
work done on it is

$$dW = 2\pi a^2\lambda\omega B_s\,dz. \tag{8.36}$$


This increases the potential energy of the ring. Who did the work? 
Naively, it appears that the magnetic field is responsible, but we have 
already learned (Ex. 5.3) that this is not the case — as the ring rises, the 
magnetic force is perpendicular to the net velocity of the charges in the 
ring, so it does no work on them. 

At the same time, however, a motional emf is induced in the ring,
which opposes the flow of charge, and hence reduces its angular
velocity:

$$\mathcal{E} = -\frac{d\Phi}{dt}.$$

Here dΦ is the flux through the “ribbon” joining the ring at time t to
the ring at time t + dt (Fig. 8.9):

$$d\Phi = B_s\,2\pi a\,dz.$$

Now

$$\mathcal{E} = \oint\mathbf{f}\cdot d\mathbf{l} = f(2\pi a),$$

where f is the force per unit charge. So

$$f = -B_s\frac{dz}{dt}, \tag{8.37}$$

the force on a segment of length dl is fλ dl, the torque on the ring is

$$N = a\left(-B_s\frac{dz}{dt}\right)\lambda(2\pi a),$$

17 Note that the field has to be nonuniform, or it won’t lift the car at all. Also, stipulate for now
that the magnetic field remains constant as the loop rises.

and the work done (slowing the rotation) is N dφ = Nω dt, or

$$dW = -2\pi a^2\lambda\omega B_s\,dz. \tag{8.38}$$

The ring slows down, and the rotational energy it loses (Eq. 8.38)
is precisely equal to the potential energy it gains (Eq. 8.36). All the
magnetic field did was convert energy from one form to another. If
you'll permit some sloppy language, the work done by the vertical
component of the magnetic force (Eq. 8.35) is equal and opposite to
the work done by its horizontal component (Eq. 8.37).¹⁸


Example 8.6. What about the magnet? Is it completely passive in this
process? Suppose we model it as a big circular loop (radius b), resting
on a table and carrying a current $I_b$; the “junk car” is a relatively
small current loop (radius a), on the floor directly below, carrying a
current $I_a$ (Fig. 8.10). This time, just for a change, let’s assume both
currents are constant (we'll include a regulated power supply in each
loop¹⁹). Parallel currents attract, and we propose to lift the small loop
off the floor, keeping careful track of the work done and the agency
responsible.

Let’s start by adjusting the currents so the small ring just “floats,”
a distance h below the table, with the magnetic force exactly balancing
the weight ($m_a g$) of the little ring. I’ll let you calculate the magnetic
force (Prob. 8.13):


18 This argument is essentially the same as the one in Ex. 5.3, except that in this case I told
the story in terms of motional emf, instead of the Lorentz force law. But after all, the flux
rule is a consequence of the Lorentz force law.
19 The lower loop could be a single spinning electron, in which case quantum mechanics fixes
its angular momentum at ħ/2. It might appear that this sustains the current, with no need
for a power supply. I will return to this point, but for now let’s just keep quantum mechanics
out of it.

$$F_{\text{mag}} = \frac{3\pi}{2}\mu_0 I_a I_b\frac{a^2b^2h}{(b^2+h^2)^{5/2}} = m_a g. \tag{8.39}$$

Now the loop rises an infinitesimal distance dz; the work done is equal
to the gain in its potential energy

$$dW_g = m_a g\,dz = \frac{3\pi}{2}\mu_0 I_a I_b\frac{a^2b^2h}{(b^2+h^2)^{5/2}}\,dz. \tag{8.40}$$

Who did it? The magnetic field? No! The work was done by the
power supply that sustains the current in loop a (Ex. 5.3). As the loop
rises, a motional emf is induced in it. The flux through the loop is

$$\Phi_a = M I_b,$$

where M is the mutual inductance of the two loops:

$$M = \frac{\pi\mu_0}{2}\frac{a^2b^2}{(b^2+h^2)^{3/2}}$$

(Prob. 7.23). The emf is

$$\mathcal{E}_a = -\frac{d\Phi_a}{dt} = -I_b\frac{dM}{dt} = -I_b\frac{dM}{dh}\frac{dh}{dt} = -I_b\left(-\frac{3}{2}\right)\frac{\pi\mu_0}{2}\frac{a^2b^2}{(b^2+h^2)^{5/2}}\,2h\,\frac{(-dz)}{dt}.$$

The work done by the power supply (fighting against this motional
emf) is

$$dW_a = -\mathcal{E}_a I_a\,dt = \frac{3\pi}{2}\mu_0 I_a I_b\frac{a^2b^2h}{(b^2+h^2)^{5/2}}\,dz, \tag{8.41}$$

the same as the work done in lifting the loop (Eq. 8.40).
Meanwhile, however, a Faraday emf is induced in the upper loop,
due to the changing flux from the lower loop:

$$\Phi_b = M I_a \;\Rightarrow\; \mathcal{E}_b = -I_a\frac{dM}{dt},$$

and the work done by the power supply in ring b (to sustain the
current $I_b$) is

$$dW_b = -\mathcal{E}_b I_b\,dt = \frac{3\pi}{2}\mu_0 I_a I_b\frac{a^2b^2h}{(b^2+h^2)^{5/2}}\,dz, \tag{8.42}$$

exactly the same as $dW_a$. That’s embarrassing: the power supplies have
done twice as much work as was necessary to lift the junk car! Where
did the “wasted” energy go? Answer: It increased the energy stored in
the fields. The energy in a system of two current-carrying loops is (see
Prob. 8.14)

$$U = \frac{1}{2}L_a I_a^2 + \frac{1}{2}L_b I_b^2 + M I_a I_b, \tag{8.43}$$

so

$$dU = I_a I_b\frac{dM}{dt}\,dt = dW_b.$$



Remarkably, all four energy increments are the same. If we care to 
apportion things this way, the power supply in loop a contributes the 
energy necessary to lift the lower ring, while the power supply in loop 
b provides the extra energy for the fields. If all we’re interested in is 
the work done to raise the ring, we can ignore the upper loop (and the 
energy in the fields) altogether. 
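
The equality of the energy increments in Ex. 8.6 can be tested numerically by differencing the mutual inductance directly, rather than trusting the derivative by hand. The sketch below is in Python; the loop sizes, the currents, and the function names are assumptions chosen only for illustration. It compares the work of lifting, F dz from Eq. 8.39, with the increment I_a I_b dM that appears in dW_a, dW_b, and dU.

```python
import math

MU0 = 1.25663706212e-6  # vacuum permeability, N/A^2


def mutual_inductance(a, b, h):
    """Mutual inductance of a small loop (radius a) a distance h below a big loop (radius b)."""
    return math.pi * MU0 * a ** 2 * b ** 2 / (2 * (b ** 2 + h ** 2) ** 1.5)


def magnetic_force(I_a, I_b, a, b, h):
    """Attractive force on the small loop, Eq. 8.39."""
    return 1.5 * math.pi * MU0 * I_a * I_b * a ** 2 * b ** 2 * h / (b ** 2 + h ** 2) ** 2.5


# Illustrative values (assumed): 10 A and 100 A, radii 5 cm and 1 m, separation 30 cm.
I_a, I_b, a, b, h = 10.0, 100.0, 0.05, 1.0, 0.3
dz = 1.0e-6  # small upward displacement of the lower loop, so h decreases by dz

dM = mutual_inductance(a, b, h - dz) - mutual_inductance(a, b, h)
dW_g = magnetic_force(I_a, I_b, a, b, h) * dz   # work of lifting (Eq. 8.40)
dW_supply = I_a * I_b * dM                      # common value of dW_a, dW_b, and dU
print("F dz        =", dW_g)
print("I_a I_b dM  =", dW_supply)
```

The two numbers agree to within the finite-difference error, which shrinks as dz is made smaller.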


Example 8.7. In both these models, the magnet itself was stationary. 
That’s like lifting a paper clip by holding a magnet over it. But in the 
case of the magnetic crane, the car stays in contact with the magnet, 
which is attached to a cable that lifts the whole works. As a model, we 
might stick the upper loop in a big box, the lower loop in a little box, 
and crank up the currents so the force of attraction is much greater 
than $m_a g$; the two boxes snap together, and we attach a string to the
upper box and pull up on it (Fig. 8.11). 

The same old mechanism (Ex. 5.3) prevails: as the lower loop rises, 
the magnetic force tilts backward; its vertical component lifts the loop, 
but its horizontal component opposes the current, and no net work is 
done. This time, however, the motional emf is perfectly balanced by the 
Faraday emf fighting to keep the current going — the flux through the 
lower loop is not changing. (If you like, the flux is increasing because 
the loop is moving upward, into a region of higher magnetic field, 
but it is decreasing because the magnetic field of the upper loop — at 
any given point in space — is decreasing as that loop moves up. See 
Prob. 7.69.) No power supply is needed to sustain the current (and 
for that matter, no power supply is required in the upper loop either, 
since the energy in the fields is not changing). Who did the work to lift 
the car? The person pulling up on the rope, obviously. The role of the 
magnetic field was merely to transmit this energy to the car, via the 
vertical component of the magnetic force. But the magnetic field itself 
(as always) did no work. 


The fact that magnetic forces do no work follows directly from the 
Lorentz force law, so if you think you have discovered an exception, you're 
going to have to explain why that law is incorrect. For example, if magnetic
monopoles exist, the force on a particle with electric charge $q_e$ and magnetic
charge $q_m$ becomes (Prob. 7.40)

$$\mathbf{F} = q_e(\mathbf{E} + \mathbf{v}\times\mathbf{B}) + q_m(\mathbf{B} - \epsilon_0\mu_0\,\mathbf{v}\times\mathbf{E}). \tag{8.44}$$


In that case, magnetic fields can do work ... but only on magnetic charges. So 
unless your car is made of monopoles (I don’t think so), this doesn’t solve 
the problem. 

A somewhat less radical possibility is that in addition to electric charges
there exist permanent point magnetic dipoles (electrons?), whose dipole
moment $\mathbf{m}_i$ is not associated with any electric current, but simply is. The
Lorentz force law would then acquire an extra term

$$\mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}) + \nabla(\mathbf{m}_i\cdot\mathbf{B}).$$

The magnetic field could do work on these “intrinsic” dipoles (which
experience no motional or Faraday emf, since they enclose no flux). I don’t
know whether a consistent theory can be constructed in this way,²⁰ but
in any event it is not classical electrodynamics, which is predicated on
Ampère’s assumption that all magnetic phenomena are due to electric
charges in motion, and point magnetic dipoles must be interpreted as the
limits of tiny current loops.

Of course, the real world does include particles with intrinsic spin, but
they are governed by the laws of quantum mechanics, not classical mechanics.
You can ask, if you like, whether quantum electrodynamics allows
magnetic forces to do work. Unfortunately, this is not an easy question
to address; “force” and “work” are not concepts that arise naturally in
quantum mechanics. In classical electrodynamics, though, the answer is
unequivocal: magnetic forces do no work. Period.


Problem 8.13. Derive Eq. 8.39. [Hint: Treat the lower loop as a magnetic dipole.] 


Problem 8.14. Derive Eq. 8.43. [Hint: Use the method of Section 7.2.4, building the two cur- 
rents up from zero to their final values. ] 


More Problems on Chapter 8 


Problem 8.15.²¹ A very long solenoid of radius a, with n turns per unit length, carries a current
$I_s$. Coaxial with the solenoid, at radius b ≫ a, is a circular ring of wire, with resistance R. When
the current in the solenoid is (gradually) decreased, a current $I_r$ is induced in the ring.


(a) Calculate $I_r$, in terms of $dI_s/dt$.


20 Semiclassical theories of intrinsic spin (and the associated intrinsic magnetic dipole
moment) have been proposed from time to time; see, for example, J. Barandes, Found. Phys.
51, 79 (2021). They tend to be cumbersome and implausible, but never mind: this is certainly
something worth exploring.

21 For extensive discussion, see M. A. Heald, Am. J. Phys. 56, 540 (1988).

(b) Find the induced electric field as a function of s (the distance from the axis), and use this
result to determine the work per unit time it does on the ring. Check that $P = I_r^2 R$.²²

(c) The power delivered to the ring must come from the solenoid. Confirm this by calculating
the Poynting vector just outside the solenoid (the electric field is due to the changing flux in
the solenoid; the magnetic field is due to the current in the ring). Integrate over the entire
surface of the solenoid, and check that you recover the correct total power.


Problem 8.16. An infinitely long cylindrical tube, of radius a, moves at constant speed v
along its axis. It carries a net charge per unit length λ, uniformly distributed over its surface.
Surrounding it, at radius b, is another cylinder, moving with the same velocity but carrying the
opposite charge (−λ). Find:


(a) the energy per unit length stored in the fields; 

(b) the momentum per unit length in the fields; 

(c) the energy per unit time transported by the fields across a plane perpendicular to the 
cylinders. 


Problem 8.17. A point charge q is located at the center of a toroidal coil of rectangular cross
section, inner radius a, outer radius a + w, and height l, which carries a total of N tightly
wound turns and current I.


(a) Find the electromagnetic momentum p of this configuration, assuming that w and l are
both much less than a (so you can ignore the variation of the fields over the cross section).

(b) Now the current in the toroid is turned off, quickly enough that the point charge does not 
move appreciably as the magnetic field drops to zero. Show that the impulse imparted to q 
is equal to the momentum originally stored in the electromagnetic fields. [Hint: You might 
want to refer to Prob. 7.20.] 


Problem 8.18.²³ A sphere of radius R carries a uniform polarization P and a uniform
magnetization M (not necessarily in the same direction). Find the electromagnetic momentum of
this configuration. [Answer: (4/9)πμ0R³(M × P).]


Problem 8.19.²⁴ Picture the electron as a uniformly charged spherical shell, with charge e
and radius R, spinning at angular velocity ω.


(a) Calculate the total energy contained in the electromagnetic fields. 

(b) Calculate the total angular momentum contained in the fields. 

(c) According to the Einstein formula (E = mc²), the energy in the fields should contribute to
the mass of the electron. Lorentz and others speculated that the entire mass of the electron
might be accounted for in this way: $U_{\text{em}} = m_e c^2$. Suppose, moreover, that the electron’s
spin angular momentum is entirely attributable to the electromagnetic fields: $L_{\text{em}} = \hbar/2$.


22 This part was suggested by Sami Al-Jaber.
23 For an interesting discussion and references, see R. H. Romer, Am. J. Phys. 63, 777 (1995).
24 See J. Higbie, Am. J. Phys. 56, 378 (1988).

On these two assumptions, determine the radius and angular velocity of the electron. What
is their product, ωR? Does this classical model make sense?


Problem 8.20. Work out the formulas for u, S, g, and $\overleftrightarrow{\mathbf{T}}$ in the presence of magnetic charge.
[Hint: Start with the generalized Maxwell equations (Eq. 7.53) and Lorentz force law (Eq. 8.44), 
and follow the derivations in Sections 8.1.2, 8.2.2, and 8.2.3.] 


Problem 8.21.²⁵ Suppose you had an electric charge $q_e$ and a magnetic monopole $q_m$. The
field of the electric charge is

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{q_e}{𝓇^2}\,\hat{\boldsymbol{𝓇}}$$

(of course), and the field of the magnetic monopole is

$$\mathbf{B} = \frac{\mu_0}{4\pi}\frac{q_m}{𝓇^2}\,\hat{\boldsymbol{𝓇}}.$$

Find the total angular momentum stored in the fields, if the two charges are separated by a
distance d. [Answer: $(\mu_0/4\pi)\,q_e q_m$.]²⁶ What does the conserved quantity Q in Prob. 5.48
represent, physically?


Problem 8.22. A magnetic dipole $\mathbf{m} = m\,\hat{\mathbf{z}}$ is at rest at the origin; an electric charge $q_e$ is at
rest at r. Find the angular momentum in their fields. [Hint: The fastest way to get it is to exploit 
the result of Prob. 8.21.] 


Problem 8.23. Consider a stationary ideal magnetic dipole m in a static electric field E. Show 
that the fields carry momentum 


$$\mathbf{p} = -\epsilon_0\mu_0(\mathbf{m}\times\mathbf{E}). \tag{8.45}$$

Here E is evaluated at the location of the dipole. We stipulate that the electric field is localized (it
goes to zero at infinity), otherwise the momentum is ill defined (the integral does not converge).
[Hint: There are several ways to do this. The simplest is to start with $\mathbf{p} = \epsilon_0\int(\mathbf{E}\times\mathbf{B})\,d\tau$,
write $\mathbf{E} = -\nabla V$, and use integration by parts to show that

$$\mathbf{p} = \epsilon_0\mu_0\int V\mathbf{J}\,d\tau. \tag{8.46}$$

25 This system is known as Thomson’s dipole. See I. Adawi, Am. J. Phys. 44, 762 (1976)
and Phys. Rev. D 31, 3301 (1985), and K. R. Brownstein, Am. J. Phys. 57, 420 (1989), for
discussion and references.

26 Note that this result is independent of the separation distance d! It points from $q_e$ toward
$q_m$. In quantum mechanics, angular momentum comes in half-integer multiples of ħ, so
this result suggests that if magnetic monopoles exist, electric and magnetic charge must be
quantized: $\mu_0 q_e q_m/4\pi = n\hbar/2$, for n = 1, 2, 3, ..., an idea first proposed by Dirac in 1931. If
even one monopole is lurking somewhere in the Universe, this would “explain” why electric
charge comes in discrete units. (However, see D. Singleton, Am. J. Phys. 66, 697 (1998) for
a cautionary note.)

So far, this is valid for any localized static configuration. For a current confined to an infinitesimal
neighborhood of the origin we can approximate $V(\mathbf{r}) \approx V(\mathbf{0}) - \mathbf{E}(\mathbf{0})\cdot\mathbf{r}$. Treat the dipole
as a current loop, and use Eqs. 5.82 and 1.108.]²⁷


Problem 8.24. Because the cylinders in Ex. 8.4 are left rotating (at angular velocities $\omega_a$ and
$\omega_b$, say), there is actually a residual magnetic field, and hence angular momentum in the
fields, even after the current in the solenoid has been extinguished. If the cylinders are heavy,
this correction will be negligible, but it is interesting to do the problem without making that
assumption.²⁸


(a) Calculate (in terms of $\omega_a$ and $\omega_b$) the final angular momentum in the fields. [Define
$\boldsymbol{\omega} = \omega\,\hat{\mathbf{z}}$, so $\omega_a$ and $\omega_b$ could be positive or negative.]

(b) As the cylinders begin to rotate, their changing magnetic field induces an extra azimuthal
electric field, which, in turn, will make an additional contribution to the torques. Find
the resulting extra angular momentum, and compare it with your result in (a). [Answer:
$-\mu_0 Q^2\omega_b(b^2 - a^2)/4\pi l\;\hat{\mathbf{z}}$.]


Problem 8.25.²⁹ A point charge q is a distance a > R from the axis of an infinite solenoid
(radius R, n turns per unit length, current I). Find the linear momentum and the angular
momentum (with respect to the origin) in the fields. (Put q on the x-axis, with the solenoid
along z; treat the solenoid as a nonconductor, so you don't need to worry about induced charges
on its surface.) [Answer: $\mathbf{p} = (\mu_0 q n I R^2/2a)\,\hat{\mathbf{y}}$; L = 0.] What if a < R?


Problem 8.26. A point electric dipole p is located at the center of a spherical shell (radius R)
carrying a surface current $\mathbf{K} = K_0\sin\theta\,\hat{\boldsymbol{\phi}}$.


(a) Write down the electric and magnetic fields, inside and outside the sphere. [Hints: Don't 
forget the contact term in the electric field; for the magnetic field refer to Exs. 5.11 and 6.1, 
and express your answer in terms of the magnetic dipole moment m of the sphere.] 

(b) Calculate the momentum in the fields. 

(c) Suppose we turn off the electric field, by connecting a fine wire between the two ends of 
the dipole and allowing it to discharge quasistatically. Find the net impulse delivered to the 
current in the wire. [Hint: In parts (c) and (d) it may help to set the z-axis along p, instead 
of m, but always express your final answer in coordinate-independent form.]


27 As it stands, Eq. 8.45 is valid only for ideal dipoles. But g is linear in B, and therefore, if E is
held fixed, obeys the superposition principle: for a collection of magnetic dipoles, the total 
momentum is the (vector) sum of the momenta for each one separately. In particular, if E 
is uniform over a localized steady current distribution, then Eq. 8.45 is valid for the whole 
thing, only now m is the total magnetic dipole moment. 

28 This problem was suggested by Paul DeYoung. 

29 See F. S. Johnson, B. L. Cragin, and R. R. Hodges, Am. J. Phys. 62, 33 (1994), and B. Y.-K. 
Hu, Eur. J. Phys. 33, 873 (2012) for discussion of this and related problems. 
(d) Meanwhile, the changing electric field induces a magnetic field, $\mathbf{B}_{\text{ind}}$; what is this field,
and what impulse does it deliver to the sphere?

(e) Suppose instead we turn off the magnetic field, by gradually decreasing $K_0$. The changing
magnetic field induces an electric field, $\mathbf{E}_{\text{ind}}$. What impulse does this deliver to p?

(f) Comment on your results. Is momentum conserved, here? 


Problem 8.27. 


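(a) Carry through the argument in Section 8.1.2, starting with Eq. 8.6, but using $\mathbf{J}_f$ in place of
$\mathbf{J}$. Show that the Poynting vector becomes

\[ \mathbf{S} = \mathbf{E} \times \mathbf{H}, \tag{8.47} \]

and the rate of change of the energy density in the fields is

\[ \frac{\partial u}{\partial t} = \mathbf{E} \cdot \frac{\partial \mathbf{D}}{\partial t} + \mathbf{H} \cdot \frac{\partial \mathbf{B}}{\partial t}. \]

For linear media, show that³¹

\[ u = \frac{1}{2}(\mathbf{E} \cdot \mathbf{D} + \mathbf{B} \cdot \mathbf{H}). \tag{8.48} \]

(b) In the same spirit, reproduce the argument in Section 8.2.2, starting with Eq. 8.14, with $\rho_f$
and $\mathbf{J}_f$ in place of $\rho$ and $\mathbf{J}$. Don’t bother to construct the Maxwell stress tensor, but do
show that the momentum density is³²

\[ \mathbf{g} = \mathbf{D} \times \mathbf{B}. \tag{8.49} \]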


Problem 8.28. A circular disk of radius R and mass M carries n point charges (q), attached
at regular intervals around its rim. At time t = 0 the disk lies in the xy plane, with its center
at the origin, and is rotating about the z-axis with angular velocity $\omega_0$, when it is released. The
disk is immersed in a (time-independent) external magnetic field

\[ \mathbf{B}(s, z) = k(-s\,\hat{\mathbf{s}} + 2z\,\hat{\mathbf{z}}), \]

where k is a constant.


(a) Find the position of the center of the ring, z(t), and its angular velocity, $\omega(t)$, as functions
of time. (Ignore gravity.)

(b) Describe the motion, and check that the total (kinetic) energy (translational plus
rotational) is constant, confirming that the magnetic force does no work.


30 This is not the end of the story, as we shall see in Chapter 12 (Prob. 12.43).

31 Refer to Section 4.4.3 for the meaning of “energy” in this context.

32 For over 100 years there has been a raging debate (still not completely resolved) as to
whether the field momentum in polarizable/magnetizable media is Eq. 8.49 (Minkowski’s
candidate) or $\epsilon_0 \mu_0 (\mathbf{E} \times \mathbf{H})$ (Abraham’s). See D. J. Griffiths, Am. J. Phys. 80, 7 (2012).

33 This cute problem is due to K. T. McDonald, http://kirkmcd.princeton.edu/examples/disk.pdf
(who draws a somewhat different conclusion).
Problem 8.29. A particle with magnetic dipole moment $\mathbf{m} = m\,\hat{\mathbf{z}}$ and (positive) charge Q
is fixed at the origin. A particle of (negative) charge q is released from rest at infinity, and “falls”
in, attracted by Q and deflected by the magnetic field of m. Its resulting trajectory is
$\mathbf{r}(t) = (r(t), \theta(t), \phi(t))$.


(a) Show that the z component of the torque on q is

\[ N_z = \frac{\mu_0 q m}{4\pi}\left(\frac{\sin^2\theta}{r^2}\,\dot{r} - \frac{2\sin\theta\cos\theta}{r}\,\dot{\theta}\right) = -\frac{\mu_0 q m}{4\pi}\,\frac{d}{dt}\left(\frac{\sin^2\theta}{r}\right), \]

and hence the z component of its angular momentum is

\[ L_z = -\frac{\mu_0 q m}{4\pi}\,\frac{\sin^2\theta}{r}. \]

(b) Where did this angular momentum come from? [Hint: Refer to Prob. 8.22.]


Electromagnetic Waves 


9.1 Waves in One Dimension 


9.1.1 The Wave Equation 


What is a “wave”? I don’t think I can give you an entirely satisfactory 
answer — the concept is intrinsically somewhat vague — but here’s a start: 
A wave is a disturbance of a continuous medium that propagates with a fixed 
shape at constant velocity. Immediately I must add qualifiers: in the presence 
of absorption, the wave will diminish in size as it moves; if the medium is 
dispersive, different frequencies travel at different speeds; in two or three 
dimensions, as the wave spreads out, its amplitude will decrease; and of 
course standing waves don’t propagate at all. But these are refinements; 
let’s start with the simple case: fixed shape, constant speed, one dimension 
(Fig. 9.1). 

How would you represent such an object mathematically? In Fig. 9.1,
I have drawn the wave at two different times, once at t = 0, and again at some
later time t; each point on the wave form simply shifts to the right by an
amount vt, where v is the velocity. Maybe the wave is generated by shaking
one end of a taut string; f(z, t) represents the displacement of the string at
the point z, at time t. Given the initial shape of the string, g(z) = f(z, 0),
what is the subsequent form, f(z, t)? Well, the displacement at point z, at
the later time t, is the same as the displacement a distance vt to the left (that
is, at z − vt), back at time t = 0:

\[ f(z, t) = f(z - vt, 0) = g(z - vt). \tag{9.1} \]

That statement captures (mathematically) the essence of wave motion.
It tells us that the function f(z, t), which might have depended on z and t in
any old way, in fact depends on them only in the very special combination
z − vt; when that is true, the function f(z, t) represents a wave of fixed shape
traveling in the z direction at speed v. For example, if A and b are constants
(with the appropriate units),

\[ f_1(z, t) = A e^{-b(z - vt)^2}, \quad f_2(z, t) = A \sin[b(z - vt)], \quad f_3(z, t) = \frac{A}{b(z - vt)^2 + 1} \]

all represent waves (with different shapes, of course), but

\[ f_4(z, t) = A e^{-b(bz^2 + vt)} \quad\text{and}\quad f_5(z, t) = A \sin(bz) \cos(bvt)^3 \]

do not.
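The criterion in Eq. 9.1 is easy to test numerically. The following is only a sketch under stated assumptions: the constants A, b, and v, the sample points, and the helper name `is_rigid_shift` are arbitrary illustrative choices, and the last factor of the second function is read as the cosine of (bvt) cubed. It compares f(z, t) with f(z − vt, 0) for one function that depends only on z − vt and one that does not.

```python
import math

A, b, v = 1.0, 2.0, 3.0  # illustrative constants, not values from the text


def f1(z, t):
    """Gaussian pulse: depends on z and t only through z - v*t."""
    return A * math.exp(-b * (z - v * t) ** 2)


def f5(z, t):
    """Product form: z and t do not enter through z - v*t alone."""
    return A * math.sin(b * z) * math.cos((b * v * t) ** 3)


def is_rigid_shift(f, points, times, tol=1e-12):
    """True if f(z, t) equals f(z - v*t, 0) at every sampled (z, t)."""
    return all(abs(f(z, t) - f(z - v * t, 0.0)) < tol
               for z in points for t in times)


points = [-1.0, -0.3, 0.0, 0.4, 1.2]
times = [0.0, 0.1, 0.5, 1.0]
print(is_rigid_shift(f1, points, times))
print(is_rigid_shift(f5, points, times))
```

Passing this test at a handful of points is of course not a proof; it only shows what the statement "fixed shape, constant speed" means in practice.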




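Why does a stretched string support wave motion? Actually, it follows
from Newton’s second law. Imagine a very long string under tension T.
If it is displaced from equilibrium, the net transverse force on the segment
between $z$ and $z + \Delta z$ (Fig. 9.2) is

\[ \Delta F = T \sin\theta' - T \sin\theta, \]

where $\theta'$ is the angle the string makes with the z direction at point $z + \Delta z$,
and $\theta$ is the corresponding angle at point z. Provided that the distortion of
the string is not too great, these angles are small (the figure is exaggerated,
obviously), and we can replace the sine by the tangent:

\[ \Delta F \approx T(\tan\theta' - \tan\theta) = T\left(\left.\frac{\partial f}{\partial z}\right|_{z + \Delta z} - \left.\frac{\partial f}{\partial z}\right|_{z}\right) \approx T\,\frac{\partial^2 f}{\partial z^2}\,\Delta z. \]

If the mass per unit length is $\mu$, Newton’s second law says

\[ \Delta F = \mu(\Delta z)\,\frac{\partial^2 f}{\partial t^2}, \]

and therefore

\[ \frac{\partial^2 f}{\partial z^2} = \frac{\mu}{T}\,\frac{\partial^2 f}{\partial t^2}. \]

Evidently, small disturbances on the string satisfy

\[ \frac{\partial^2 f}{\partial z^2} = \frac{1}{v^2}\,\frac{\partial^2 f}{\partial t^2}, \tag{9.2} \]

where v (which, as we’ll soon see, represents the speed of propagation) is

\[ v = \sqrt{\frac{T}{\mu}}. \tag{9.3} \]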
Equation 9.2 is known as the (classical) wave equation, because it admits
as solutions all functions of the form

\[ f(z, t) = g(z - vt) \tag{9.4} \]

(that is, all functions that depend on the variables z and t in the special
combination $u = z - vt$), and we have just learned that such functions
represent waves propagating in the z direction with speed v.


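Proof.

\[ \frac{\partial f}{\partial z} = \frac{dg}{du}\frac{\partial u}{\partial z} = \frac{dg}{du}, \qquad \frac{\partial f}{\partial t} = \frac{dg}{du}\frac{\partial u}{\partial t} = -v\,\frac{dg}{du}, \]

and

\[ \frac{\partial^2 f}{\partial z^2} = \frac{\partial}{\partial z}\left(\frac{dg}{du}\right) = \frac{d^2 g}{du^2}\frac{\partial u}{\partial z} = \frac{d^2 g}{du^2}, \]

\[ \frac{\partial^2 f}{\partial t^2} = -v\,\frac{\partial}{\partial t}\left(\frac{dg}{du}\right) = -v\,\frac{d^2 g}{du^2}\frac{\partial u}{\partial t} = v^2\,\frac{d^2 g}{du^2}, \]

so

\[ \frac{d^2 g}{du^2} = \frac{\partial^2 f}{\partial z^2} = \frac{1}{v^2}\,\frac{\partial^2 f}{\partial t^2}. \qquad \square \]

Note that g(u) can be any (differentiable) function whatever.¹ If the
disturbance propagates at constant speed without changing its shape, then
it satisfies the wave equation.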

But functions of the form g(z — vt) are not the only solutions. The wave 
equation involves the square of v, so we can generate another class of 
solutions by simply changing the sign of the velocity: 


\[ f(z, t) = h(z + vt). \tag{9.5} \]


This, of course, represents a wave propagating in the negative z direction, 
and it is certainly reasonable (on physical grounds) that such solutions 
would be allowed. What is perhaps surprising is that the most general 
solution to the wave equation is the sum of a wave to the right and a wave 
to the left: 


\[ f(z, t) = g(z - vt) + h(z + vt). \tag{9.6} \]


(Notice that the wave equation is linear: the sum of any two solutions is 
itself a solution.) As it turns out, every solution to the wave equation can 
be expressed in this form. 
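If you would like to see Eq. 9.6 at work without doing the derivatives by hand, a finite-difference check is one option. The sketch below is an illustration, not a proof: the shapes g and h, the speed v, the step sizes, and the names `wave_residual` and `f_bad` are all assumptions chosen for the example. It estimates the left side of the wave equation minus the right side at a single point.

```python
import math

v = 2.0  # wave speed (illustrative value)


def g(u):
    return math.exp(-u * u)        # shape traveling to the right


def h(u):
    return 0.5 / (1.0 + u * u)     # shape traveling to the left


def f(z, t):
    """Sum of a right-moving and a left-moving wave (form of Eq. 9.6)."""
    return g(z - v * t) + h(z + v * t)


def f_bad(z, t):
    """A function that is NOT of the form g(z - vt) + h(z + vt)."""
    return math.exp(-(z * z + v * t))


def wave_residual(func, z, t, dz=1e-3, dt=1e-3):
    """Central-difference estimate of f_zz - f_tt / v**2 at (z, t)."""
    f_zz = (func(z + dz, t) - 2 * func(z, t) + func(z - dz, t)) / dz**2
    f_tt = (func(z, t + dt) - 2 * func(z, t) + func(z, t - dt)) / dt**2
    return f_zz - f_tt / v**2


print(wave_residual(f, 0.3, 0.7))      # close to zero (discretization error only)
print(wave_residual(f_bad, 0.3, 0.7))  # clearly nonzero
```

The residual for f is not exactly zero because the derivatives are approximated; it shrinks as the steps dz and dt are reduced (until rounding error takes over).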

Like the simple harmonic oscillator equation, the wave equation is
ubiquitous in physics. If something is vibrating, the oscillator equation
is almost certainly responsible (at least, for small amplitudes), and if
something is waving (whether the context is mechanics or acoustics, optics
or oceanography), the wave equation (perhaps with some decoration) is
bound to be involved.


1 Does it surprise you that the class of solutions to the wave equation is so huge? After all, an
ordinary second-order differential equation typically admits just two linearly independent
solutions .... That’s right, but there’s no such rule for partial differential equations.


Problem 9.1. By explicit differentiation, check that the functions f1, f2, and f3 in the text 
satisfy the wave equation. Show that f4 and f5 do not. 


Problem 9.2. Show that the standing wave f(z, t) = A sin(kz) cos(kvt) satisfies the
wave equation, and express it as the sum of a wave traveling to the left and a wave traveling to 
the right (Eq. 9.6). 


9.1.2 Sinusoidal Waves 


(i) Terminology. Of all possible wave forms, the sinusoidal one

\[ f(z, t) = A \cos[k(z - vt) + \delta] \tag{9.7} \]

is (for good reason) the most familiar. Figure 9.3 shows this function at
time t = 0. The quantity A is the amplitude of the wave (it is positive, and
represents the maximum displacement from equilibrium). The argument
of the cosine is called the phase, and $\delta$ is the phase constant (obviously, you
can add any integer multiple of $2\pi$ to $\delta$ without changing f(z, t); ordinarily,
one uses a value in the range $0 \le \delta < 2\pi$, but sometimes $-\pi < \delta \le \pi$, for
example, may be more convenient). Notice that at $z = vt - \delta/k$, the phase is
zero; let’s call this the “central maximum.” If $\delta = 0$, the central maximum
passes the origin at time t = 0; more generally, $\delta/k$ is the distance by which
the central maximum (and therefore the entire wave) is “delayed.” Finally,
k is the wave number; it is related to the wavelength $\lambda$ by the equation

\[ \lambda = \frac{2\pi}{k}, \tag{9.8} \]

for when z advances by $2\pi/k$, the cosine executes one complete cycle.

As time passes, the entire wave train proceeds to the right, at speed v.
At any given point z, the string vibrates up and down, undergoing one full
cycle in a period

\[ T = \frac{2\pi}{kv}. \tag{9.9} \]
The frequency $\nu$ (number of oscillations per unit time) is

\[ \nu = \frac{1}{T} = \frac{kv}{2\pi} = \frac{v}{\lambda}. \tag{9.10} \]

For our purposes, a more convenient unit is the angular frequency $\omega$,
so called because in the analogous case of uniform circular motion, it
represents the number of radians swept out per unit time:

\[ \omega = 2\pi\nu = kv, \quad\text{so}\quad \boxed{\omega = kv}. \tag{9.11} \]


Ordinarily, it’s nicer to write sinusoidal waves (Eq. 9.7) in terms of $\omega$,
instead of v:²

\[ f(z, t) = A \cos(kz - \omega t + \delta). \tag{9.12} \]

A sinusoidal oscillation of wave number k and (angular) frequency $\omega$
traveling to the left would be written

\[ f(z, t) = A \cos(kz + \omega t - \delta). \tag{9.13} \]

The sign of the phase constant is chosen for consistency with our previous
convention that $\delta/k$ shall represent the distance by which the wave is
“delayed” (since the wave is now moving to the left, a delay means a shift
to the right). At t = 0, the wave looks like Fig. 9.4. Because the cosine is an
even function, we can just as well write Eq. 9.13 thus:

\[ f(z, t) = A \cos(-kz - \omega t + \delta). \tag{9.14} \]


Comparison with Eq. 9.12 reveals that we could simply switch the sign of k 
to produce a wave with the same amplitude, phase constant, frequency, and 
wavelength, traveling in the opposite direction. 
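The bookkeeping among k, v, the wavelength, the period, and the two frequencies (Eqs. 9.8 to 9.11) can be collected in a few lines. This is a minimal sketch; the function name `wave_parameters` and the numerical values of A, k, v, and the phase constant are assumptions chosen for illustration.

```python
import math


def wave_parameters(k, v):
    """Wavelength, period, frequency, angular frequency for k > 0 and v > 0."""
    wavelength = 2 * math.pi / k        # Eq. 9.8
    period = 2 * math.pi / (k * v)      # Eq. 9.9
    frequency = 1 / period              # Eq. 9.10
    omega = k * v                       # Eq. 9.11
    return wavelength, period, frequency, omega


def f(z, t, A=1.0, k=4.0, v=0.5, delta=0.3):
    """Right-moving sinusoidal wave in the form of Eq. 9.7."""
    return A * math.cos(k * (z - v * t) + delta)


lam, T, nu, omega = wave_parameters(k=4.0, v=0.5)
print(lam, T, nu, omega)
print(f(0.2, 0.1), f(0.2 + lam, 0.1), f(0.2, 0.1 + T))  # three equal values
```

Shifting z by one wavelength, or t by one period, leaves the wave unchanged, which is just what the definitions require.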


(ii) Complex notation. In view of Euler’s formula,

\[ e^{i\theta} = \cos\theta + i \sin\theta, \tag{9.15} \]

the sinusoidal wave (Eq. 9.12) can be written

\[ f(z, t) = \mathrm{Re}\left[A e^{i(kz - \omega t + \delta)}\right], \tag{9.16} \]

where Re($\xi$) denotes the real part of the complex number $\xi$. This invites us
to introduce the complex wave function

\[ \tilde{f}(z, t) = \tilde{A} e^{i(kz - \omega t)}, \tag{9.17} \]

with the complex amplitude $\tilde{A} = A e^{i\delta}$ subsuming both the (positive) real
amplitude A and the phase constant $\delta$. The actual wave function is the real
part of $\tilde{f}$:

\[ f(z, t) = \mathrm{Re}[\tilde{f}(z, t)]. \tag{9.18} \]

If you know $\tilde{f}$, it is a simple matter to find f; the advantage of the complex
notation is that exponentials are much easier to manipulate than sines and
cosines.


2 I put a box around the equation $\omega = kv$, because the relation between wave number (or
wavelength) and frequency ($\nu$ or $\omega$) is fundamental, and it really ought to have a memorable
name. Essentially the same idea can be expressed in several different ways: $\nu\lambda = v$, $\lambda\omega = 2\pi v$, etc.


Example 9.1. Suppose you want to add two sinusoidal waves:

\[ f_3 = f_1 + f_2 = \mathrm{Re}(\tilde{f}_1) + \mathrm{Re}(\tilde{f}_2) = \mathrm{Re}(\tilde{f}_1 + \tilde{f}_2) = \mathrm{Re}(\tilde{f}_3), \]

with $\tilde{f}_3 = \tilde{f}_1 + \tilde{f}_2$. You simply add the corresponding complex wave
functions, and then take the real part. In particular, if they have the
same frequency and wave number,

\[ \tilde{f}_3 = \tilde{A}_1 e^{i(kz - \omega t)} + \tilde{A}_2 e^{i(kz - \omega t)} = \tilde{A}_3 e^{i(kz - \omega t)}, \]

where

\[ \tilde{A}_3 = \tilde{A}_1 + \tilde{A}_2, \quad\text{or}\quad A_3 e^{i\delta_3} = A_1 e^{i\delta_1} + A_2 e^{i\delta_2}. \tag{9.19} \]

In other words, you just add the (complex) amplitudes. The combined
wave still has the same frequency and wavelength,

\[ f_3(z, t) = A_3 \cos(kz - \omega t + \delta_3), \]

and you can easily figure out $A_3$ and $\delta_3$ from Eq. 9.19 (see Prob. 9.3).
Try doing this without using the complex notation, and you will find
yourself looking up trig identities and wading through nasty algebra.
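On a computer the rule "add the complex amplitudes" is a one-liner. The sketch below uses Python's standard `cmath` module; the function name `combine` and the numerical amplitudes and phase constants are illustrative assumptions. It forms the combined amplitude and phase constant numerically and compares the result with the direct sum of the two cosines.

```python
import cmath
import math


def combine(A1, d1, A2, d2):
    """Return (A3, delta3) for two waves with the same k and omega (Eq. 9.19)."""
    A3_tilde = cmath.rect(A1, d1) + cmath.rect(A2, d2)  # complex amplitudes add
    return abs(A3_tilde), cmath.phase(A3_tilde)         # phase lies in (-pi, pi]


A1, d1, A2, d2 = 2.0, 0.4, 1.5, -1.1   # illustrative values
A3, d3 = combine(A1, d1, A2, d2)

for phase in (0.0, 0.9, -2.3):          # sample values of kz - wt
    direct = A1 * math.cos(phase + d1) + A2 * math.cos(phase + d2)
    combined = A3 * math.cos(phase + d3)
    print(direct, combined)             # the two columns agree
```

Note that `cmath.phase` returns a value between −π and π, so if you prefer the range from 0 to 2π you would add 2π to negative results.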


Caveat: The complex notation is great for adding sinusoidal functions,
because

\[ \mathrm{Re}(\tilde{f}_1 + \tilde{f}_2) = \mathrm{Re}(\tilde{f}_1) + \mathrm{Re}(\tilde{f}_2), \]

but not so useful for multiplying them:

\[ \mathrm{Re}(\tilde{f}_1 \cdot \tilde{f}_2) \neq \mathrm{Re}(\tilde{f}_1) \cdot \mathrm{Re}(\tilde{f}_2). \]

Be on guard, therefore, when you encounter quantities (energy, for example)
that are quadratic. There are tricks for handling special cases (see
Prob. 9.12), but the fireproof method is to revert to the real notation.
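A two-line numerical experiment makes the caveat vivid. In this sketch the two phases (0.3 and 1.2 radians) are arbitrary choices, and the variable names are made up for the illustration.

```python
import cmath
import math

f1 = cmath.exp(0.3j)   # complex wave value at some (z, t); phase chosen arbitrarily
f2 = cmath.exp(1.2j)

# Taking the real part commutes with addition ...
sum_gap = (f1 + f2).real - (f1.real + f2.real)

# ... but not with multiplication: Re(f1*f2) = cos(a+b), while
# Re(f1)*Re(f2) = cos(a)*cos(b); they differ by -sin(a)*sin(b).
product_gap = (f1 * f2).real - f1.real * f2.real

print(sum_gap)
print(product_gap, -math.sin(0.3) * math.sin(1.2))
```

The mismatch in the product is just the sin·sin term of the cosine addition formula, which the real parts alone know nothing about.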
(iii) Linear combinations of sinusoidal waves. Although the sinusoidal
function (Eq. 9.17) is a very special wave form, the fact is that any wave
can be expressed as a linear combination of sinusoidal ones:

\[ \tilde{f}(z, t) = \int_{-\infty}^{\infty} \tilde{A}(k)\, e^{i(kz - \omega t)}\, dk. \tag{9.20} \]

Here $\omega$ is a function of k (Eq. 9.11), and I have allowed k to run negative
in order to include waves going both directions.³

The formula for $\tilde{A}(k)$, in terms of the initial conditions $f(z, 0)$ and
$\dot{f}(z, 0)$, can be obtained from the theory of Fourier transforms (see
Prob. 9.34), but the details are not relevant to my purpose here. The point
is that any wave can be written as a linear combination of sinusoidal waves,
and therefore if you know how sinusoidal waves behave, you know in
principle how any wave behaves. So from now on, we shall confine our
attention to sinusoidal waves.


Problem 9.3. Use Eq. 9.19 to determine $A_3$ and $\delta_3$ in terms of $A_1$, $A_2$, $\delta_1$, and $\delta_2$.


Problem 9.4. Obtain Eq. 9.20 directly from the wave equation, by separation of variables. 


9.1.3 Boundary Conditions: Reflection and Transmission 


So far, I have assumed the string is infinitely long — or at any rate long 
enough that we don’t need to worry about what happens to a wave when 
it reaches the end. As a matter of fact, what happens depends a lot on how 
the string is attached — that is, on the specific boundary conditions to which 
the wave is subject. Suppose, for instance, that the string is simply tied onto 
a second string. The tension T is the same for both, but the mass per unit
length $\mu$, presumably, is not, and hence the wave velocities $v_1$ and $v_2$ are
different (remember, $v = \sqrt{T/\mu}$). Let’s say, for convenience, that the knot
occurs at z = 0. The incident wave

\[ \tilde{f}_I(z, t) = \tilde{A}_I e^{i(k_1 z - \omega t)} \quad (z < 0), \tag{9.21} \]

coming in from the left, gives rise to a reflected wave

\[ \tilde{f}_R(z, t) = \tilde{A}_R e^{i(-k_1 z - \omega t)} \quad (z < 0), \tag{9.22} \]

traveling back along string 1 (hence the minus sign in front of $k_1$), in
addition to a transmitted wave

\[ \tilde{f}_T(z, t) = \tilde{A}_T e^{i(k_2 z - \omega t)} \quad (z > 0), \tag{9.23} \]

which continues on to the right in string 2.


3 This does not mean that $\lambda$ and $\omega$ are negative: wavelength and frequency are intrinsically
positive (like speed v and amplitude A). If we allow negative wave numbers, then Eqs. 9.8
and 9.11 should really be written $\lambda = 2\pi/|k|$ and $\omega = |k|v$.
The incident wave $\tilde{f}_I(z, t)$ is a sinusoidal oscillation that extends (in principle)
all the way back to $z = -\infty$, and has been doing so for all of history.
The same goes for $\tilde{f}_R$ and $\tilde{f}_T$ (except that the latter, of course, extends to
$z = +\infty$). All parts of the system are oscillating at the same frequency $\omega$
(a frequency determined by the person at $z = -\infty$, who is shaking the string
in the first place). Since the wave velocities are different in the two strings,
however, the wavelengths and wave numbers are also different:

\[ \frac{\lambda_1}{\lambda_2} = \frac{k_2}{k_1} = \frac{v_1}{v_2}. \tag{9.24} \]

Of course, this situation is pretty artificial; what’s more, with incident
and reflected waves of infinite extent traveling on the same piece of string,
it’s going to be hard for a spectator to tell them apart. You might therefore
prefer to consider an incident wave of finite extent, say, the pulse shown in
Fig. 9.5. You can work out the details for yourself, if you like (Prob. 9.5).
The trouble with this approach is that no finite pulse is truly sinusoidal. The
waves in Fig. 9.5 may look like sine functions, but they’re not: they’re little
pieces of sines, joined onto an entirely different function (namely, zero).
Like any other waves, they can be built up as linear combinations of true
sinusoidal functions (Eq. 9.20), but only by putting together a whole range
of frequencies and wavelengths. If you want a single incident frequency (as
we shall in the electromagnetic case), you must let your waves extend to
infinity. (In practice, if you use a very long pulse, with many oscillations, it
will be close to the ideal of a single frequency.)

For a sinusoidal incident wave, then, the net disturbance of the string is

\[ \tilde{f}(z, t) = \begin{cases} \tilde{A}_I e^{i(k_1 z - \omega t)} + \tilde{A}_R e^{i(-k_1 z - \omega t)}, & \text{for } z < 0, \\ \tilde{A}_T e^{i(k_2 z - \omega t)}, & \text{for } z > 0. \end{cases} \tag{9.25} \]
At the join (z = 0), the displacement just slightly to the left ($z = 0^-$) must
equal the displacement slightly to the right ($z = 0^+$), or else there would be a
break in the string. Mathematically, f(z, t) is continuous at z = 0:

\[ f(0^-, t) = f(0^+, t). \tag{9.26} \]

If the knot itself is of negligible mass, then the derivative of f must also be
continuous:

\[ \left.\frac{\partial f}{\partial z}\right|_{0^-} = \left.\frac{\partial f}{\partial z}\right|_{0^+}. \tag{9.27} \]

[Figure 9.5: (a) Incident pulse; (b) Reflected and transmitted pulses.]

[Figure 9.6: (a) Discontinuous slope; force on knot. (b) Continuous slope; no force on knot.]


Otherwise there would be a net force on the knot, and therefore an infinite
acceleration (Fig. 9.6). These boundary conditions apply directly to the real
wave function f(z, t). But since the imaginary part of $\tilde{f}$ differs from the real
part only in the replacement of cosine by sine (Eq. 9.15), it follows that the
complex wave function $\tilde{f}(z, t)$ obeys the same rules:

\[ \tilde{f}(0^-, t) = \tilde{f}(0^+, t), \qquad \left.\frac{\partial \tilde{f}}{\partial z}\right|_{0^-} = \left.\frac{\partial \tilde{f}}{\partial z}\right|_{0^+}. \tag{9.28} \]

When applied to Eq. 9.25, these boundary conditions determine the
outgoing amplitudes ($\tilde{A}_R$ and $\tilde{A}_T$) in terms of the incoming one ($\tilde{A}_I$):

\[ \tilde{A}_I + \tilde{A}_R = \tilde{A}_T, \qquad k_1(\tilde{A}_I - \tilde{A}_R) = k_2 \tilde{A}_T, \]
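from which it follows that

\[ \tilde{A}_R = \left(\frac{k_1 - k_2}{k_1 + k_2}\right)\tilde{A}_I, \qquad \tilde{A}_T = \left(\frac{2k_1}{k_1 + k_2}\right)\tilde{A}_I. \tag{9.29} \]

Or, in terms of the velocities (Eq. 9.24):

\[ \tilde{A}_R = \left(\frac{v_2 - v_1}{v_2 + v_1}\right)\tilde{A}_I, \qquad \tilde{A}_T = \left(\frac{2v_2}{v_2 + v_1}\right)\tilde{A}_I. \tag{9.30} \]

The real amplitudes and phases, then, are related by

\[ A_R e^{i\delta_R} = \left(\frac{v_2 - v_1}{v_2 + v_1}\right) A_I e^{i\delta_I}, \qquad A_T e^{i\delta_T} = \left(\frac{2v_2}{v_2 + v_1}\right) A_I e^{i\delta_I}. \tag{9.31} \]

If the second string is lighter than the first ($\mu_2 < \mu_1$, so that $v_2 > v_1$), all
three waves have the same phase angle ($\delta_R = \delta_T = \delta_I$), and the outgoing
amplitudes are

\[ A_R = \left(\frac{v_2 - v_1}{v_2 + v_1}\right) A_I, \qquad A_T = \left(\frac{2v_2}{v_2 + v_1}\right) A_I. \tag{9.32} \]

If the second string is heavier than the first ($v_2 < v_1$), the reflected wave is
out of phase by 180° ($\delta_R + \pi = \delta_T = \delta_I$). In other words, since

\[ \cos(-k_1 z - \omega t + \delta_I - \pi) = -\cos(-k_1 z - \omega t + \delta_I), \]

the reflected wave is “upside down.” The amplitudes in this case are

\[ A_R = \left(\frac{v_1 - v_2}{v_2 + v_1}\right) A_I, \qquad A_T = \left(\frac{2v_2}{v_2 + v_1}\right) A_I. \tag{9.33} \]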
In particular, if the second string is infinitely massive (or, what amounts to
the same thing, if the first string is simply tied to an immovable post at the
end), then

\[ A_R = A_I \quad\text{and}\quad A_T = 0. \]

Naturally, in this case there is no transmitted wave; all of it reflects back.
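The amplitude formulas for the knot are simple enough to tabulate by machine. Here is a minimal sketch of Eq. 9.30, assuming a massless knot and equal tension on both sides as in the text; the function name `reflect_transmit`, the sample speeds, and the choice of angular frequency are illustrative. It also confirms that the results satisfy the two boundary conditions they were derived from.

```python
import cmath


def reflect_transmit(A_I, v1, v2):
    """Complex reflected and transmitted amplitudes at a massless knot (Eq. 9.30)."""
    A_R = (v2 - v1) / (v2 + v1) * A_I
    A_T = 2 * v2 / (v2 + v1) * A_I
    return A_R, A_T


A_I = cmath.rect(1.0, 0.2)      # incident amplitude 1, phase constant 0.2 rad
v1, v2 = 2.0, 1.0               # second string heavier, so v2 < v1
A_R, A_T = reflect_transmit(A_I, v1, v2)

omega = 1.0                     # any common frequency will do
k1, k2 = omega / v1, omega / v2

print(abs(A_I + A_R - A_T))                  # continuity of f at the knot
print(abs(k1 * (A_I - A_R) - k2 * A_T))      # continuity of the slope
print(abs(A_R), cmath.phase(A_R / A_I))      # real amplitude and phase shift
```

For v2 < v1 the ratio of reflected to incident complex amplitude is a negative real number, so the printed phase shift is π: that is the "upside down" reflection. In the limit v2 → 0 the complex reflected amplitude approaches minus the incident one, which in terms of real (positive) amplitudes is the statement that the two are equal.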


Problem 9.5. Suppose you send an incident wave of specified shape, $g_I(z - v_1 t)$, down
string 1. It gives rise to a reflected wave, $h_R(z + v_1 t)$, and a transmitted wave, $g_T(z - v_2 t)$.
By imposing the boundary conditions 9.26 and 9.27, find $h_R$ and $g_T$.


Problem 9.6. 


(a) Formulate an appropriate boundary condition, to replace Eq. 9.27, for the case of two strings
under tension T joined by a knot of mass m.

(b) Find the amplitude and phase of the reflected and transmitted waves for the case where
the knot has mass m and the second string is massless.


Problem 9.7. Suppose string 2 is embedded in a viscous medium (such as molasses), which
imposes a drag force that is proportional to its (transverse) speed:

\[ \Delta F_{\text{drag}} = -\gamma\,\frac{\partial f}{\partial t}\,\Delta z. \]

(a) Derive the modified wave equation describing the motion of the string.

(b) Solve this equation, assuming the string vibrates at the incident frequency $\omega$. That is, look
for solutions of the form $\tilde{f}(z, t) = e^{i\omega t}\tilde{F}(z)$.

(c) Show that the waves are attenuated (that is, their amplitude decreases with increasing z).
Find the characteristic penetration distance, at which the amplitude is 1/e of its original
value, in terms of $\gamma$, T, $\mu$, and $\omega$.

(d) If a wave of amplitude $A_I$, phase $\delta_I = 0$, and frequency $\omega$ is incident from the left
(string 1), find the reflected wave's amplitude and phase.


9.1.4 Polarization 


The waves that travel down a string when you shake it are called transverse, 
because the displacement is perpendicular to the direction of propagation. 
If the string is reasonably elastic, it is also possible to stimulate compression 
waves, by giving the string little tugs. Compression waves are hard to see on a 
string, but if you try it with a slinky they’re quite noticeable (Fig. 9.7). These 
waves are called longitudinal, because the displacement from equilibrium is 
along the direction of propagation. Sound waves, which are nothing but 
compression waves in air, are longitudinal; electromagnetic waves, as we 
shall see, are transverse. 
[Figure 9.8(c): Polarization vector.]


Now there are, of course, two dimensions perpendicular to any given line 
of propagation. Accordingly, transverse waves occur in two independent 
states of polarization: you can shake the string up-and-down (“vertical” 
polarization — Fig. 9.8a), 


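\[ \tilde{\mathbf{f}}_v(z, t) = \tilde{A} e^{i(kz - \omega t)}\,\hat{\mathbf{x}}, \tag{9.34} \]

or left-and-right (“horizontal” polarization; Fig. 9.8b),

\[ \tilde{\mathbf{f}}_h(z, t) = \tilde{A} e^{i(kz - \omega t)}\,\hat{\mathbf{y}}, \tag{9.35} \]

or along any other direction in the xy plane (Fig. 9.8c):

\[ \tilde{\mathbf{f}}(z, t) = \tilde{A} e^{i(kz - \omega t)}\,\hat{\mathbf{n}}. \tag{9.36} \]

The polarization vector $\hat{\mathbf{n}}$ defines the plane of vibration.⁴ Because the waves
are transverse, $\hat{\mathbf{n}}$ is perpendicular to the direction of propagation:

\[ \hat{\mathbf{n}} \cdot \hat{\mathbf{z}} = 0. \tag{9.37} \]

In terms of the polarization angle $\theta$,

\[ \hat{\mathbf{n}} = \cos\theta\,\hat{\mathbf{x}} + \sin\theta\,\hat{\mathbf{y}}. \tag{9.38} \]

Thus, the wave pictured in Fig. 9.8c can be considered a superposition of
two waves, one horizontally polarized, the other vertically:

\[ \tilde{\mathbf{f}}(z, t) = (\tilde{A}\cos\theta)\, e^{i(kz - \omega t)}\,\hat{\mathbf{x}} + (\tilde{A}\sin\theta)\, e^{i(kz - \omega t)}\,\hat{\mathbf{y}}. \tag{9.39} \]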


4 Notice that you can always switch the sign of $\hat{\mathbf{n}}$, provided you simultaneously advance the
phase constant by 180°, since both operations change the sign of the wave function.
Problem 9.8. Equation 9.36 describes the most general linearly polarized wave on a string.
Linear (or “plane”) polarization (so called because the displacement is parallel to a fixed vector
$\hat{\mathbf{n}}$) results from the combination of horizontally and vertically polarized waves of the same phase
(Eq. 9.39). If the two components are of equal amplitude, but out of phase by 90° (say, $\delta_v = 0$,
$\delta_h = 90^\circ$), the result is a circularly polarized wave.


(a) At a fixed point z, show that the string moves in a circle about the z-axis. Does it go
clockwise or counterclockwise, as you look down the axis toward the origin? How would
you construct a wave circling the other way? (In optics, the clockwise case is called right
circular polarization, and the counterclockwise, left circular polarization.)⁵

(b) Sketch the string at time t = 0.

(c) How would you shake the string in order to produce a circularly polarized wave?

9.2 Electromagnetic Waves in Vacuum 


9.2.1 The Wave Equation for E and B 


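In regions of space where there is no charge or current, Maxwell’s equations
read

\[
\begin{aligned}
&\text{(i)}\;\; \nabla \cdot \mathbf{E} = 0, &\qquad &\text{(iii)}\;\; \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}, \\
&\text{(ii)}\;\; \nabla \cdot \mathbf{B} = 0, &\qquad &\text{(iv)}\;\; \nabla \times \mathbf{B} = \mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}.
\end{aligned}
\tag{9.40}
\]

They constitute a set of coupled, first-order, partial differential equations
for E and B. They can be decoupled by applying the curl to (iii) and (iv):

\[
\begin{aligned}
\nabla \times (\nabla \times \mathbf{E}) &= \nabla(\nabla \cdot \mathbf{E}) - \nabla^2 \mathbf{E} = \nabla \times \left(-\frac{\partial \mathbf{B}}{\partial t}\right) \\
&= -\frac{\partial}{\partial t}(\nabla \times \mathbf{B}) = -\mu_0 \epsilon_0 \frac{\partial^2 \mathbf{E}}{\partial t^2}, \\
\nabla \times (\nabla \times \mathbf{B}) &= \nabla(\nabla \cdot \mathbf{B}) - \nabla^2 \mathbf{B} = \nabla \times \left(\mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}\right) \\
&= \mu_0 \epsilon_0 \frac{\partial}{\partial t}(\nabla \times \mathbf{E}) = -\mu_0 \epsilon_0 \frac{\partial^2 \mathbf{B}}{\partial t^2},
\end{aligned}
\]

or, since $\nabla \cdot \mathbf{E} = 0$ and $\nabla \cdot \mathbf{B} = 0$,

\[ \nabla^2 \mathbf{E} = \mu_0 \epsilon_0 \frac{\partial^2 \mathbf{E}}{\partial t^2}, \qquad \nabla^2 \mathbf{B} = \mu_0 \epsilon_0 \frac{\partial^2 \mathbf{B}}{\partial t^2}. \tag{9.41} \]


5 An elegant notation for circular polarization (or elliptical, if the amplitudes are unequal) is
to use a complex $\hat{\mathbf{n}}$, but I shall not do so in this book.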
We now have separate equations for E and B, but they are of second order;
that’s the price you pay for decoupling them.

In vacuum, then, each Cartesian component of E and B satisfies the
three-dimensional wave equation,

\[ \nabla^2 f = \frac{1}{v^2}\,\frac{\partial^2 f}{\partial t^2}. \]

(This is the same as Eq. 9.2, except that $\partial^2 f/\partial z^2$ is replaced by its natural
generalization, $\nabla^2 f$.) So Maxwell’s equations imply that empty space supports
the propagation of electromagnetic waves, traveling at velocity

\[ v = \frac{1}{\sqrt{\epsilon_0 \mu_0}} = 3.00 \times 10^8\ \text{m/s}, \tag{9.42} \]

which happens to be precisely the speed of light, c. Evidently,

\[ c = \frac{1}{\sqrt{\epsilon_0 \mu_0}}. \tag{9.43} \]


The implication is astounding: perhaps light is an electromagnetic wave.⁶
Of course, this conclusion does not surprise anyone today, but imagine
what a revelation it must have been in Maxwell’s time! Remember how
$\epsilon_0$ and $\mu_0$ came into the theory in the first place: they were constants in
Coulomb’s law and the Biot–Savart law, respectively. You measure them in
experiments involving charged pith balls, batteries, and wires (experiments
having nothing whatever to do with light). And yet, according to Maxwell’s
theory, you can calculate c from these two numbers. Notice the crucial role
played by Maxwell’s contribution to Ampère’s law ($\mu_0 \epsilon_0\,\partial\mathbf{E}/\partial t$): without it,
the wave equation would not emerge, and there would be no electromagnetic
theory of light.
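You can repeat Maxwell's arithmetic in a couple of lines. The numerical values in this sketch are not taken from the text: it assumes the CODATA 2018 value of the permittivity of free space and the classic value 4π × 10⁻⁷ N/A² for the permeability (exact by definition before the 2019 SI revision, and still correct to better than a part in a billion).

```python
import math

# Assumed reference values (not quoted in the text):
EPSILON_0 = 8.8541878128e-12   # C^2/(N m^2), CODATA 2018
MU_0 = 4 * math.pi * 1e-7      # N/A^2, pre-2019 defined value

c = 1 / math.sqrt(EPSILON_0 * MU_0)   # Eq. 9.43
print(f"c = {c:.6e} m/s")
```

The result agrees with the measured speed of light, 2.99792458 × 10⁸ m/s, to within the precision of the inputs.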


9.2.2 Monochromatic Plane Waves 


For reasons discussed in Section 9.1.2, we may confine our attention to
sinusoidal waves of frequency $\omega$. Since different frequencies in the visible
range correspond to different colors, such waves are called monochromatic
(Table 9.1). Suppose, moreover, that the waves are traveling in the z direction
and have no x or y dependence; these are called plane waves,⁷ because
the fields are uniform over every plane perpendicular to the direction of
propagation (Fig. 9.9). We are interested, then, in fields of the form


6 As Maxwell himself put it, “We can scarcely avoid the inference that light consists in the 
transverse undulations of the same medium which is the cause of electric and magnetic 
phenomena.” See I. Tolstoy, James Clerk Maxwell, A Biography, University of Chicago 
Press, Chicago (1983). 

7 For a discussion of spherical waves, at this level, see J. R. Reitz, F. J. Milford, and R. W. 
Christy, Foundations of Electromagnetic Theory, 3rd ed., Addison-Wesley, Reading, MA 
(1979), Sect. 17-5. Or work Prob. 9.37. Of course, over small enough regions any wave is 
essentially plane, as long as the wavelength is much less than the radius of the curvature of 
the wave front. 
Table 9.1.

The Electromagnetic Spectrum

Frequency (Hz)    Type            Wavelength (m)
10²²                              10⁻¹³
10²¹              Gamma rays      10⁻¹²
10²⁰                              10⁻¹¹
10¹⁹                              10⁻¹⁰
10¹⁸              X rays          10⁻⁹
10¹⁷                              10⁻⁸
10¹⁶              Ultraviolet     10⁻⁷
10¹⁵              Visible         10⁻⁶
10¹⁴              Infrared        10⁻⁵
10¹³                              10⁻⁴
10¹²                              10⁻³
10¹¹                              10⁻²
10¹⁰              Microwave       10⁻¹
10⁹                               1
10⁸               TV, FM          10
10⁷                               10²
10⁶               AM              10³
10⁵                               10⁴
10⁴               RF              10⁵
10³                               10⁶

The Visible Range

Frequency (Hz)    Color                     Wavelength (m)
1.0 × 10¹⁵        Near ultraviolet          3.0 × 10⁻⁷
7.5 × 10¹⁴        Shortest visible blue     4.0 × 10⁻⁷
6.5 × 10¹⁴        Blue                      4.6 × 10⁻⁷
5.6 × 10¹⁴        Green                     5.4 × 10⁻⁷
5.1 × 10¹⁴        Yellow                    5.9 × 10⁻⁷
4.9 × 10¹⁴        Orange                    6.1 × 10⁻⁷
3.9 × 10¹⁴        Longest visible red       7.6 × 10⁻⁷
3.0 × 10¹⁴        Near infrared             1.0 × 10⁻⁶

\[ \tilde{\mathbf{E}}(z, t) = \tilde{\mathbf{E}}_0 e^{i(kz - \omega t)}, \qquad \tilde{\mathbf{B}}(z, t) = \tilde{\mathbf{B}}_0 e^{i(kz - \omega t)}, \tag{9.44} \]

where $\tilde{\mathbf{E}}_0$ and $\tilde{\mathbf{B}}_0$ are the (complex) amplitudes (the physical fields, of course,
are the real parts of $\tilde{\mathbf{E}}$ and $\tilde{\mathbf{B}}$), and $\omega = ck$.

Now, the wave equations for E and B (Eq. 9.41) were derived from
Maxwell’s equations. However, whereas every solution to Maxwell’s equations
(in empty space) must obey the wave equation, the converse is not true;
Maxwell’s equations impose extra constraints on $\tilde{\mathbf{E}}_0$ and $\tilde{\mathbf{B}}_0$. In particular,
since $\nabla \cdot \mathbf{E} = 0$ and $\nabla \cdot \mathbf{B} = 0$, it follows⁸ that

\[ (\tilde{E}_0)_z = (\tilde{B}_0)_z = 0. \tag{9.45} \]


8 Because the real part of $\tilde{\mathbf{E}}$ differs from the imaginary part only in the replacement of sine
by cosine, if the former obeys Maxwell’s equations, so does the latter, and hence $\tilde{\mathbf{E}}$ as well.


That is, electromagnetic waves are transverse: the electric and magnetic fields
are perpendicular to the direction of propagation. Moreover, Faraday’s
law, $\nabla \times \mathbf{E} = -\partial\mathbf{B}/\partial t$, implies a relation between the electric and magnetic
amplitudes, to wit,

\[ -k(\tilde{E}_0)_y = \omega(\tilde{B}_0)_x, \qquad k(\tilde{E}_0)_x = \omega(\tilde{B}_0)_y, \tag{9.46} \]

or, more compactly,

\[ \tilde{\mathbf{B}}_0 = \frac{k}{\omega}\,(\hat{\mathbf{z}} \times \tilde{\mathbf{E}}_0). \tag{9.47} \]

Evidently, E and B are mutually perpendicular, and perpendicular to the
direction of propagation (here, z), so they really have only one (nonzero)
component each; they are in phase, and their (real) amplitudes are related by

\[ B_0 = \frac{k}{\omega}\,E_0 = \frac{1}{c}\,E_0. \tag{9.48} \]

The fourth of Maxwell’s equations, $\nabla \times \mathbf{B} = \mu_0 \epsilon_0 (\partial\mathbf{E}/\partial t)$, does not yield an
independent condition; it simply reproduces Eq. 9.46.
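Example 9.2. If E points in the x direction, then B points in the
y direction (Eq. 9.47):

\[ \tilde{\mathbf{E}}(z, t) = \tilde{E}_0 e^{i(kz - \omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}(z, t) = \frac{1}{c}\,\tilde{E}_0 e^{i(kz - \omega t)}\,\hat{\mathbf{y}}, \]

or (taking the real part)

\[ \mathbf{E}(z, t) = E_0 \cos(kz - \omega t + \delta)\,\hat{\mathbf{x}}, \qquad \mathbf{B}(z, t) = \frac{1}{c}\,E_0 \cos(kz - \omega t + \delta)\,\hat{\mathbf{y}}. \tag{9.49} \]

This is the paradigm for a monochromatic plane wave (see Fig. 9.10).
The wave as a whole is said to be polarized in the x direction (by
convention, we use the direction of E to specify the polarization of
an electromagnetic wave).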
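There is nothing special about the z direction, of course; we can
easily generalize to monochromatic plane waves traveling in an arbitrary
direction. The notation is facilitated by the introduction of the propagation
(or wave) vector, $\mathbf{k}$, pointing in the direction of propagation, whose magnitude
is the wave number k. The scalar product $\mathbf{k} \cdot \mathbf{r}$ is the appropriate
generalization of kz (Fig. 9.11), so

\[
\begin{aligned}
\tilde{\mathbf{E}}(\mathbf{r}, t) &= \tilde{E}_0 e^{i(\mathbf{k} \cdot \mathbf{r} - \omega t)}\,\hat{\mathbf{n}}, \\
\tilde{\mathbf{B}}(\mathbf{r}, t) &= \frac{1}{c}\,\tilde{E}_0 e^{i(\mathbf{k} \cdot \mathbf{r} - \omega t)}\,(\hat{\mathbf{k}} \times \hat{\mathbf{n}}) = \frac{1}{c}\,\hat{\mathbf{k}} \times \tilde{\mathbf{E}},
\end{aligned}
\tag{9.50}
\]

where $\hat{\mathbf{n}}$ is the polarization vector. Because E is transverse,

\[ \hat{\mathbf{n}} \cdot \hat{\mathbf{k}} = 0. \tag{9.51} \]

(The transversality of B follows automatically from Eq. 9.50.) The actual
(real) electric and magnetic fields in a monochromatic plane wave with
propagation vector $\mathbf{k}$ and polarization $\hat{\mathbf{n}}$ are

\[ \mathbf{E}(\mathbf{r}, t) = E_0 \cos(\mathbf{k} \cdot \mathbf{r} - \omega t + \delta)\,\hat{\mathbf{n}}, \tag{9.52} \]

\[ \mathbf{B}(\mathbf{r}, t) = \frac{1}{c}\,E_0 \cos(\mathbf{k} \cdot \mathbf{r} - \omega t + \delta)\,(\hat{\mathbf{k}} \times \hat{\mathbf{n}}). \tag{9.53} \]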
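Equations 9.52 and 9.53 translate almost literally into code. The following sketch evaluates the real fields at one point and instant; the helper names (`dot`, `cross`, `plane_wave`), the wave vector, the amplitude, and the sample point are all illustrative assumptions, and the polarization is assumed to be supplied as a unit vector.

```python
import math

C = 2.99792458e8  # speed of light in vacuum, m/s


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def plane_wave(E0, k_vec, n_hat, r, t, delta=0.0):
    """Real E and B of a monochromatic plane wave in vacuum (Eqs. 9.52, 9.53)."""
    k = math.sqrt(dot(k_vec, k_vec))
    k_hat = tuple(x / k for x in k_vec)
    if abs(dot(k_hat, n_hat)) > 1e-12:
        raise ValueError("polarization must be perpendicular to k (Eq. 9.51)")
    amplitude = E0 * math.cos(dot(k_vec, r) - C * k * t + delta)  # omega = c k
    E = tuple(amplitude * x for x in n_hat)
    B = tuple(amplitude / C * x for x in cross(k_hat, n_hat))
    return E, B


k_vec = (0.0, 3.0, 4.0)     # rad/m, propagation in the yz plane
n_hat = (1.0, 0.0, 0.0)     # polarized along x
E, B = plane_wave(100.0, k_vec, n_hat, r=(0.2, 0.5, -0.1), t=1e-9)
print(E, B)
print(dot(E, B), dot(B, k_vec))   # both vanish: E, B, and k are mutually perpendicular
```

The ratio of the magnitudes of E and B comes out equal to c, as Eq. 9.48 requires.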


Problem 9.9. Write down the (real) electric and magnetic fields for a monochromatic plane
wave of amplitude $E_0$, frequency $\omega$, and phase angle zero that is (a) traveling in the negative
x direction and polarized in the z direction, and (b) traveling in the direction from the origin to
the point (1, 1, 1), with polarization parallel to the xz plane. In each case, sketch the wave,
and give the explicit Cartesian components of $\mathbf{k}$ and $\hat{\mathbf{n}}$.

9.2.3 Energy and Momentum in Electromagnetic Waves


According to Eq. 8.5, the energy per unit volume in electromagnetic fields is

$$u = \frac{1}{2}\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right). \tag{9.54}$$

In the case of a monochromatic plane wave (Eq. 9.49 or 9.53),⁹

$$B^2 = \frac{1}{c^2}E^2 = \mu_0\epsilon_0E^2, \tag{9.55}$$

so the electric and magnetic contributions are equal:

$$u = \epsilon_0E^2 = \epsilon_0E_0^2\cos^2(kz-\omega t+\delta). \tag{9.56}$$

As the wave travels, it carries this energy along with it. The energy flux
density (energy per unit area, per unit time) transported by the fields is given
by the Poynting vector (Eq. 8.10):

$$\mathbf{S} = \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}). \tag{9.57}$$

For monochromatic plane waves propagating in the z direction,

$$\mathbf{S} = c\epsilon_0E_0^2\cos^2(kz-\omega t+\delta)\,\hat{\mathbf{z}} = cu\,\hat{\mathbf{z}}. \tag{9.58}$$

Notice that S is the energy density (u) times the velocity of the waves ($c\,\hat{\mathbf{z}}$), as
it should be. For in a time $\Delta t$, a length $c\,\Delta t$ passes through area A (Fig. 9.12),
carrying with it an energy $uAc\,\Delta t$. The energy per unit time, per unit area,
transported by the wave is therefore uc.
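
The step from the Poynting vector to the plane-wave result $\mathbf{S} = cu\,\hat{\mathbf{z}}$ can be spelled out. This is only a sketch, using the real fields of Eq. 9.49 and the relation $c^2 = 1/(\mu_0\epsilon_0)$:

```latex
\begin{aligned}
\mathbf{S} &= \frac{1}{\mu_0}\,\mathbf{E}\times\mathbf{B}
  = \frac{1}{\mu_0 c}\,E_0^2\cos^2(kz-\omega t+\delta)\,(\hat{\mathbf{x}}\times\hat{\mathbf{y}}) \\
 &= c\,\epsilon_0 E_0^2\cos^2(kz-\omega t+\delta)\,\hat{\mathbf{z}}
  = c\,u\,\hat{\mathbf{z}},
 \qquad\text{since}\quad \frac{1}{\mu_0 c} = c\,\epsilon_0 .
\end{aligned}
```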

Electromagnetic fields not only carry energy, they also carry momentum.
In fact, we found in Eq. 8.29 that the momentum density stored in the
fields is

$$\mathbf{g} = \frac{1}{c^2}\mathbf{S}. \tag{9.59}$$


9 Notice that I’m using the real fields to calculate energy density. You might be tempted to
introduce

$$\tilde{u} = \frac{1}{2}\left(\epsilon_0\tilde{E}^2 + \frac{1}{\mu_0}\tilde{B}^2\right),$$

and take its real part, but

$$\mathrm{Re}(\tilde{u}) = \epsilon_0E_0^2\cos[2(\mathbf{k}\cdot\mathbf{r}-\omega t+\delta)] \neq u.$$

Why doesn’t this work?

For monochromatic plane waves, then,

$$\mathbf{g} = \frac{1}{c}\epsilon_0E_0^2\cos^2(kz-\omega t+\delta)\,\hat{\mathbf{z}} = \frac{1}{c}u\,\hat{\mathbf{z}}. \tag{9.60}$$


In the case of light, the wavelength is so short ($\sim 5\times10^{-7}$ m), and
the period so brief ($\sim 10^{-15}$ s), that any macroscopic measurement will
encompass many cycles. Typically, therefore, we’re not interested in the
rapidly fluctuating cosine-squared term in the energy and momentum
densities; all we want is the average value. Now, the average of cosine-squared
over a complete cycle¹⁰ is 1/2, so

$$\langle u\rangle = \frac{1}{2}\epsilon_0E_0^2, \tag{9.61}$$

$$\langle\mathbf{S}\rangle = \frac{1}{2}c\epsilon_0E_0^2\,\hat{\mathbf{z}}, \tag{9.62}$$

$$\langle\mathbf{g}\rangle = \frac{1}{2c}\epsilon_0E_0^2\,\hat{\mathbf{z}}. \tag{9.63}$$

I use brackets, $\langle\ \rangle$, to denote the (time) average over a complete cycle (or
many cycles, if you prefer). The average power per unit area transported by
an electromagnetic wave is called the intensity:

$$I \equiv \langle S\rangle = \frac{1}{2}c\epsilon_0E_0^2. \tag{9.64}$$


When light falls (at normal incidence) on a perfect absorber, it delivers
its momentum to the surface. In a time $\Delta t$, the momentum transfer is
(Fig. 9.12) $\Delta\mathbf{p} = \langle\mathbf{g}\rangle Ac\,\Delta t$, so the radiation pressure (average force per unit
area) is

$$P = \frac{1}{A}\frac{\Delta p}{\Delta t} = \frac{1}{2}\epsilon_0E_0^2 = \frac{I}{c}. \tag{9.65}$$

(On a perfect reflector the pressure is twice as great, because the momentum
switches direction, instead of simply being absorbed.) We can account for
this pressure qualitatively, as follows: The electric field (Fig. 9.10) drives
charges in the x direction, and the magnetic field then exerts on them a force
$q(\mathbf{v}\times\mathbf{B})$ in the z direction. The net force on all the charges in the surface
produces the pressure.¹¹
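
The averaging argument and Eqs. 9.64 and 9.65 are easy to check numerically. Here is a small Python sketch; the field amplitude of 100 V/m is an arbitrary illustrative value, and the function names are my own.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space (F/m)
C = 299_792_458.0        # speed of light in vacuum (m/s)


def time_average_cos2(samples=100_000):
    """Average cos^2 over one complete cycle by uniform sampling."""
    total = 0.0
    for i in range(samples):
        total += math.cos(2.0 * math.pi * i / samples) ** 2
    return total / samples


def intensity(E0):
    """Intensity of a plane wave in vacuum, Eq. 9.64."""
    return 0.5 * C * EPS0 * E0 ** 2


def radiation_pressure(E0, reflector=False):
    """Radiation pressure at normal incidence, Eq. 9.65 (doubled for a perfect reflector)."""
    pressure = 0.5 * EPS0 * E0 ** 2
    return 2.0 * pressure if reflector else pressure


E0 = 100.0  # V/m, hypothetical amplitude
print(time_average_cos2())            # numerically very close to 1/2
print(intensity(E0))                  # W/m^2
print(radiation_pressure(E0))         # N/m^2, equal to intensity / c
print(radiation_pressure(E0, True))   # twice the absorber value
```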


Problem 9.10. The intensity of sunlight hitting the Earth is about 1300 W/m². If sunlight
strikes a perfect absorber, what pressure does it exert? How about a perfect reflector? What 
fraction of atmospheric pressure does this amount to? 


10 There is a cute trick for doing this in your head: $\sin^2\theta + \cos^2\theta = 1$, and over a complete
cycle the average of $\sin^2\theta$ is equal to the average of $\cos^2\theta$, so $\langle\sin^2\rangle = \langle\cos^2\rangle = 1/2$.
Formally,

$$\frac{1}{T}\int_0^T \cos^2(kz - 2\pi t/T + \delta)\,dt = \frac{1}{2}.$$

11 Actually, it’s a little more subtle than this; see Prob. 9.11.

Problem 9.11. Consider a particle of charge q and mass m, free to move in the xy plane in
response to an electromagnetic wave propagating in the z direction (Eq. 9.49; might as well
set $\delta = 0$).


(a) Ignoring the magnetic force, find the velocity of the particle, as a function of time. (Assume
the average velocity is zero.)

(b) Now calculate the resulting magnetic force on the particle.

(c) Show that the (time) average magnetic force is zero.


The problem with this naïve model for the pressure of light is that the velocity is 90° out of
phase with the fields. For energy to be absorbed, there’s got to be some resistance to the motion
of the charges. Suppose we include a force of the form $-\gamma m\mathbf{v}$, for some damping constant $\gamma$.


(d) Repeat part (a) (ignore the exponentially damped transient). Repeat part (b), and find the
average magnetic force on the particle.¹²


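Problem 9.12. In the complex notation there is a clever device for finding the time average
of a product. Suppose $f(\mathbf{r},t) = A\cos(\mathbf{k}\cdot\mathbf{r}-\omega t+\delta_a)$ and $g(\mathbf{r},t) = B\cos(\mathbf{k}\cdot\mathbf{r}-\omega t+\delta_b)$.
Show that $\langle fg\rangle = (1/2)\,\mathrm{Re}(\tilde{f}\tilde{g}^*)$, where the star denotes complex conjugation.
[Note that this only works if the two waves have the same $\mathbf{k}$ and $\omega$, but they need not have the
same amplitude or phase.] For example,

$$\langle u\rangle = \frac{1}{4}\mathrm{Re}\left(\epsilon_0\tilde{\mathbf{E}}\cdot\tilde{\mathbf{E}}^* + \frac{1}{\mu_0}\tilde{\mathbf{B}}\cdot\tilde{\mathbf{B}}^*\right) \quad\text{and}\quad \langle\mathbf{S}\rangle = \frac{1}{2\mu_0}\mathrm{Re}(\tilde{\mathbf{E}}\times\tilde{\mathbf{B}}^*).$$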


Problem 9.13. Find all elements of the Maxwell stress tensor for a monochromatic plane
wave traveling in the z direction and linearly polarized in the x direction (Eq. 9.49). Does your
answer make sense? (Remember that $-\overleftrightarrow{\mathbf{T}}$ represents the momentum flux density.) How is the
momentum flux density related to the energy density, in this case?


9.3 Electromagnetic Waves in Matter 


9.3.1 Propagation in Linear Media 


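Inside matter, but in regions where there is no free charge or free current,
Maxwell’s equations become

$$\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{D} = 0, & \text{(ii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
\text{(iii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{H} = \frac{\partial\mathbf{D}}{\partial t}.
\end{aligned} \tag{9.66}$$

If the medium is linear,

$$\mathbf{D} = \epsilon\mathbf{E}, \qquad \mathbf{H} = \frac{1}{\mu}\mathbf{B}, \tag{9.67}$$

and homogeneous (so $\epsilon$ and $\mu$ do not vary from point to point), they
reduce to

$$\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = 0, & \text{(ii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
\text{(iii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{B} = \mu\epsilon\frac{\partial\mathbf{E}}{\partial t},
\end{aligned} \tag{9.68}$$

12 C. E. Mungan, Am. J. Phys. 77, 965 (2009). See also Prob. 9.36.
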

which differ from the vacuum analogs (Eqs. 9.40) only in the replacement of
$\mu_0\epsilon_0$ by $\mu\epsilon$.¹³ Evidently, electromagnetic waves propagate through a linear
homogeneous medium at a speed

$$v = \frac{1}{\sqrt{\epsilon\mu}} = \frac{c}{n}, \tag{9.69}$$

where

$$n \equiv \sqrt{\frac{\epsilon\mu}{\epsilon_0\mu_0}} \tag{9.70}$$

is the index of refraction of the substance. For most materials, $\mu$ is very close
to $\mu_0$, so

$$n \approx \sqrt{\epsilon_r}, \tag{9.71}$$

where $\epsilon_r$ is the dielectric constant¹⁴ (Eq. 4.34). Since $\epsilon_r$ is almost always
greater than 1, light travels more slowly through matter, a fact that is well
known from optics.
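
As a quick illustration of Eqs. 9.69 and 9.70, here is a short Python sketch. The dielectric constant 2.25 is a made-up, glass-like value chosen so that n comes out to 1.5; it is not taken from any table in the text.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space (F/m)
MU0 = 1.25663706212e-6   # permeability of free space (N/A^2)


def wave_speed(eps, mu):
    """Propagation speed in a linear homogeneous medium, Eq. 9.69."""
    return 1.0 / math.sqrt(eps * mu)


def index_of_refraction(eps, mu):
    """Index of refraction, Eq. 9.70."""
    return math.sqrt(eps * mu / (EPS0 * MU0))


eps_r = 2.25  # hypothetical dielectric constant at the frequency of interest
n = index_of_refraction(eps_r * EPS0, MU0)
v = wave_speed(eps_r * EPS0, MU0)
print(n)                          # sqrt(eps_r), as in Eq. 9.71
print(v, wave_speed(EPS0, MU0))   # v = c / n
```

Keep in mind the caveat of footnote 14: the value of $\epsilon_r$ must be the one appropriate to the frequency in question, not the static one.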

All of our previous results carry over, with the simple transcription $\epsilon_0 \to \epsilon$,
$\mu_0 \to \mu$, and hence $c \to v$. The energy density is¹⁵

$$u = \frac{1}{2}\left(\epsilon E^2 + \frac{1}{\mu}B^2\right), \tag{9.72}$$

and the Poynting vector is

$$\mathbf{S} = \frac{1}{\mu}(\mathbf{E}\times\mathbf{B}). \tag{9.73}$$

For monochromatic plane waves, the frequency and wave number are
related by $\omega = kv$ (Eq. 9.11), the amplitude of B is 1/v times the amplitude
of E (Eq. 9.48), and the intensity¹⁶ is


13 This observation is mathematically pretty trivial, but the physical implications are extraor- 
dinary: As the wave passes through, the fields busily polarize and magnetize all the 
molecules, and the resulting (oscillating) dipoles create their own electric and magnetic 
fields. These combine with the original fields in such a way as to create a single wave with 
the same frequency but a different speed. This remarkable conspiracy (known in optics as 
the Ewald—Oseen extinction theorem) is responsible for the phenomenon of transparency. 
It is a distinctly nontrivial consequence of linearity. For further discussion see M. B. James 
and D. J. Griffiths, Am. J. Phys. 60, 309 (1992); H. Fearn, D. F. V. James, and P. W. Milonni, 
Am. J. Phys. 64, 986 (1996); M. Mansuripur, Opt. Photonics News 9, 50 (1998). 

14 The dielectric constant is “constant” in the sense of being independent of the amplitude of
E, but it may well depend on the frequency, as we shall see. Thus, for example, if you quote
the (static) dielectric constant for water, from Table 4.2, you will conclude that the index of
refraction is 8.9, which is wildly off, for visible light (n = 1.33).

15 See Prob. 8.27; refer to Section 4.4.3 for the precise meaning of “energy density,” in the
context of linear media.

16 The momentum carried by an electromagnetic wave in matter is controversial. See, for
example, S. M. Barnett, Phys. Rev. Lett. 104, 070401 (2010).

$$I = \frac{1}{2}\epsilon vE_0^2. \tag{9.74}$$


The interesting question is this: What happens when a wave passes
from one transparent medium into another (air to water, say, or glass to
plastic)? As in the case of waves on a string, we expect to get a reflected
wave and a transmitted wave. The details depend on the exact nature of
the electrodynamic boundary conditions, which we derived in Chapter 7
(Eq. 7.74):

$$\begin{aligned}
\text{(i)}\quad & \epsilon_1E_1^{\perp} = \epsilon_2E_2^{\perp}, & \text{(iii)}\quad & \mathbf{E}_1^{\parallel} = \mathbf{E}_2^{\parallel},\\
\text{(ii)}\quad & B_1^{\perp} = B_2^{\perp}, & \text{(iv)}\quad & \frac{1}{\mu_1}\mathbf{B}_1^{\parallel} = \frac{1}{\mu_2}\mathbf{B}_2^{\parallel}.
\end{aligned} \tag{9.75}$$

These equations relate the electric and magnetic fields on either side of the
interface between two linear media. In the following sections, we use them
to deduce the laws governing reflection and refraction of electromagnetic
waves.

9.3.2 Reflection and Transmission at Normal Incidence 


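Suppose the xy plane forms the boundary between two linear media.
A plane wave of frequency $\omega$, traveling in the z direction and polarized
in the x direction, approaches the interface from the left (Fig. 9.13):

$$\tilde{\mathbf{E}}_I(z,t) = \tilde{E}_{0_I}e^{i(k_1z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_I(z,t) = \frac{1}{v_1}\tilde{E}_{0_I}e^{i(k_1z-\omega t)}\,\hat{\mathbf{y}}. \tag{9.76}$$

It gives rise to a reflected wave

$$\tilde{\mathbf{E}}_R(z,t) = \tilde{E}_{0_R}e^{i(-k_1z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_R(z,t) = -\frac{1}{v_1}\tilde{E}_{0_R}e^{i(-k_1z-\omega t)}\,\hat{\mathbf{y}}, \tag{9.77}$$

which travels back to the left in medium 1, and a transmitted wave

$$\tilde{\mathbf{E}}_T(z,t) = \tilde{E}_{0_T}e^{i(k_2z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_T(z,t) = \frac{1}{v_2}\tilde{E}_{0_T}e^{i(k_2z-\omega t)}\,\hat{\mathbf{y}}, \tag{9.78}$$

which continues on to the right in medium 2. Note the minus sign in $\tilde{\mathbf{B}}_R$, as
required by Eq. 9.50 (or, if you prefer, by the fact that the Poynting vector
aims in the direction of propagation).

[Figure 9.13: the fields $\mathbf{E}_I$, $\mathbf{B}_I$, $\mathbf{E}_R$, and $\mathbf{B}_R$ in medium 1 (wave speed $v_1$), to the left of the interface.]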

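At z = 0, the combined fields on the left, $\tilde{\mathbf{E}}_I + \tilde{\mathbf{E}}_R$ and $\tilde{\mathbf{B}}_I + \tilde{\mathbf{B}}_R$, must
join the fields on the right, $\tilde{\mathbf{E}}_T$ and $\tilde{\mathbf{B}}_T$, in accordance with the boundary
conditions (Eq. 9.75). In this case there are no components perpendicular
to the surface, so (i) and (ii) are trivial. However, (iii) requires that

$$\tilde{E}_{0_I} + \tilde{E}_{0_R} = \tilde{E}_{0_T}, \tag{9.79}$$

while (iv) says

$$\frac{1}{\mu_1}\left(\frac{1}{v_1}\tilde{E}_{0_I} - \frac{1}{v_1}\tilde{E}_{0_R}\right) = \frac{1}{\mu_2}\left(\frac{1}{v_2}\tilde{E}_{0_T}\right), \tag{9.80}$$

or

$$\tilde{E}_{0_I} - \tilde{E}_{0_R} = \beta\tilde{E}_{0_T}, \tag{9.81}$$

where

$$\beta \equiv \frac{\mu_1v_1}{\mu_2v_2} = \frac{\mu_1n_2}{\mu_2n_1}. \tag{9.82}$$

Equations 9.79 and 9.81 are easily solved for the outgoing amplitudes, in
terms of the incident amplitude:

$$\tilde{E}_{0_R} = \left(\frac{1-\beta}{1+\beta}\right)\tilde{E}_{0_I}, \qquad \tilde{E}_{0_T} = \left(\frac{2}{1+\beta}\right)\tilde{E}_{0_I}. \tag{9.83}$$

These results are strikingly similar to the ones for waves on a string.
Indeed, if the permeabilities $\mu$ are close to their values in vacuum (as,
remember, they are for most media), then $\beta = v_1/v_2$, and we have

$$\tilde{E}_{0_R} = \left(\frac{v_2-v_1}{v_2+v_1}\right)\tilde{E}_{0_I}, \qquad \tilde{E}_{0_T} = \left(\frac{2v_2}{v_2+v_1}\right)\tilde{E}_{0_I}, \tag{9.84}$$

which are identical to Eqs. 9.30. In that case, as before, the reflected wave is
in phase (right side up) if $v_2 > v_1$ and out of phase (upside down) if $v_2 < v_1$;
the real amplitudes are related by

$$E_{0_R} = \left|\frac{v_2-v_1}{v_2+v_1}\right|E_{0_I}, \qquad E_{0_T} = \left(\frac{2v_2}{v_2+v_1}\right)E_{0_I}, \tag{9.85}$$

or, in terms of the indices of refraction,

$$E_{0_R} = \left|\frac{n_1-n_2}{n_1+n_2}\right|E_{0_I}, \qquad E_{0_T} = \left(\frac{2n_1}{n_1+n_2}\right)E_{0_I}. \tag{9.86}$$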


What fraction of the incident energy is reflected, and what fraction is
transmitted? According to Eq. 9.74, the intensity (average power per unit
area) is

$$I = \frac{1}{2}\epsilon vE_0^2.$$

If (again) $\mu_1 = \mu_2 = \mu_0$, then the ratio of the reflected intensity to the
incident intensity is

$$R \equiv \frac{I_R}{I_I} = \left(\frac{E_{0_R}}{E_{0_I}}\right)^2 = \left(\frac{n_1-n_2}{n_1+n_2}\right)^2, \tag{9.87}$$

whereas the ratio of the transmitted intensity to the incident intensity is

$$T \equiv \frac{I_T}{I_I} = \frac{\epsilon_2v_2}{\epsilon_1v_1}\left(\frac{E_{0_T}}{E_{0_I}}\right)^2 = \frac{4n_1n_2}{(n_1+n_2)^2}. \tag{9.88}$$

Here, R is the reflection coefficient and T the transmission coefficient; they
tell you the fraction of the incident energy that is reflected and transmitted,
respectively. Notice that

$$R + T = 1, \tag{9.89}$$

as conservation of energy, of course, requires. For instance, when light
passes from air ($n_1 = 1$) into glass ($n_2 = 1.5$), R = 0.04 and T = 0.96.
No surprise: most of the light is transmitted.
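
The air-to-glass numbers are easy to reproduce. The sketch below, in Python, assumes $\mu_1 = \mu_2 = \mu_0$ (so that $\beta = n_2/n_1$, and $\epsilon_2v_2/\epsilon_1v_1 = n_2/n_1$ as well); the function name is an illustrative choice.

```python
def normal_incidence(n1, n2):
    """Amplitude ratios and energy coefficients at normal incidence.

    Assumes mu1 = mu2 = mu0, so beta = v1/v2 = n2/n1.
    Returns (E0R/E0I, E0T/E0I, R, T).
    """
    beta = n2 / n1
    r = (1.0 - beta) / (1.0 + beta)   # Eq. 9.83; negative means 180 degrees out of phase
    t = 2.0 / (1.0 + beta)            # Eq. 9.83
    R = r ** 2                        # Eq. 9.87
    T = beta * t ** 2                 # Eq. 9.88, with eps2 v2 / (eps1 v1) = n2 / n1
    return r, t, R, T


r, t, R, T = normal_incidence(1.0, 1.5)   # air into glass
print(r, t)      # the reflected wave is inverted, since v2 < v1
print(R, T)      # approximately 0.04 and 0.96
print(R + T)     # 1, up to rounding (Eq. 9.89)
```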


Problem 9.14. Calculate the exact reflection and transmission coefficients, without assuming
$\mu_1 = \mu_2 = \mu_0$. Confirm that $R + T = 1$.


Problem 9.15. In writing Eqs. 9.77 and 9.78, I tacitly assumed that the reflected and transmitted
waves have the same polarization as the incident wave (along the x direction). Prove that
this must be so. [Hint: Let the polarization vectors of the transmitted and reflected waves be

$$\hat{\mathbf{n}}_T = \cos\theta_T\,\hat{\mathbf{x}} + \sin\theta_T\,\hat{\mathbf{y}}, \qquad \hat{\mathbf{n}}_R = \cos\theta_R\,\hat{\mathbf{x}} + \sin\theta_R\,\hat{\mathbf{y}},$$

and prove from the boundary conditions that $\theta_T = \theta_R = 0$.]


9.3.3 Reflection and Transmission at Oblique Incidence 


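In the previous section, I treated reflection and transmission at normal
incidence, that is, when the incoming wave hits the interface head-on.
We now turn to the more general case of oblique incidence, in which the
incoming wave meets the boundary at an arbitrary angle $\theta_I$ (Fig. 9.14). Of
course, normal incidence is really just a special case of oblique incidence,
with $\theta_I = 0$, but I wanted to treat it separately, as a kind of warm-up,
because the algebra is now going to get a little heavy.

Suppose, then, that a monochromatic plane wave

$$\tilde{\mathbf{E}}_I(\mathbf{r},t) = \tilde{\mathbf{E}}_{0_I}e^{i(\mathbf{k}_I\cdot\mathbf{r}-\omega t)}, \qquad \tilde{\mathbf{B}}_I(\mathbf{r},t) = \frac{1}{v_1}(\hat{\mathbf{k}}_I\times\tilde{\mathbf{E}}_I) \tag{9.90}$$

approaches from the left, giving rise to a reflected wave,

$$\tilde{\mathbf{E}}_R(\mathbf{r},t) = \tilde{\mathbf{E}}_{0_R}e^{i(\mathbf{k}_R\cdot\mathbf{r}-\omega t)}, \qquad \tilde{\mathbf{B}}_R(\mathbf{r},t) = \frac{1}{v_1}(\hat{\mathbf{k}}_R\times\tilde{\mathbf{E}}_R), \tag{9.91}$$

and a transmitted wave

$$\tilde{\mathbf{E}}_T(\mathbf{r},t) = \tilde{\mathbf{E}}_{0_T}e^{i(\mathbf{k}_T\cdot\mathbf{r}-\omega t)}, \qquad \tilde{\mathbf{B}}_T(\mathbf{r},t) = \frac{1}{v_2}(\hat{\mathbf{k}}_T\times\tilde{\mathbf{E}}_T). \tag{9.92}$$

[Figure 9.14: the incident, reflected, and transmitted wave vectors $\mathbf{k}_I$, $\mathbf{k}_R$, $\mathbf{k}_T$ in the plane of incidence.]

All three waves have the same frequency $\omega$, which is determined once and
for all at the source (the flashlight, or whatever, that produces the incident
beam).¹⁷ The three wave numbers are related by Eq. 9.11:

$$k_Iv_1 = k_Rv_1 = k_Tv_2 = \omega, \quad\text{or}\quad k_I = k_R = \frac{v_2}{v_1}k_T = \frac{n_1}{n_2}k_T. \tag{9.93}$$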
val ni 


The combined fields in medium 1, $\tilde{\mathbf{E}}_I + \tilde{\mathbf{E}}_R$ and $\tilde{\mathbf{B}}_I + \tilde{\mathbf{B}}_R$, must now be
joined to the fields $\tilde{\mathbf{E}}_T$ and $\tilde{\mathbf{B}}_T$ in medium 2, using the boundary conditions
(Eq. 9.75). These all share the generic structure

$$(\ \ )\,e^{i(\mathbf{k}_I\cdot\mathbf{r}-\omega t)} + (\ \ )\,e^{i(\mathbf{k}_R\cdot\mathbf{r}-\omega t)} = (\ \ )\,e^{i(\mathbf{k}_T\cdot\mathbf{r}-\omega t)}, \quad\text{at } z = 0. \tag{9.94}$$

I’ll fill in the parentheses in a moment; for now, the important thing to
notice is that the x, y, and t dependence is confined to the exponents.
Because the boundary conditions must hold at all points on the plane, and for
all times, these exponential factors must be equal (when z = 0). Otherwise,
a slight change in x, say, would destroy the equality (see Prob. 9.16). Of
course, the time factors are already equal (in fact, you could regard this as
a confirmation that the transmitted and reflected frequencies must match
the incident one). As for the spatial terms, evidently

$$\mathbf{k}_I\cdot\mathbf{r} = \mathbf{k}_R\cdot\mathbf{r} = \mathbf{k}_T\cdot\mathbf{r} \quad\text{when } z = 0, \tag{9.95}$$

or, more explicitly,

$$x(k_I)_x + y(k_I)_y = x(k_R)_x + y(k_R)_y = x(k_T)_x + y(k_T)_y \tag{9.96}$$

for all x and all y.


17 Nonlinear (“active”) media can change the frequency, but we are talking only about linear
media.

But Eq. 9.96 can only hold if the components are separately equal, for
if x = 0, we get

$$(k_I)_y = (k_R)_y = (k_T)_y, \tag{9.97}$$

while y = 0 gives

$$(k_I)_x = (k_R)_x = (k_T)_x. \tag{9.98}$$

We may as well orient our axes so that $\mathbf{k}_I$ lies in the xz plane (that is,
$(k_I)_y = 0$); according to Eq. 9.97, so too will $\mathbf{k}_R$ and $\mathbf{k}_T$. Conclusion:


First Law. The incident, reflected, and transmitted wave vectors
form a plane (called the plane of incidence), which also includes the
normal to the surface (here, the z-axis).


Meanwhile, Eq. 9.98 implies that

$$k_I\sin\theta_I = k_R\sin\theta_R = k_T\sin\theta_T, \tag{9.99}$$

where $\theta_I$ is the angle of incidence, $\theta_R$ is the angle of reflection, and $\theta_T$ is the
angle of transmission (more commonly known as the angle of refraction),
all of them measured with respect to the normal (Fig. 9.14). In view of
Eq. 9.93, then,


Second Law. The angle of incidence is equal to the angle of
reflection,

$$\theta_I = \theta_R. \tag{9.100}$$

This is the law of reflection. As for the transmitted angle,


Third Law.

$$\frac{\sin\theta_T}{\sin\theta_I} = \frac{n_1}{n_2}. \tag{9.101}$$

This is the law of refraction: Snell’s law.

These are the three fundamental laws of geometrical optics. It is remark- 
able how little actual electrodynamics went into them: we have yet to invoke 
any specific boundary conditions — all we used was their generic form 
(Eq. 9.94). Therefore, any other waves (water waves, for instance, or sound 
waves) can be expected to obey the same “optical” laws when they pass from 
one medium into another. 
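
The second and third laws are simple enough to put into a few lines of Python. This is only a sketch; the function name is mine, and returning None when $(n_1/n_2)\sin\theta_I$ exceeds 1 is my own convention for a case the discussion above does not treat (no real angle $\theta_T$ satisfies Eq. 9.101 there).

```python
import math


def refraction_angle(theta_i, n1, n2):
    """Angle of transmission (radians) from Snell's law, Eq. 9.101.

    theta_i is the angle of incidence in radians, measured from the normal.
    Returns None when (n1/n2) sin(theta_i) > 1, so no real angle exists.
    """
    s = (n1 / n2) * math.sin(theta_i)
    if abs(s) > 1.0:
        return None
    return math.asin(s)


theta_i = math.radians(30.0)
theta_r = theta_i                                 # law of reflection, Eq. 9.100
theta_t = refraction_angle(theta_i, 1.0, 1.5)     # air into glass
print(math.degrees(theta_r), math.degrees(theta_t))
```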

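Now that we have taken care of the exponential factors (they cancel,
given Eq. 9.95), the boundary conditions (Eq. 9.75) become

$$\begin{aligned}
\text{(i)}\quad & \epsilon_1(\tilde{\mathbf{E}}_{0_I} + \tilde{\mathbf{E}}_{0_R})_z = \epsilon_2(\tilde{\mathbf{E}}_{0_T})_z,\\
\text{(ii)}\quad & (\tilde{\mathbf{B}}_{0_I} + \tilde{\mathbf{B}}_{0_R})_z = (\tilde{\mathbf{B}}_{0_T})_z,\\
\text{(iii)}\quad & (\tilde{\mathbf{E}}_{0_I} + \tilde{\mathbf{E}}_{0_R})_{x,y} = (\tilde{\mathbf{E}}_{0_T})_{x,y},\\
\text{(iv)}\quad & \frac{1}{\mu_1}(\tilde{\mathbf{B}}_{0_I} + \tilde{\mathbf{B}}_{0_R})_{x,y} = \frac{1}{\mu_2}(\tilde{\mathbf{B}}_{0_T})_{x,y},
\end{aligned} \tag{9.102}$$

where $\tilde{\mathbf{B}}_0 = (1/v)\,\hat{\mathbf{k}}\times\tilde{\mathbf{E}}_0$ in each case. (The last two represent pairs of
equations, one for the x-component and one for the y-component.)

Suppose the polarization of the incident wave is parallel to the plane of
incidence (the xz plane); it follows (see Prob. 9.15) that the reflected and
transmitted waves are also polarized in this plane (Fig. 9.15). (I shall leave
it for you to analyze the case of polarization perpendicular to the plane of
incidence; see Prob. 9.17.) Then (i) reads

$$\epsilon_1(-\tilde{E}_{0_I}\sin\theta_I + \tilde{E}_{0_R}\sin\theta_R) = \epsilon_2(-\tilde{E}_{0_T}\sin\theta_T). \tag{9.103}$$

Item (ii) adds nothing (0 = 0), since the magnetic fields have no z components;
(iii) becomes

$$\tilde{E}_{0_I}\cos\theta_I + \tilde{E}_{0_R}\cos\theta_R = \tilde{E}_{0_T}\cos\theta_T, \tag{9.104}$$

and (iv) says

$$\frac{1}{\mu_1v_1}(\tilde{E}_{0_I} - \tilde{E}_{0_R}) = \frac{1}{\mu_2v_2}\tilde{E}_{0_T}. \tag{9.105}$$

Given the laws of reflection and refraction, Eqs. 9.103 and 9.105 both
reduce to

$$\tilde{E}_{0_I} - \tilde{E}_{0_R} = \beta\tilde{E}_{0_T}, \tag{9.106}$$

where (as before)

$$\beta \equiv \frac{\mu_1v_1}{\mu_2v_2} = \frac{\mu_1n_2}{\mu_2n_1}, \tag{9.107}$$

and Eq. 9.104 says

$$\tilde{E}_{0_I} + \tilde{E}_{0_R} = \alpha\tilde{E}_{0_T}, \tag{9.108}$$

where

$$\alpha \equiv \frac{\cos\theta_T}{\cos\theta_I}. \tag{9.109}$$

Solving Eqs. 9.106 and 9.108 for the reflected and transmitted amplitudes,
we obtain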


$$\tilde{E}_{0_R} = \left(\frac{\alpha-\beta}{\alpha+\beta}\right)\tilde{E}_{0_I}, \qquad \tilde{E}_{0_T} = \left(\frac{2}{\alpha+\beta}\right)\tilde{E}_{0_I}. \tag{9.110}$$

These are known as Fresnel’s equations, for the case of polarization in
the plane of incidence. (There are two other Fresnel equations, giving the
reflected and transmitted amplitudes when the polarization is perpendicular
to the plane of incidence; see Prob. 9.17.) Notice that the transmitted wave
is always in phase with the incident one; the reflected wave is either in phase
(“right side up”), if $\alpha > \beta$, or 180° out of phase (“upside down”), if $\alpha < \beta$.¹⁸

The amplitudes of the transmitted and reflected waves depend on the
angle of incidence, because $\alpha$ is a function of $\theta_I$:

$$\alpha = \frac{\sqrt{1-\sin^2\theta_T}}{\cos\theta_I} = \frac{\sqrt{1-[(n_1/n_2)\sin\theta_I]^2}}{\cos\theta_I}. \tag{9.111}$$

In the case of normal incidence ($\theta_I = 0$), $\alpha = 1$, and we recover Eq. 9.83.
At grazing incidence ($\theta_I = 90°$), $\alpha$ diverges, and the wave is totally reflected
(a fact that is painfully familiar to anyone who has driven at night on a
wet road). Interestingly, there is an intermediate angle, $\theta_B$ (called Brewster’s
angle), at which the reflected wave is completely extinguished.¹⁹ According
to Eq. 9.110, this occurs when $\alpha = \beta$, or

$$\sin^2\theta_B = \frac{1-\beta^2}{(n_1/n_2)^2-\beta^2}. \tag{9.112}$$

For the typical case $\mu_1 \approx \mu_2$, so $\beta \approx n_2/n_1$, $\sin^2\theta_B \approx \beta^2/(1+\beta^2)$, and hence

$$\tan\theta_B \approx \frac{n_2}{n_1}. \tag{9.113}$$

Figure 9.16 shows a plot of the transmitted and reflected amplitudes as
functions of $\theta_I$, for light incident on glass ($n_2 = 1.5$) from air ($n_1 = 1$).
(On the graph, a negative number indicates that the wave is 180° out
of phase with the incident beam; the amplitude itself is the absolute
value.)
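
The curves of Fig. 9.16 can be generated from Eqs. 9.110 and 9.111. The following Python sketch assumes $\mu_1 = \mu_2$ (so $\beta = n_2/n_1$) and an angle of incidence strictly less than 90°; the function names are illustrative.

```python
import math


def fresnel_parallel(theta_i, n1, n2):
    """Amplitude ratios for polarization in the plane of incidence.

    Assumes mu1 = mu2, so beta = n2/n1, and 0 <= theta_i < pi/2 (radians).
    Returns (alpha, beta, E0R/E0I, E0T/E0I).
    """
    beta = n2 / n1
    sin_t = (n1 / n2) * math.sin(theta_i)                      # Snell's law
    alpha = math.sqrt(1.0 - sin_t ** 2) / math.cos(theta_i)    # Eq. 9.111
    r = (alpha - beta) / (alpha + beta)                        # Eq. 9.110
    t = 2.0 / (alpha + beta)                                   # Eq. 9.110
    return alpha, beta, r, t


def brewster_angle(n1, n2):
    """Brewster's angle in radians, Eq. 9.113 (valid when mu1 = mu2)."""
    return math.atan(n2 / n1)


theta_b = brewster_angle(1.0, 1.5)              # air into glass
print(math.degrees(theta_b))                    # about 56.3 degrees
print(fresnel_parallel(0.0, 1.0, 1.5))          # normal incidence: alpha = 1
print(fresnel_parallel(theta_b, 1.0, 1.5)[2])   # reflected amplitude vanishes (to rounding)
```

A negative reflected ratio (as at normal incidence here) corresponds to the "upside down" case, $\alpha < \beta$.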

The power per unit area striking the interface is $\mathbf{S}\cdot\hat{\mathbf{z}}$. Thus the incident
intensity is

$$I_I = \frac{1}{2}\epsilon_1v_1E_{0_I}^2\cos\theta_I, \tag{9.114}$$

while the reflected and transmitted intensities are

$$I_R = \frac{1}{2}\epsilon_1v_1E_{0_R}^2\cos\theta_R \quad\text{and}\quad I_T = \frac{1}{2}\epsilon_2v_2E_{0_T}^2\cos\theta_T. \tag{9.115}$$

(The cosines are there because I am talking about the average power per
unit area of interface, and the interface is at an angle to the wave front.)


18 There is an unavoidable ambiguity in the phase of the reflected wave, since (as I mentioned
in footnote 4) changing the sign of the polarization vector is equivalent to a 180° phase
shift. The convention I adopted in Fig. 9.15, with $\mathbf{E}_R$ positive “upward,” is consistent with
some, but not all, of the standard optics texts.

19 Because waves polarized perpendicular to the plane of incidence exhibit no corresponding
quenching of the reflected component, an arbitrary beam incident at Brewster’s angle yields
a reflected beam that is totally polarized parallel to the interface. That’s why Polaroid
glasses, with the transmission axis vertical, help to reduce glare off a horizontal surface.

[Figure: horizontal axis is the angle of incidence $\theta_I$, from 0° to 90°.]


The reflection and transmission coefficients for waves polarized parallel to
the plane of incidence are

$$R \equiv \frac{I_R}{I_I} = \left(\frac{E_{0_R}}{E_{0_I}}\right)^2 = \left(\frac{\alpha-\beta}{\alpha+\beta}\right)^2, \tag{9.116}$$

$$T \equiv \frac{I_T}{I_I} = \frac{\epsilon_2v_2}{\epsilon_1v_1}\left(\frac{E_{0_T}}{E_{0_I}}\right)^2\frac{\cos\theta_T}{\cos\theta_I} = \alpha\beta\left(\frac{2}{\alpha+\beta}\right)^2. \tag{9.117}$$

They are plotted as functions of the angle of incidence in Fig. 9.17 (for
the air/glass interface). Physically, R is the fraction of the incident energy
that is reflected (naturally, it goes to 0 at Brewster’s angle); T is the
fraction transmitted (it goes to 1 at $\theta_B$). Note that R + T = 1, as required by
conservation of energy: the energy per unit time reaching a particular patch
of area on the surface is equal to the energy per unit time leaving the patch
(see Prob. 9.43):

$$I_I = I_R + I_T. \tag{9.118}$$
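
Energy conservation at oblique incidence can be spot-checked numerically. The Python sketch below again assumes $\mu_1 = \mu_2$ (so $\beta = n_2/n_1$); it evaluates Eqs. 9.116 and 9.117 over a range of angles for the air/glass interface.

```python
import math


def power_coefficients_parallel(theta_i, n1, n2):
    """R and T (Eqs. 9.116 and 9.117) for polarization in the plane of incidence.

    Assumes mu1 = mu2, so beta = n2/n1, and 0 <= theta_i < pi/2 (radians).
    """
    beta = n2 / n1
    sin_t = (n1 / n2) * math.sin(theta_i)
    alpha = math.sqrt(1.0 - sin_t ** 2) / math.cos(theta_i)   # Eq. 9.111
    R = ((alpha - beta) / (alpha + beta)) ** 2
    T = alpha * beta * (2.0 / (alpha + beta)) ** 2
    return R, T


for degrees in range(0, 90, 15):
    R, T = power_coefficients_parallel(math.radians(degrees), 1.0, 1.5)
    print(degrees, R, T, R + T)   # the last column stays at 1, up to rounding
```

Algebraically this is just $(\alpha-\beta)^2 + 4\alpha\beta = (\alpha+\beta)^2$.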


Problem 9.16. Suppose $Ae^{iax} + Be^{ibx} = Ce^{icx}$ for some nonzero constants A, B, C, a,
b, c, and for all x. Prove that a = b = c and A + B = C.

Problem 9.17. Analyze the case of polarization perpendicular to the plane of incidence (that
is, electric fields in the y direction, in Fig. 9.15). Impose the boundary conditions (Eq. 9.102),
and obtain the Fresnel equations for $\tilde{E}_{0_R}$ and $\tilde{E}_{0_T}$. Sketch $(\tilde{E}_{0_R}/\tilde{E}_{0_I})$ and $(\tilde{E}_{0_T}/\tilde{E}_{0_I})$ as
functions of $\theta_I$, for the case $\beta = n_2/n_1 = 1.5$. (Note that for this $\beta$ the reflected wave is
always 180° out of phase.) Show that there is no Brewster’s angle for any $n_1$ and $n_2$: $\tilde{E}_{0_R}$
is never zero (unless, of course, $n_1 = n_2$ and $\mu_1 = \mu_2$, in which case the two media are
optically indistinguishable). Confirm that your Fresnel equations reduce to the proper forms at
normal incidence. Compute the reflection and transmission coefficients, and check that they add
up to 1.


Problem 9.18. The index of refraction of diamond is 2.42. Construct the graph analogous to
Fig. 9.16 for the air/diamond interface. (Assume $\mu_1 = \mu_2 = \mu_0$.) In particular, calculate (a)
the amplitudes at normal incidence, (b) Brewster’s angle, and (c) the “crossover” angle, at which
the reflected and transmitted amplitudes are equal.


Problem 9.19. In the configuration of Fig. 9.15, is it the case that

$$\mathbf{S}_I = \mathbf{S}_T + \mathbf{S}_R\,?$$


9.4 Absorption and Dispersion 


9.4.1 Electromagnetic Waves in Conductors 


In Section 9.3 I stipulated that the free charge density $\rho_f$ and the free
current density $\mathbf{J}_f$ are zero, and everything that followed was predicated
on that assumption. Such a restriction is perfectly reasonable when you’re
talking about wave propagation through a vacuum or through insulating
materials such as glass or (pure) water. But in the case of conductors we do
not independently control the flow of charge, and in general $\mathbf{J}_f$ is certainly
not zero. In fact, according to Ohm’s law, the (free) current density in a
conductor is proportional to the electric field:

$$\mathbf{J}_f = \sigma\mathbf{E}. \tag{9.119}$$

With this, Maxwell’s equations for linear media assume the form

$$\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = \frac{1}{\epsilon}\rho_f, & \text{(ii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
\text{(iii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{B} = \mu\sigma\mathbf{E} + \mu\epsilon\frac{\partial\mathbf{E}}{\partial t}.
\end{aligned} \tag{9.120}$$

Now, the continuity equation for free charge,


$$\nabla\cdot\mathbf{J}_f = -\frac{\partial\rho_f}{\partial t}, \tag{9.121}$$

together with Ohm’s law and Gauss’s law (i), gives

$$\frac{\partial\rho_f}{\partial t} = -\sigma(\nabla\cdot\mathbf{E}) = -\frac{\sigma}{\epsilon}\rho_f$$

for a homogeneous linear medium, from which it follows that

$$\rho_f(t) = e^{-(\sigma/\epsilon)t}\rho_f(0). \tag{9.122}$$

Thus any initial free charge $\rho_f(0)$ dissipates in a characteristic time $\tau \equiv \epsilon/\sigma$.
This reflects the familiar fact that if you put some free charge on a
conductor, it will flow out to the edges. The time constant $\tau$ affords a
measure of how “good” a conductor is: for a “perfect” conductor, $\sigma = \infty$
and $\tau = 0$; for a “good” conductor, $\tau$ is much less than the other relevant
times in the problem (in oscillatory systems, that means $\tau \ll 1/\omega$); for a
“poor” conductor, $\tau$ is greater than the characteristic times in the problem
($\tau \gg 1/\omega$).²⁰ But we’re not interested in this transient behavior; we’ll wait
for any accumulated free charge to disappear. From then on, $\rho_f = 0$, and
we have

$$\begin{aligned}
\text{(i)}\quad & \nabla\cdot\mathbf{E} = 0, & \text{(ii)}\quad & \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
\text{(iii)}\quad & \nabla\cdot\mathbf{B} = 0, & \text{(iv)}\quad & \nabla\times\mathbf{B} = \mu\epsilon\frac{\partial\mathbf{E}}{\partial t} + \mu\sigma\mathbf{E}.
\end{aligned} \tag{9.123}$$

These differ from the corresponding equations for nonconducting media
(Eq. 9.68) only in the last term in (iv), which is absent, obviously,
when $\sigma = 0$.
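
To get a feel for the time constant $\tau = \epsilon/\sigma$, here is a short Python sketch. The conductivity $6\times10^7\ (\Omega\cdot\text{m})^{-1}$ is an assumed round value for a copper-like metal with $\epsilon \approx \epsilon_0$, not a number quoted in this section; it gives $\tau$ of order $10^{-19}$ s, in line with the figure mentioned in footnote 20 (along with the caveat there that Ohm’s law itself fails on such short timescales).

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space (F/m)


def relaxation_time(eps, sigma):
    """Characteristic time tau = eps / sigma for free charge to dissipate."""
    return eps / sigma


def free_charge_density(rho0, t, eps, sigma):
    """Free charge density at time t, Eq. 9.122."""
    return rho0 * math.exp(-(sigma / eps) * t)


def omega_tau(eps, sigma, omega):
    """Dimensionless product omega * tau: << 1 for a good conductor, >> 1 for a poor one."""
    return omega * relaxation_time(eps, sigma)


sigma = 6.0e7  # (ohm m)^-1, assumed copper-like conductivity
tau = relaxation_time(EPS0, sigma)
print(tau)                                            # of order 1e-19 s
print(free_charge_density(1.0, tau, EPS0, sigma))     # 1/e of the initial density
print(omega_tau(EPS0, sigma, 2.0 * math.pi * 1.0e6))  # tiny at 1 MHz: a good conductor
```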

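Applying the curl to (ii) and (iv), as before, we obtain modified wave
equations for E and B:

$$\nabla^2\mathbf{E} = \mu\epsilon\frac{\partial^2\mathbf{E}}{\partial t^2} + \mu\sigma\frac{\partial\mathbf{E}}{\partial t}, \qquad \nabla^2\mathbf{B} = \mu\epsilon\frac{\partial^2\mathbf{B}}{\partial t^2} + \mu\sigma\frac{\partial\mathbf{B}}{\partial t}. \tag{9.124}$$

These equations still admit plane-wave solutions,

$$\tilde{\mathbf{E}}(z,t) = \tilde{\mathbf{E}}_0e^{i(\tilde{k}z-\omega t)}, \qquad \tilde{\mathbf{B}}(z,t) = \tilde{\mathbf{B}}_0e^{i(\tilde{k}z-\omega t)}, \tag{9.125}$$

but this time the “wave number” $\tilde{k}$ is complex:

$$\tilde{k}^2 = \mu\epsilon\omega^2 + i\mu\sigma\omega, \tag{9.126}$$

as you can easily check by plugging Eq. 9.125 into Eq. 9.124. Taking the
square root,

$$\tilde{k} = k + i\kappa, \tag{9.127}$$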


20 N. Ashby, Am. J. Phys. 43, 553 (1975), points out that for good conductors $\tau$ is absurdly
short ($10^{-19}$ s for copper, whereas the time between collisions is $\tau_c = 10^{-14}$ s). The problem
is that Ohm’s law itself breaks down on timescales shorter than $\tau_c$; actually, the time it takes
free charge to dissipate in a good conductor is of order $\tau_c$, not $\tau$. Moreover, H. C. Ohanian,
Am. J. Phys. 51, 1020 (1983), shows that it takes even longer for the fields and currents to
equilibrate. But none of this is relevant to our present purpose; the net free charge density
in a conductor does quickly dissipate, and exactly how long the process takes is beside the
point.

where

$$k \equiv \omega\sqrt{\frac{\epsilon\mu}{2}}\left[\sqrt{1+\left(\frac{\sigma}{\epsilon\omega}\right)^2}+1\right]^{1/2}, \qquad \kappa \equiv \omega\sqrt{\frac{\epsilon\mu}{2}}\left[\sqrt{1+\left(\frac{\sigma}{\epsilon\omega}\right)^2}-1\right]^{1/2}. \tag{9.128}$$

The imaginary part of $\tilde{k}$ results in an attenuation of the wave (decreasing
amplitude with increasing z):

$$\tilde{\mathbf{E}}(z,t) = \tilde{\mathbf{E}}_0e^{-\kappa z}e^{i(kz-\omega t)}, \qquad \tilde{\mathbf{B}}(z,t) = \tilde{\mathbf{B}}_0e^{-\kappa z}e^{i(kz-\omega t)}. \tag{9.129}$$

The distance it takes to reduce the amplitude by a factor of 1/e (about a
third) is called the skin depth:

$$d \equiv \frac{1}{\kappa}; \tag{9.130}$$

it is a measure of how far the wave penetrates into the conductor. Meanwhile,
the real part of $\tilde{k}$ determines the wavelength, the propagation speed,
and the index of refraction, in the usual way:

$$\lambda = \frac{2\pi}{k}, \qquad v = \frac{\omega}{k}, \qquad n = \frac{ck}{\omega}. \tag{9.131}$$

The attenuated plane waves (Eq. 9.129) satisfy the modified wave equation
(9.124) for any $\tilde{\mathbf{E}}_0$ and $\tilde{\mathbf{B}}_0$. But Maxwell’s equations (9.123) impose further
constraints, which serve to determine the relative amplitudes, phases,
and polarizations of E and B. As before, (i) and (iii) rule out any z components:
the fields are still transverse. We may as well orient our axes so that
E is polarized along the x direction:

$$\tilde{\mathbf{E}}(z,t) = \tilde{E}_0e^{-\kappa z}e^{i(kz-\omega t)}\,\hat{\mathbf{x}}. \tag{9.132}$$

Then (ii) gives

$$\tilde{\mathbf{B}}(z,t) = \frac{\tilde{k}}{\omega}\tilde{E}_0e^{-\kappa z}e^{i(kz-\omega t)}\,\hat{\mathbf{y}}. \tag{9.133}$$

(Equation (iv) says the same thing.) Once again, the electric and magnetic
fields are mutually perpendicular.
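
Equation 9.128 is easy to mistype, so it is worth a numerical sanity check: squaring $k + i\kappa$ should reproduce Eq. 9.126, that is, $k^2 - \kappa^2 = \mu\epsilon\omega^2$ and $2k\kappa = \mu\sigma\omega$. The Python sketch below does this for a made-up medium ($\sigma = 10^{-3}\ (\Omega\cdot\text{m})^{-1}$, $\epsilon = \epsilon_0$, $\mu = \mu_0$, $\omega = 10^8$/s); these values are purely illustrative, and the function names are mine.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space (F/m)
MU0 = 1.25663706212e-6   # permeability of free space (N/A^2)


def wave_numbers(eps, mu, sigma, omega):
    """Real and imaginary parts (k, kappa) of the complex wave number, Eq. 9.128."""
    root = math.sqrt(1.0 + (sigma / (eps * omega)) ** 2)
    prefactor = omega * math.sqrt(eps * mu / 2.0)
    return prefactor * math.sqrt(root + 1.0), prefactor * math.sqrt(root - 1.0)


def skin_depth(eps, mu, sigma, omega):
    """Skin depth d = 1/kappa, Eq. 9.130 (infinite when sigma = 0)."""
    _, kappa = wave_numbers(eps, mu, sigma, omega)
    return math.inf if kappa == 0.0 else 1.0 / kappa


def wavelength_speed_index(eps, mu, sigma, omega):
    """Wavelength, propagation speed, and index of refraction, Eq. 9.131."""
    k, _ = wave_numbers(eps, mu, sigma, omega)
    c = 1.0 / math.sqrt(EPS0 * MU0)
    return 2.0 * math.pi / k, omega / k, c * k / omega


sigma, omega = 1.0e-3, 1.0e8   # hypothetical medium and frequency
k, kappa = wave_numbers(EPS0, MU0, sigma, omega)
print(k ** 2 - kappa ** 2, MU0 * EPS0 * omega ** 2)   # real part of Eq. 9.126
print(2.0 * k * kappa, MU0 * sigma * omega)           # imaginary part of Eq. 9.126
print(skin_depth(EPS0, MU0, sigma, omega))            # meters
```

For very large $\sigma/\epsilon\omega$ the difference $k^2 - \kappa^2$ suffers from round-off in this direct form, so the check is best run, as here, with $\sigma/\epsilon\omega$ of order one.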

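Like any complex number, $\tilde{k}$ can be expressed in terms of its modulus
and phase:

$$\tilde{k} = Ke^{i\phi}, \tag{9.134}$$

where

$$K \equiv |\tilde{k}| = \sqrt{k^2+\kappa^2} = \omega\sqrt{\epsilon\mu\sqrt{1+\left(\frac{\sigma}{\epsilon\omega}\right)^2}} \tag{9.135}$$

and

$$\phi \equiv \tan^{-1}(\kappa/k). \tag{9.136}$$

According to Eqs. 9.132 and 9.133, the complex amplitudes $\tilde{E}_0 = E_0e^{i\delta_E}$
and $\tilde{B}_0 = B_0e^{i\delta_B}$ are related by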


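$$B_0e^{i\delta_B} = \frac{Ke^{i\phi}}{\omega}E_0e^{i\delta_E}. \tag{9.137}$$

Evidently the electric and magnetic fields are no longer in phase; in fact,

$$\delta_B - \delta_E = \phi; \tag{9.138}$$

the magnetic field lags behind the electric field. Meanwhile, the (real)
amplitudes of E and B are related by

$$\frac{B_0}{E_0} = \frac{K}{\omega} = \sqrt{\epsilon\mu\sqrt{1+\left(\frac{\sigma}{\epsilon\omega}\right)^2}}. \tag{9.139}$$

The (real) electric and magnetic fields are, finally,

$$\begin{aligned}
\mathbf{E}(z,t) &= E_0e^{-\kappa z}\cos(kz-\omega t+\delta_E)\,\hat{\mathbf{x}},\\
\mathbf{B}(z,t) &= B_0e^{-\kappa z}\cos(kz-\omega t+\delta_E+\phi)\,\hat{\mathbf{y}}.
\end{aligned} \tag{9.140}$$

These fields are shown in Fig. 9.18.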
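
The phase lag $\phi$ and the amplitude ratio $B_0/E_0$ follow directly from k and $\kappa$. Here is a minimal Python sketch of Eqs. 9.136 and 9.139, using the same made-up medium parameters as a stand-in for real data; with $\sigma = 0$ it should fall back to the nonconducting results (no lag, $B_0/E_0 = \sqrt{\epsilon\mu} = 1/v$).

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of free space (F/m)
MU0 = 1.25663706212e-6   # permeability of free space (N/A^2)


def phase_and_amplitude(eps, mu, sigma, omega):
    """Return (phi, B0/E0): the phase lag of B (Eq. 9.136) and amplitude ratio (Eq. 9.139)."""
    root = math.sqrt(1.0 + (sigma / (eps * omega)) ** 2)
    prefactor = omega * math.sqrt(eps * mu / 2.0)
    k = prefactor * math.sqrt(root + 1.0)       # Eq. 9.128
    kappa = prefactor * math.sqrt(root - 1.0)   # Eq. 9.128
    phi = math.atan2(kappa, k)                  # phi = arctan(kappa / k)
    ratio = math.hypot(k, kappa) / omega        # K / omega
    return phi, ratio


print(phase_and_amplitude(EPS0, MU0, 0.0, 1.0e8))      # no conductivity: phi = 0
print(phase_and_amplitude(EPS0, MU0, 1.0e-3, 1.0e8))   # hypothetical conducting medium
```

Since $\kappa < k$ for any finite $\sigma$, the lag always lies between 0 and 45°.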


Problem 9.20. 


(a) Suppose you embedded some free charge in a piece of glass. About how long would it take 
for the charge to flow to the surface? 

(b) Silver is an excellent conductor, but it’s expensive. Suppose you were designing a microwave
experiment to operate at a frequency of $10^{10}$ Hz. How thick would you make the silver
coatings?

(c) Find the wavelength and propagation speed in copper for radio waves at 1 MHz. Compare 
the corresponding values in air (or vacuum). 


Problem 9.21. 


(a) Show that the skin depth in a poor conductor ($\sigma \ll \omega\epsilon$) is $(2/\sigma)\sqrt{\epsilon/\mu}$ (independent of
frequency). Find the skin depth (in meters) for (pure) water. (Use the static values of $\epsilon$, $\mu$,
and $\sigma$; your answers will be valid only at relatively low frequencies.)

(b) Show that the skin depth in a good conductor ($\sigma \gg \omega\epsilon$) is $\lambda/2\pi$ (where $\lambda$ is the
wavelength in the conductor). Find the skin depth (in nanometers) for a typical metal
($\sigma \approx 10^7\ (\Omega\cdot\text{m})^{-1}$) in the visible range ($\omega \approx 10^{15}$/s), assuming $\epsilon \approx \epsilon_0$ and
$\mu \approx \mu_0$. Why are metals opaque?

(c) Show that in a good conductor the magnetic field lags the electric field by 45°, and find
the ratio of their amplitudes. For a numerical example, use the “typical metal” in part (b).


Problem 9.22. 


(a) Calculate the (time-averaged) energy density of an electromagnetic plane wave in a
conducting medium (Eq. 9.140). Show that the magnetic contribution always dominates.
[Answer: $(k^2/2\mu\omega^2)E_0^2e^{-2\kappa z}$]

(b) Show that the intensity is $(k/2\mu\omega)E_0^2e^{-2\kappa z}$.


9.4.2 Reflection at a Conducting Surface 


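The boundary conditions we used to analyze reflection and refraction at an
interface between two dielectrics do not hold in the presence of free charges
and currents. Instead, we have the more general relations (Eq. 7.73)

$$\begin{aligned}
\text{(i)}\quad & \epsilon_1E_1^{\perp} - \epsilon_2E_2^{\perp} = \sigma_f, & \text{(iii)}\quad & \mathbf{E}_1^{\parallel} - \mathbf{E}_2^{\parallel} = 0,\\
\text{(ii)}\quad & B_1^{\perp} - B_2^{\perp} = 0, & \text{(iv)}\quad & \frac{1}{\mu_1}\mathbf{B}_1^{\parallel} - \frac{1}{\mu_2}\mathbf{B}_2^{\parallel} = \mathbf{K}_f\times\hat{\mathbf{n}},
\end{aligned} \tag{9.141}$$

where $\sigma_f$ (not to be confused with conductivity) is the free surface charge, $\mathbf{K}_f$
is the free surface current, and $\hat{\mathbf{n}}$ (not to be confused with the polarization
of the wave) is a unit vector perpendicular to the surface, pointing from
medium 2 into medium 1. For ohmic conductors ($\mathbf{J}_f = \sigma\mathbf{E}$) there can be no
free surface current, since this would require an infinite electric field at the
boundary.²¹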

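Suppose now that the xy plane forms the boundary between a nonconducting
linear medium 1 and a conductor 2. A monochromatic plane wave,
traveling in the z direction and polarized in the x direction, approaches from
the left, as in Fig. 9.13:

$$\tilde{\mathbf{E}}_I(z,t) = \tilde{E}_{0_I}e^{i(k_1z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_I(z,t) = \frac{1}{v_1}\tilde{E}_{0_I}e^{i(k_1z-\omega t)}\,\hat{\mathbf{y}}. \tag{9.142}$$

This incident wave gives rise to a reflected wave,

$$\tilde{\mathbf{E}}_R(z,t) = \tilde{E}_{0_R}e^{i(-k_1z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_R(z,t) = -\frac{1}{v_1}\tilde{E}_{0_R}e^{i(-k_1z-\omega t)}\,\hat{\mathbf{y}}, \tag{9.143}$$

propagating back to the left in medium 1, and a transmitted wave

$$\tilde{\mathbf{E}}_T(z,t) = \tilde{E}_{0_T}e^{i(\tilde{k}_2z-\omega t)}\,\hat{\mathbf{x}}, \qquad \tilde{\mathbf{B}}_T(z,t) = \frac{\tilde{k}_2}{\omega}\tilde{E}_{0_T}e^{i(\tilde{k}_2z-\omega t)}\,\hat{\mathbf{y}}, \tag{9.144}$$

which is attenuated as it penetrates into the conductor.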


21 The vector J is a delta function at a surface current. TIl consider the limiting case of a 
perfect conductor (o = œ) ina moment. 
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At z=0, the combined wave in medium 1 must join the wave in medium 2, 
pursuant to the boundary conditions (Eq. 9.141). Since E+ =0 on both 
sides, boundary condition (i) yields gp = 0. Since B+ = 0, (ii) is automatically 
satisfied. Meanwhile, (iii) gives 


Eo, + Eog = Éor, (9.145) 
and (iv) (with Ky = 0) says 
bo - Eo,) - a = 0, (9.146) 
or 
Eo, — Eo, = BEo;; (9.147) 
where 
pz mo” (9.148) 
It follows that 
Eq, = (pe Eo, = (pir (9.149) 


These results are formally identical to the ones that apply at the boundary
between nonconductors (Eq. 9.83), but the resemblance is deceptive since $\tilde\beta$
is now a complex number.

For a perfect conductor ($\sigma = \infty$), $k_2 = \infty$ (Eq. 9.128), so $\tilde\beta$ is infinite, and²²

$$ \tilde{E}_{0_R} = -\tilde{E}_{0_I}, \qquad \tilde{E}_{0_T} = 0. \tag{9.150} $$
In this case the wave is totally reflected, with a 180° phase shift. (That’s
why excellent conductors make good mirrors. In practice, you paint a thin
coating of silver onto the back of a pane of glass; the glass has nothing
to do with the reflection, and is just there to support the silver and to keep it
from tarnishing. Since the skin depth in silver at optical frequencies is less
than 100 Å, you don’t need a very thick layer.)
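To see how the reflected fraction creeps toward 1 as the conductivity grows, here is a minimal Python sketch of Eqs. 9.148 and 9.149. It assumes the complex wave number in the conductor obeys $\tilde{k}_2^2 = \mu_2\epsilon_2\omega^2 + i\mu_2\sigma\omega$ (quoted from the treatment of conductors in Section 9.4.1, not restated here); the function name `reflection_coefficient` and the sample frequency and conductivities are illustrative choices, not values from the text.

```python
import cmath
import math

MU0 = 4e-7 * math.pi        # permeability of free space (H/m)
EPS0 = 8.8541878128e-12     # permittivity of free space (F/m)

def reflection_coefficient(omega, sigma, eps1=EPS0, mu1=MU0, eps2=EPS0, mu2=MU0):
    """R = |E0R/E0I|^2 for normal incidence from medium 1 onto conductor 2."""
    v1 = 1.0 / math.sqrt(eps1 * mu1)     # wave speed in medium 1
    # Assumed relation from Section 9.4.1:
    # k2^2 = mu2*eps2*omega^2 + i*mu2*sigma*omega.
    # cmath.sqrt returns the root with non-negative real and imaginary parts.
    k2 = cmath.sqrt(mu2 * eps2 * omega**2 + 1j * mu2 * sigma * omega)
    beta = mu1 * v1 * k2 / (mu2 * omega)         # Eq. 9.148
    return abs((1 - beta) / (1 + beta)) ** 2     # squared modulus of Eq. 9.149

omega = 1.0e10   # rad/s, an arbitrary microwave-range choice
for sigma in (0.0, 1.0, 1.0e3, 1.0e7):
    R = reflection_coefficient(omega, sigma)
    print(f"sigma = {sigma:8.1e} (ohm m)^-1  ->  R = {R:.6f}")
```

With $\sigma = 0$ and identical media on both sides, $\tilde\beta = 1$ and nothing is reflected; as $\sigma$ increases, $|\tilde\beta|$ grows and the reflected fraction approaches the perfect-conductor limit of Eq. 9.150.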


Problem 9.23. Calculate the reflection coefficient for light at an air-to-silver interface
($\mu_1 = \mu_2 = \mu_0$, $\epsilon_1 = \epsilon_0$, $\sigma = 6 \times 10^7\ (\Omega\cdot\text{m})^{-1}$), at optical frequencies ($\omega = 4 \times 10^{15}$/s).


9.4.3 The Frequency Dependence of Permittivity 


In the preceding sections, we have seen that the propagation of electromagnetic
waves through matter is governed by three properties of the material:
the permittivity $\epsilon$, the permeability $\mu$, and the conductivity $\sigma$. Actually,
each of these parameters depends to some extent on the frequency of
the waves you are considering. Indeed, it is well known from optics that
$n \cong \sqrt{\epsilon_r}$ is a function of wavelength (Fig. 9.19 shows the graph for a typical
glass). A prism or a raindrop bends blue light more sharply than red, and
spreads white light out into a rainbow of colors. This phenomenon is called
dispersion. By extension, whenever the speed of a wave depends on its
frequency, the supporting medium is said to be dispersive.

22 For very large $\sigma$, $k \cong \kappa \cong \sqrt{\omega\sigma\mu/2}$ (Eq. 9.128), $\tilde{E}_{0_T}$ goes like $1/\sqrt{\sigma}$, and the free current
density $\mathbf{J}_f = \sigma\mathbf{E}$ increases as the square root of $\sigma$. But this current is confined to a “skin”
of diminishing depth $d = 1/\kappa \sim 1/\sqrt{\sigma}$, and goes to a surface current in the limit $\sigma \to \infty$.

Because waves of different frequency travel at different speeds in a
dispersive medium, a wave form that incorporates a range of frequencies
will change shape as it propagates. A sharply peaked wave typically flattens
out, and whereas each sinusoidal component travels at the ordinary wave
(or phase) velocity,

$$ v = \frac{\omega}{k}, \tag{9.151} $$

the packet as a whole (the “envelope”) moves at the so-called group
velocity²³

$$ v_g = \frac{d\omega}{dk}. \tag{9.152} $$

[You can demonstrate this by dropping a rock into the nearest pond and
watching the waves that form: While the disturbance as a whole spreads
out in a circle, moving at speed $v_g$, the ripples that go to make it up will
be seen to travel twice as fast ($v = 2v_g$ in this case). They appear at the
back end of the packet, growing as they move forward to the center, then
shrinking again and fading away at the front (Fig. 9.20).] We shall not
concern ourselves with these matters; I’ll stick to monochromatic waves,
for which the problem does not arise. But I should just mention that the
energy carried by a wave packet in a dispersive medium does not travel at the
phase velocity. Don’t be too alarmed, therefore, if in some circumstances $v$
comes out greater than $c$.²⁴
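The factor of two in the pond example can be checked numerically. The sketch below is a minimal illustration, assuming the standard deep-water dispersion relation $\omega = \sqrt{gk}$ (not derived in this section); the function names and the 0.5 m wavelength are arbitrary choices. It evaluates Eq. 9.151 directly and Eq. 9.152 by a central difference.

```python
import math

G = 9.81  # gravitational acceleration (m/s^2)

def omega_deep_water(k):
    """Assumed deep-water dispersion relation, omega = sqrt(g k)."""
    return math.sqrt(G * k)

def phase_velocity(omega_of_k, k):
    """Eq. 9.151: v = omega / k."""
    return omega_of_k(k) / k

def group_velocity(omega_of_k, k, rel_step=1e-6):
    """Eq. 9.152: v_g = d(omega)/dk, estimated by a central difference."""
    h = rel_step * k
    return (omega_of_k(k + h) - omega_of_k(k - h)) / (2.0 * h)

k = 2.0 * math.pi / 0.5          # wave number for a 0.5 m wavelength (arbitrary)
v = phase_velocity(omega_deep_water, k)
vg = group_velocity(omega_deep_water, k)
print(f"v = {v:.4f} m/s, v_g = {vg:.4f} m/s, v/v_g = {v / vg:.4f}")
```

Any other dispersion relation can be passed in place of `omega_deep_water`; for a nondispersive medium ($\omega \propto k$) the two velocities coincide.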


23 See A. P. French, Vibrations and Waves, W. W. Norton, New York (1971), p. 230, or F. S. 
Crawford, Jr., Waves, McGraw-Hill, New York (1968), Sect. 6.2. 

24 Even the group velocity can exceed c in special cases — see P. C. Peters, Am. J. Phys. 56, 129 
(1988), or work Prob. 9.27. For delightful commentary, see C. F. Bohren, Am. J. Phys. 77, 
101 (2009). And if two different “speeds of light” are not enough to satisfy you, check out 
My purpose in this section is to account for the frequency dependence
of $\epsilon$ in dielectrics, using a simplified model for the behavior of the electrons.
Like all classical models of atomic-scale phenomena, it is at best an approx- 
imation to the truth; nevertheless, it does yield qualitatively satisfactory 
results, and it provides a plausible mechanism for dispersion in transparent 
media. 

The electrons in a nonconductor are bound to specific molecules. The
actual binding forces can be quite complicated, but we shall picture each
electron as attached to the end of a spring, with force constant $k_{\text{spring}}$
(Fig. 9.21):

$$ F_{\text{binding}} = -k_{\text{spring}}\,x = -m\omega_0^2 x, \tag{9.153} $$

where $x$ is the displacement from equilibrium, $m$ is the electron’s mass, and
$\omega_0$ is the natural oscillation frequency, $\sqrt{k_{\text{spring}}/m}$. If this strikes you as
an implausible model, look back at Ex. 4.1, where we were led to a force
of precisely this form. As a matter of fact, practically any binding force
can be approximated this way for sufficiently small displacements from
equilibrium, as you can see by expanding the potential energy in a Taylor
series about the equilibrium point:

$$ U(x) = U(0) + xU'(0) + \frac{1}{2}x^2 U''(0) + \cdots. $$


The first term is a constant, with no dynamical significance (you can 
always adjust the zero of potential energy so that U(0) = 0). The second 
term automatically vanishes, since dU/dx = —F, and by the nature of an 
equilibrium, the force at that point is zero. The third term is precisely the 
potential energy of a spring with force constant $k_{\text{spring}} = d^2U/dx^2\big|_0$ (the
second derivative is positive, for a point of stable equilibrium). As long
as the displacements are small, the higher-order terms in the series can be 
neglected. Geometrically, all I am saying is that practically any function can 
be fit near a minimum by a suitable parabola. 
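The parabola claim is easy to test on a concrete potential. The sketch below is only an illustration: it uses a Morse-type potential (a hypothetical stand-in for the real binding force, with made-up parameters), estimates $k_{\text{spring}} = U''(0)$ by finite differences, and compares $U(x)$ with $\frac{1}{2}k_{\text{spring}}x^2$ at a few displacements.

```python
import math

def morse(x, depth=1.0, a=2.0):
    """Illustrative binding potential with its minimum at x = 0 (hypothetical choice)."""
    return depth * (1.0 - math.exp(-a * x)) ** 2

def second_derivative(f, x0, h=1e-4):
    """Central-difference estimate of f''(x0)."""
    return (f(x0 + h) - 2.0 * f(x0) + f(x0 - h)) / h**2

k_spring = second_derivative(morse, 0.0)   # spring constant, k_spring = U''(0)

for x in (0.01, 0.05, 0.2):
    exact = morse(x)
    parabola = 0.5 * k_spring * x**2
    print(f"x = {x:4.2f}: U = {exact:.6f}, (1/2) k x^2 = {parabola:.6f}")
```

For this potential $U''(0) = 2\,\text{depth}\cdot a^2$, so the estimate should land near 8; the agreement between the two columns degrades as $x$ grows, which is the sense in which the spring model is a small-displacement approximation.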

Meanwhile, there will presumably be some damping force on the
electron:

$$ F_{\text{damping}} = -m\gamma\frac{dx}{dt}. \tag{9.154} $$


S. C. Bloch, Am. J. Phys. 45, 538 (1977), in which no fewer than eight distinct velocities are 
identified! Indeed, it’s not clear what you mean by the “velocity” of something that changes 
shape as it moves, and has no precise beginning or end. Do you mean the speed at which 
the peak intensity propagates? Or the speed at which energy is transported? Or information 
transmitted? In special relativity no causal signal can travel faster than c, but some of the 
other “velocities” have no such restriction. 
[Fig. 9.21. Labels: Electron, $k_{\text{spring}}$.]


Again I have chosen the simplest possible form; the damping must be 
opposite in direction to the velocity, and making it proportional to the 
velocity is the easiest way to accomplish this. The cause of the damping does 
not concern us here — among other things, an oscillating charge radiates, 
and the radiation siphons off energy. We will calculate this “radiation 
damping” in Chapter 11. 

In the presence of an electromagnetic wave of frequency $\omega$, polarized in
the x direction (Fig. 9.21), the electron is subject to a driving force

$$ F_{\text{driving}} = qE = qE_0\cos(\omega t), \tag{9.155} $$

where $q$ is the charge of the electron and $E_0$ is the amplitude of the wave
at the point $z$ where the electron is situated. (Since we’re only interested in
one point, I have reset the clock so that the maximum $E$ occurs there at
$t = 0$. For simplicity, I assume the magnetic force is negligible.) Putting all
this into Newton’s second law gives

$$ m\frac{d^2x}{dt^2} = F_{\text{tot}} = F_{\text{binding}} + F_{\text{damping}} + F_{\text{driving}}, $$

or

$$ m\frac{d^2x}{dt^2} + m\gamma\frac{dx}{dt} + m\omega_0^2 x = qE_0\cos(\omega t). \tag{9.156} $$

Our model, then, describes the electron as a damped harmonic oscillator,
driven at frequency $\omega$. (The much more massive nucleus remains at rest.)

Equation 9.156 is easier to handle if we regard it as the real part of a
complex equation:

$$ \frac{d^2\tilde{x}}{dt^2} + \gamma\frac{d\tilde{x}}{dt} + \omega_0^2\tilde{x} = \frac{q}{m}E_0 e^{-i\omega t}. \tag{9.157} $$

In the steady state, the system oscillates at the driving frequency:

$$ \tilde{x}(t) = \tilde{x}_0 e^{-i\omega t}. \tag{9.158} $$

Inserting this into Eq. 9.157, we obtain

$$ \tilde{x}_0 = \frac{q/m}{\omega_0^2 - \omega^2 - i\gamma\omega}\,E_0. \tag{9.159} $$

The resulting dipole moment is the real part of

$$ \tilde{p}(t) = q\tilde{x}(t) = \frac{q^2/m}{\omega_0^2 - \omega^2 - i\gamma\omega}\,E_0 e^{-i\omega t}. \tag{9.160} $$

The imaginary term in the denominator means that $p$ is out of phase with $E$,
lagging behind by an angle $\tan^{-1}[\gamma\omega/(\omega_0^2 - \omega^2)]$ that is very small when
$\omega \ll \omega_0$ and rises to $\pi$ when $\omega \gg \omega_0$.
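The behavior of the phase lag is quick to tabulate. This is a minimal sketch of the complex response in Eq. 9.160, in arbitrary units ($q^2/m = 1$, $\omega_0 = 1$, and $\gamma = 0.1$ are illustrative, not values from the text); the function names are hypothetical. With the $e^{-i\omega t}$ convention, the argument of $\tilde{p}_0/E_0$ is the angle by which $p$ lags $E$.

```python
import cmath

def dipole_response(omega, omega0, gamma, q2_over_m=1.0):
    """Complex ratio p0/E0 from Eq. 9.160 (arbitrary units in this sketch)."""
    return q2_over_m / (omega0**2 - omega**2 - 1j * gamma * omega)

def phase_lag(omega, omega0, gamma):
    """Angle (radians) by which p lags E; it runs from 0 toward pi as omega increases."""
    return cmath.phase(dipole_response(omega, omega0, gamma))

omega0, gamma = 1.0, 0.1    # illustrative values only
for omega in (0.01, 0.5, 1.0, 2.0, 100.0):
    amplitude = abs(dipole_response(omega, omega0, gamma))
    lag = phase_lag(omega, omega0, gamma)
    print(f"omega/omega0 = {omega:6.2f}: |p0/E0| = {amplitude:.4e}, lag = {lag:.4f} rad")
```

The lag passes through $\pi/2$ exactly at $\omega = \omega_0$, where the amplitude is also largest for small $\gamma$.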

In general, differently situated electrons within a given molecule experience
different natural frequencies and damping coefficients. Let’s say there
are $f_j$ electrons with frequency $\omega_j$ and damping $\gamma_j$ in each molecule. If there
are $N$ molecules per unit volume, the polarization $\mathbf{P}$ is given by²⁵ the real
part of

$$ \tilde{\mathbf{P}} = \frac{Nq^2}{m}\left(\sum_j \frac{f_j}{\omega_j^2 - \omega^2 - i\gamma_j\omega}\right)\tilde{\mathbf{E}}. \tag{9.161} $$

Now, I defined the electric susceptibility as the proportionality constant
between $\mathbf{P}$ and $\mathbf{E}$ (specifically, $\mathbf{P} = \epsilon_0\chi_e\mathbf{E}$). In the present case, $\mathbf{P}$ is not
proportional to $\mathbf{E}$ (this is not, strictly speaking, a linear medium) because of
the difference in phase. However, the complex polarization $\tilde{\mathbf{P}}$ is proportional
to the complex field $\tilde{\mathbf{E}}$, and this suggests that we introduce a complex
susceptibility, $\tilde\chi_e$:

$$ \tilde{\mathbf{P}} = \epsilon_0\tilde\chi_e\tilde{\mathbf{E}}. \tag{9.162} $$

All of the manipulations we went through before carry over, on the understanding
that the physical polarization is the real part of $\tilde{\mathbf{P}}$, just as the
physical field is the real part of $\tilde{\mathbf{E}}$. In particular, the proportionality between
$\tilde{\mathbf{D}}$ and $\tilde{\mathbf{E}}$ is the complex permittivity $\tilde\epsilon = \epsilon_0(1 + \tilde\chi_e)$, and the complex dielectric
constant (in this model) is

$$ \tilde\epsilon_r = 1 + \frac{Nq^2}{m\epsilon_0}\sum_j \frac{f_j}{\omega_j^2 - \omega^2 - i\gamma_j\omega}. \tag{9.163} $$

Ordinarily, the imaginary term is negligible; however, when $\omega$ is very close
to one of the resonant frequencies ($\omega_j$) it plays an important role, as we
shall see.

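In a dispersive medium, the wave equation for a given frequency reads

$$ \nabla^2\tilde{\mathbf{E}} = \tilde\epsilon\mu_0\frac{\partial^2\tilde{\mathbf{E}}}{\partial t^2}; \tag{9.164} $$

it admits plane wave solutions, as before,

$$ \tilde{\mathbf{E}}(z,t) = \tilde{\mathbf{E}}_0 e^{i(\tilde{k}z - \omega t)}, \tag{9.165} $$

with the complex wave number

$$ \tilde{k} \equiv \sqrt{\tilde\epsilon\mu_0}\,\omega. \tag{9.166} $$

Writing $\tilde{k}$ in terms of its real and imaginary parts,

$$ \tilde{k} = k + i\kappa, \tag{9.167} $$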


25 This applies directly to the case of a dilute gas; for denser materials the theory is modified 
slightly, in accordance with the Clausius—Mossotti equation (Prob. 4.42). By the way, don’t 
confuse the “polarization” of a medium, P, with the “polarization” of a wave — same word, 
but completely unrelated meanings. 
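Eq. 9.165 becomes

$$ \tilde{\mathbf{E}}(z,t) = \tilde{\mathbf{E}}_0 e^{-\kappa z} e^{i(kz - \omega t)}. \tag{9.168} $$

The wave is attenuated (this is hardly surprising, since the damping absorbs
energy). Because the intensity is proportional to $E^2$ (and hence to $e^{-2\kappa z}$),
the quantity

$$ \alpha \equiv 2\kappa \tag{9.169} $$

is called the absorption coefficient. Meanwhile, the wave velocity is $\omega/k$, and
the index of refraction is

$$ n = \frac{ck}{\omega}. \tag{9.170} $$

I have deliberately used notation reminiscent of Section 9.4.1. However,
in the present case $k$ and $\kappa$ have nothing to do with conductivity; rather,
they are determined by the parameters of our damped harmonic oscillator.
For gases, the second term in Eq. 9.163 is small, and we can approximate
the square root (Eq. 9.166) by the first term in the binomial expansion,
$\sqrt{1+\varepsilon} \cong 1 + \frac{1}{2}\varepsilon$. Then

$$ \tilde{k} = \frac{\omega}{c}\sqrt{\tilde\epsilon_r} \cong \frac{\omega}{c}\left[1 + \frac{Nq^2}{2m\epsilon_0}\sum_j \frac{f_j}{\omega_j^2 - \omega^2 - i\gamma_j\omega}\right], \tag{9.171} $$

so

$$ n = \frac{ck}{\omega} \cong 1 + \frac{Nq^2}{2m\epsilon_0}\sum_j \frac{f_j(\omega_j^2 - \omega^2)}{(\omega_j^2 - \omega^2)^2 + \gamma_j^2\omega^2}, \tag{9.172} $$

and

$$ \alpha = 2\kappa \cong \frac{Nq^2\omega^2}{m\epsilon_0 c}\sum_j \frac{f_j\gamma_j}{(\omega_j^2 - \omega^2)^2 + \gamma_j^2\omega^2}. \tag{9.173} $$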


In Fig. 9.22 I have plotted the index of refraction and the absorption 
coefficient in the vicinity of one of the resonances. Most of the time the 
index of refraction rises gradually with increasing frequency, consistent 
with our experience from optics (Fig. 9.19). However, in the immediate 
neighborhood of a resonance, the index of refraction drops sharply. Because 
this behavior is atypical, it is called anomalous dispersion. Notice that the 
region of anomalous dispersion ($\omega_1 < \omega < \omega_2$ in the figure) coincides with
the region of maximum absorption; in fact, the material may be practically 
opaque in this frequency range. The reason is that we are now driving the 
electrons at their “favorite” frequency; the amplitude of their oscillation is 
relatively large, and a correspondingly large amount of energy is dissipated 
by the damping mechanism. 
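The shape of these curves can be reproduced with a few lines of Python. The sketch below keeps a single term of Eqs. 9.172 and 9.173 and works in dimensionless variables that I am introducing for convenience: $x = \omega/\omega_0$, $g = \gamma/\omega_0$, and `strength` $= Nq^2f/(2m\epsilon_0\omega_0^2)$. The numerical values are illustrative only.

```python
def index_and_absorption(x, g, strength):
    """Single-resonance forms of Eqs. 9.172 and 9.173 in dimensionless variables.

    x = omega/omega0, g = gamma/omega0, strength = N q^2 f / (2 m eps0 omega0^2).
    Returns (n, alpha * c / omega0).
    """
    denom = (1.0 - x**2) ** 2 + (g * x) ** 2
    n = 1.0 + strength * (1.0 - x**2) / denom
    alpha_scaled = 2.0 * strength * g * x**2 / denom
    return n, alpha_scaled

g, strength = 0.02, 1.0e-3                      # illustrative values, not from the text
xs = [0.9 + 1.0e-4 * i for i in range(2001)]    # omega/omega0 from 0.9 to 1.1
ns = [index_and_absorption(x, g, strength)[0] for x in xs]
alphas = [index_and_absorption(x, g, strength)[1] for x in xs]

x_at_max_n = xs[ns.index(max(ns))]
x_at_min_n = xs[ns.index(min(ns))]
x_at_max_alpha = xs[alphas.index(max(alphas))]

print(f"n peaks at omega/omega0 = {x_at_max_n:.4f} and bottoms out at {x_at_min_n:.4f}")
print(f"absorption peaks at omega/omega0 = {x_at_max_alpha:.4f}")
```

Between the two extrema of $n$ the index falls with increasing frequency (the anomalous region), and the absorption maximum sits inside that interval.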

In Fig. 9.22, n runs below 1 above the resonance, suggesting that the 
wave speed exceeds c. As I mentioned earlier, this is no immediate cause 
for alarm, since energy does not travel at the wave velocity. Moreover, the 
graph does not include the contributions of other terms in the sum, which 
add a relatively constant “background” that, in some cases, keeps n > 1 on 
both sides of the resonance. Incidentally, the group velocity can also exceed
$c$ in the neighborhood of a resonance, in this model (see Prob. 9.27).

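If you agree to stay away from the resonances, the damping can be
ignored, and the formula for the index of refraction simplifies:

$$ n = 1 + \frac{Nq^2}{2m\epsilon_0}\sum_j \frac{f_j}{\omega_j^2 - \omega^2}. \tag{9.174} $$

For most substances the natural frequencies $\omega_j$ are scattered all over the
spectrum in a rather chaotic fashion. But for transparent materials, the
nearest significant resonances typically lie in the ultraviolet, so that $\omega < \omega_j$.
In that case,

$$ \frac{1}{\omega_j^2 - \omega^2} = \frac{1}{\omega_j^2}\left(1 - \frac{\omega^2}{\omega_j^2}\right)^{-1} \cong \frac{1}{\omega_j^2}\left(1 + \frac{\omega^2}{\omega_j^2}\right), $$

and Eq. 9.174 takes the form

$$ n = 1 + \left(\frac{Nq^2}{2m\epsilon_0}\sum_j \frac{f_j}{\omega_j^2}\right) + \omega^2\left(\frac{Nq^2}{2m\epsilon_0}\sum_j \frac{f_j}{\omega_j^4}\right). \tag{9.175} $$

Or, in terms of the wavelength in vacuum ($\lambda = 2\pi c/\omega$),

$$ n = 1 + A\left(1 + \frac{B}{\lambda^2}\right). \tag{9.176} $$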


This is known as Cauchy’s formula; the constant A is called the coefficient 
of refraction, and B is called the coefficient of dispersion. Cauchy’s equation 
applies reasonably well to most gases, in the optical region. 
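Matching Eq. 9.175 to Eq. 9.176 term by term gives $A = (Nq^2/2m\epsilon_0)\sum_j f_j/\omega_j^2$ and $B = (2\pi c)^2\,\sum_j (f_j/\omega_j^4)\big/\sum_j (f_j/\omega_j^2)$. The sketch below computes these and compares Cauchy's formula with the undamped expression (Eq. 9.174) at visible wavelengths. The prefactor and the two ultraviolet resonances are hypothetical numbers chosen only to give a gas-like index; the function names are likewise my own.

```python
import math

C_LIGHT = 2.99792458e8   # speed of light (m/s)

def n_exact(omega, prefactor, resonances):
    """Eq. 9.174; resonances is a list of (f_j, omega_j), prefactor = N q^2 / (2 m eps0)."""
    return 1.0 + prefactor * sum(f / (w**2 - omega**2) for f, w in resonances)

def cauchy_coefficients(prefactor, resonances):
    """A and B of Eq. 9.176, read off from Eq. 9.175 with omega = 2 pi c / lambda."""
    s2 = sum(f / w**2 for f, w in resonances)
    s4 = sum(f / w**4 for f, w in resonances)
    return prefactor * s2, (2.0 * math.pi * C_LIGHT) ** 2 * s4 / s2

def n_cauchy(lam, A, B):
    """Eq. 9.176."""
    return 1.0 + A * (1.0 + B / lam**2)

prefactor = 3.0e28                              # s^-2, hypothetical
resonances = [(1.0, 2.0e16), (1.0, 3.0e16)]     # (f_j, omega_j in rad/s), hypothetical
A, B = cauchy_coefficients(prefactor, resonances)
print(f"A = {A:.4e}, B = {B:.4e} m^2")
for lam in (400e-9, 550e-9, 700e-9):
    omega = 2.0 * math.pi * C_LIGHT / lam
    print(f"lambda = {lam * 1e9:.0f} nm: exact n - 1 = {n_exact(omega, prefactor, resonances) - 1:.6e}, "
          f"Cauchy n - 1 = {n_cauchy(lam, A, B) - 1:.6e}")
```

The two columns agree to better than a percent of $n - 1$ here because $\omega^2/\omega_j^2$ is only a few percent across the visible range; the approximation deteriorates as $\omega$ approaches the nearest resonance.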

What I have described in this section is certainly not the complete story 
of dispersion in nonconducting media. Nevertheless, it does indicate how 
the damped harmonic motion of electrons can account for the frequency 
dependence of the index of refraction, and it explains why n is ordinarily 
a slowly increasing function of $\omega$, with occasional “anomalous” regions
where it precipitously drops. 
Problem 9.24. The relation between wave velocity and group velocity depends on the kind of
waves you are talking about.


(a) Shallow water is nondispersive; waves travel at a speed that is proportional to the square
root of the depth. In deep water, however, the waves can't “feel” all the way down to
the bottom; they behave as though the depth were proportional to $\lambda$. (Actually, the
distinction between “shallow” and “deep” itself depends on the wavelength: If the depth is
less than $\lambda$, the water is “shallow”; if it is substantially greater than $\lambda$, the water is “deep.”)
Show that the wave velocity of deep-water waves is twice the group velocity.

(b) In quantum mechanics, a free particle of mass $m$ traveling in the x direction is described
by the wave function

$$ \Psi(x,t) = A e^{i(px - Et)/\hbar}, $$

where $p$ is the momentum and $E = p^2/2m$ is the kinetic energy. Calculate the group
velocity and the wave velocity. Which one corresponds to the classical speed of the particle?
Note that the wave velocity is half the group velocity.


Problem 9.25. If you take the model in Ex. 4.1 at face value, what natural frequency do you
get? Put in the actual numbers. Where, in the electromagnetic spectrum, does this lie, assuming
the radius of the atom is 0.5 Å? Find the coefficients of refraction and dispersion, and compare
them with the measured values for hydrogen at 0° C and atmospheric pressure: $A = 1.36 \times 10^{-4}$,
$B = 7.7 \times 10^{-15}\ \text{m}^2$.


Problem 9.26. Find the width of the anomalous dispersion region for the case of a single
resonance at frequency $\omega_0$. Assume $\gamma \ll \omega_0$. Show that the index of refraction assumes its
maximum and minimum values at points where the absorption coefficient is at half-maximum.


Problem 9.27. Starting with Eq. 9.172, calculate the group velocity, assuming there is only one
resonance, at $\omega_0$. Use a computer to graph $y \equiv v_g/c$ as a function of $x \equiv (\omega/\omega_0)^2$, from
$x = 0$ to 2, (a) for $\gamma = 0$, and (b) for $\gamma = (0.1)\,\omega_0$. Let $(Nq^2)/(2m\epsilon_0\omega_0^2) = 0.003$.
Note that the group velocity can exceed $c$.


9.5 Guided Waves 


9.5.1 Wave Guides 


So far, we have dealt with plane waves of infinite extent; now we consider 
electromagnetic waves confined to the interior of a hollow pipe, or wave 
guide (Fig. 9.23). We’ll assume the wave guide is a perfect conductor, so 
that $\mathbf{E} = 0$ and $\mathbf{B} = 0$ inside the material itself, and hence the boundary
conditions at the inner wall are²⁶


26 See Eq. 9.141 and Prob. 7.47. In a perfect conductor, E = 0, and hence (by Faraday’s law) 
$\partial\mathbf{B}/\partial t = 0$. Assuming the magnetic field started out zero, then, it will remain so.
$$ \text{(i) } \mathbf{E}^{\parallel} = 0, \qquad \text{(ii) } B^{\perp} = 0. \tag{9.177} $$

Free charges and currents²⁷ will be induced on the surface in such a way as
to enforce these constraints.

We are interested in monochromatic waves that propagate down the tube,
so $\mathbf{E}$ and $\mathbf{B}$ have the generic form

$$ \text{(i) } \tilde{\mathbf{E}}(x,y,z,t) = \tilde{\mathbf{E}}_0(x,y)\,e^{i(kz - \omega t)}, \qquad \text{(ii) } \tilde{\mathbf{B}}(x,y,z,t) = \tilde{\mathbf{B}}_0(x,y)\,e^{i(kz - \omega t)}. \tag{9.178} $$


(For the cases of interest, k is real, so I shall dispense with its tilde.) The 
electric and magnetic fields must, of course, satisfy Maxwell’s equations, in 
the interior of the wave guide: 


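$$ \text{(i) } \nabla\cdot\mathbf{E} = 0, \qquad \text{(iii) } \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}, $$
$$ \text{(ii) } \nabla\cdot\mathbf{B} = 0, \qquad \text{(iv) } \nabla\times\mathbf{B} = \frac{1}{c^2}\frac{\partial\mathbf{E}}{\partial t}. \tag{9.179} $$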


The problem, then, is to find functions $\tilde{\mathbf{E}}_0$ and $\tilde{\mathbf{B}}_0$ such that the fields
(Eq. 9.178) obey the differential equations (Eq. 9.179), subject to boundary 
conditions (Eq. 9.177). 

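As we shall soon see, confined waves are not (in general) transverse; in
order to fit the boundary conditions we shall have to include longitudinal
components ($E_z$ and $B_z$):²⁸

$$ \tilde{\mathbf{E}}_0 = E_x\hat{\mathbf{x}} + E_y\hat{\mathbf{y}} + E_z\hat{\mathbf{z}}, \qquad \tilde{\mathbf{B}}_0 = B_x\hat{\mathbf{x}} + B_y\hat{\mathbf{y}} + B_z\hat{\mathbf{z}}, \tag{9.180} $$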


where each of the components is a function of x and y. Putting this into 
Maxwell’s equations (iii) and (iv), we obtain (Prob. 9.28a) 


27 In Section 9.4.2 I argued that there can be no surface currents in an ohmic conductor (with
finite conductivity). But there are volume currents, extending in (roughly) to the skin depth. 
As the conductivity increases, they are squeezed into a thinner and thinner layer, and in the 
limit of a perfect conductor they become true surface currents. See footnote 22. 

28 To avoid cumbersome notation, I leave the subscript 0 and the tilde off the individual 
components. 
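$$ \text{(i) } \frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = i\omega B_z, \qquad \text{(iv) } \frac{\partial B_y}{\partial x} - \frac{\partial B_x}{\partial y} = -\frac{i\omega}{c^2}E_z, $$
$$ \text{(ii) } \frac{\partial E_z}{\partial y} - ikE_y = i\omega B_x, \qquad \text{(v) } \frac{\partial B_z}{\partial y} - ikB_y = -\frac{i\omega}{c^2}E_x, \tag{9.181} $$
$$ \text{(iii) } ikE_x - \frac{\partial E_z}{\partial x} = i\omega B_y, \qquad \text{(vi) } ikB_x - \frac{\partial B_z}{\partial x} = -\frac{i\omega}{c^2}E_y. $$

Equations (ii), (iii), (v), and (vi) can be solved for $E_x$, $E_y$, $B_x$, and $B_y$:

$$ \text{(i) } E_x = \frac{i}{(\omega/c)^2 - k^2}\left(k\frac{\partial E_z}{\partial x} + \omega\frac{\partial B_z}{\partial y}\right), $$
$$ \text{(ii) } E_y = \frac{i}{(\omega/c)^2 - k^2}\left(k\frac{\partial E_z}{\partial y} - \omega\frac{\partial B_z}{\partial x}\right), $$
$$ \text{(iii) } B_x = \frac{i}{(\omega/c)^2 - k^2}\left(k\frac{\partial B_z}{\partial x} - \frac{\omega}{c^2}\frac{\partial E_z}{\partial y}\right), \tag{9.182} $$
$$ \text{(iv) } B_y = \frac{i}{(\omega/c)^2 - k^2}\left(k\frac{\partial B_z}{\partial y} + \frac{\omega}{c^2}\frac{\partial E_z}{\partial x}\right). $$

It suffices, then, to determine the longitudinal components $E_z$ and $B_z$; if we
knew those, we could quickly calculate all the others, just by differentiating.
Inserting Eq. 9.182 into the remaining Maxwell equations (Prob. 9.28b)
yields uncoupled equations for $E_z$ and $B_z$: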


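$$ \text{(i) } \left[\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + (\omega/c)^2 - k^2\right]E_z = 0, $$
$$ \text{(ii) } \left[\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + (\omega/c)^2 - k^2\right]B_z = 0. \tag{9.183} $$

If $E_z = 0$, we call these TE (“transverse electric”) waves; if $B_z = 0$, they
are called TM (“transverse magnetic”) waves; if both $E_z = 0$ and $B_z = 0$,
we call them TEM waves.²⁹ It turns out that TEM waves cannot occur in a
hollow wave guide.

Proof. If $E_z = 0$, Gauss’s law (Eq. 9.179i) says

$$ \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} = 0, $$

and if $B_z = 0$, Faraday’s law (Eq. 9.179iii) says

$$ \frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = 0. $$

Indeed, the vector $\tilde{\mathbf{E}}_0$ in Eq. 9.180 has zero divergence and zero curl. It
can therefore be written as the gradient of a scalar potential that satisfies
Laplace’s equation. But the boundary condition on $\mathbf{E}$ (Eq. 9.177)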


29 In the case of TEM waves (including the unconfined plane waves of Section 9.2), $k = \omega/c$,
Eqs. 9.182 are indeterminate, and you have to go back to Eqs. 9.181. 
requires that the surface be an equipotential, and since Laplace’s
equation admits no local maxima or minima (Section 3.1.4), this
means that the potential is constant throughout, and hence the
electric field is zero: no wave at all. □


Notice that this argument applies only to a completely empty pipe — if 
you run a separate conductor down the middle, the potential at its surface 
need not be the same as on the outer wall, and hence a nontrivial potential 
is possible. We’ll see an example of this in Section 9.5.3. 


Problem 9.28. 


(a) Derive Eqs. 9.181, and from these obtain Eqs. 9.182. 
(b) Put Eqs. 9.182 into Maxwell's equations (i) and (ii) to obtain Eqs. 9.183. Check that you get 
the same results using (i) and (iv) of Eqs. 9.181. 


9.5.2 TE Waves in a Rectangular Wave Guide 


Suppose we have a wave guide of rectangular shape (Fig. 9.24), with height
$a$ and width $b$, and we are interested in the propagation of TE waves. The
problem is to solve Eq. 9.183ii, subject to the boundary condition 9.177ii.
We'll do it by separation of variables. Let

$$ B_z(x,y) = X(x)\,Y(y), $$

so that

$$ Y\frac{d^2X}{dx^2} + X\frac{d^2Y}{dy^2} + \left[(\omega/c)^2 - k^2\right]XY = 0. $$

Divide by $XY$, and note that the x- and y-dependent terms must be
constants:

$$ \text{(i) } \frac{1}{X}\frac{d^2X}{dx^2} = -k_x^2, \qquad \text{(ii) } \frac{1}{Y}\frac{d^2Y}{dy^2} = -k_y^2, \tag{9.184} $$

with

$$ -k_x^2 - k_y^2 + (\omega/c)^2 - k^2 = 0. \tag{9.185} $$

[Fig. 9.24: rectangular wave guide.]

The general solution to Eq. 9.184i is

$$ X(x) = A\sin(k_x x) + B\cos(k_x x). $$

But the boundary conditions require that $B_x$, and hence also (Eq. 9.182iii)
$dX/dx$, vanishes at $x = 0$ and $x = a$. So $A = 0$, and

$$ k_x = m\pi/a \quad (m = 0, 1, 2, \ldots). \tag{9.186} $$

The same goes for $Y$, with

$$ k_y = n\pi/b \quad (n = 0, 1, 2, \ldots), \tag{9.187} $$

and we conclude that

$$ B_z = B_0\cos(m\pi x/a)\cos(n\pi y/b). \tag{9.188} $$
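As a sanity check on the separated solution, the sketch below verifies numerically that Eq. 9.188 satisfies Eq. 9.183ii (using Eq. 9.185 to write $(\omega/c)^2 - k^2 = k_x^2 + k_y^2$) and that $\partial B_z/\partial x$, which for TE waves is proportional to $B_x$ by Eq. 9.182iii, vanishes at the walls $x = 0$ and $x = a$. The dimensions, mode indices, and sample point are arbitrary illustrative choices, and the helper names are hypothetical.

```python
import math

def bz(x, y, m, n, a, b, b0=1.0):
    """Longitudinal field of the TE_mn mode, Eq. 9.188."""
    return b0 * math.cos(m * math.pi * x / a) * math.cos(n * math.pi * y / b)

def helmholtz_residual(x, y, m, n, a, b, h):
    """Finite-difference estimate of [d2/dx2 + d2/dy2 + kx^2 + ky^2] B_z at (x, y)."""
    f = lambda u, v: bz(u, v, m, n, a, b)
    laplacian = (f(x + h, y) + f(x - h, y) + f(x, y + h) + f(x, y - h) - 4.0 * f(x, y)) / h**2
    kx, ky = m * math.pi / a, n * math.pi / b       # Eqs. 9.186 and 9.187
    return laplacian + (kx**2 + ky**2) * f(x, y)

def dbz_dx(x, y, m, n, a, b, h):
    """Central-difference estimate of dB_z/dx (proportional to B_x for TE waves)."""
    return (bz(x + h, y, m, n, a, b) - bz(x - h, y, m, n, a, b)) / (2.0 * h)

a, b, m, n = 0.03, 0.015, 2, 1       # metres and mode indices, chosen for illustration
h = 1.0e-3 * a                       # finite-difference step
scale = (m * math.pi / a) ** 2 + (n * math.pi / b) ** 2

relative_residual = helmholtz_residual(0.011, 0.004, m, n, a, b, h) / scale
wall_slopes = (dbz_dx(0.0, 0.004, m, n, a, b, h), dbz_dx(a, 0.004, m, n, a, b, h))
print("relative residual of Eq. 9.183ii:", relative_residual)
print("dBz/dx at x = 0 and x = a:", wall_slopes)
```

Both numbers should be zero up to discretization and rounding error; replacing the cosine in $x$ by a sine would leave the residual small but make the wall slopes nonzero, which is why $A = 0$.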


This solution is called the TE$_{mn}$ mode. (The first index is conventionally
associated with the larger dimension, so we assume $a > b$. By the way, at
least one of the indices must be nonzero; see Prob. 9.29.) The wave number
($k$) is obtained by putting Eqs. 9.186 and 9.187 into Eq. 9.185:

$$ k = \sqrt{(\omega/c)^2 - \pi^2\left[(m/a)^2 + (n/b)^2\right]}. \tag{9.189} $$

If

$$ \omega < c\pi\sqrt{(m/a)^2 + (n/b)^2} \equiv \omega_{mn}, \tag{9.190} $$

the wave number is imaginary, and instead of a traveling wave we have
exponentially attenuated fields (Eq. 9.178). For this reason, $\omega_{mn}$ is called
the cutoff frequency for the mode in question. The lowest cutoff frequency
for a given wave guide occurs for the mode TE$_{10}$:

$$ \omega_{10} = c\pi/a. \tag{9.191} $$

Frequencies less than this will not propagate at all.

The wave number can be written more simply in terms of the cutoff
frequency:

$$ k = \frac{1}{c}\sqrt{\omega^2 - \omega_{mn}^2}. \tag{9.192} $$

The wave velocity is

$$ v = \frac{\omega}{k} = \frac{c}{\sqrt{1 - (\omega_{mn}/\omega)^2}}, \tag{9.193} $$

which is greater than $c$. However (see Prob. 9.31), the energy carried by the
wave travels at the group velocity (Eq. 9.152):

$$ v_g = \frac{1}{dk/d\omega} = c\sqrt{1 - (\omega_{mn}/\omega)^2} < c. \tag{9.194} $$

[Fig. 9.25. Label: Wave fronts.]

There’s another way to visualize the propagation of an electromagnetic
wave in a rectangular pipe, and it serves to illuminate these results. Consider
an ordinary plane wave, traveling at an angle $\theta$ to the z-axis, and reflecting
perfectly off each conducting surface (Fig. 9.25). In the x and y directions,
the (multiply reflected) waves interfere to form standing wave patterns,
of wavelength $\lambda_x = 2a/m$ and $\lambda_y = 2b/n$ (hence wave number $k_x = 2\pi/\lambda_x = \pi m/a$
and $k_y = \pi n/b$), respectively. Meanwhile, in the z direction
there remains a traveling wave, with wave number $k_z = k$. The propagation
vector for the “original” plane wave is therefore

$$ \mathbf{k}' = \frac{\pi m}{a}\hat{\mathbf{x}} + \frac{\pi n}{b}\hat{\mathbf{y}} + k\hat{\mathbf{z}}, $$

and the frequency is

$$ \omega = c|\mathbf{k}'| = c\sqrt{k^2 + \pi^2\left[(m/a)^2 + (n/b)^2\right]} = \sqrt{(ck)^2 + (\omega_{mn})^2}. $$

Only certain angles will lead to one of the allowed standing wave patterns:

$$ \cos\theta = \frac{k}{|\mathbf{k}'|} = \sqrt{1 - (\omega_{mn}/\omega)^2}. $$

The plane wave travels at speed $c$, but because it is going at an angle $\theta$ to
the z-axis, its net velocity down the wave guide is

$$ v_g = c\cos\theta = c\sqrt{1 - (\omega_{mn}/\omega)^2}. $$

The wave velocity, on the other hand, is the speed of the wave fronts (A, say,
in Fig. 9.25) down the pipe. Like the intersection of a line of breakers with
the beach, they can move much faster than the waves themselves; in fact

$$ v = \frac{c}{\cos\theta} = \frac{c}{\sqrt{1 - (\omega_{mn}/\omega)^2}}. $$

Problem 9.29. Show that the mode TE$_{00}$ cannot occur in a rectangular wave guide. [Hint:
In this case $\omega/c = k$, so Eqs. 9.182 are indeterminate, and you must go back to Eq. 9.181. Show
that $B_z$ is a constant, and hence (applying Faraday’s law in integral form to a cross section)
that $B_z = 0$, so this would be a TEM mode.]

Problem 9.30. Consider a rectangular wave guide with dimensions 2.28 cm × 1.01 cm.
What TE modes will propagate in this wave guide, if the driving frequency is $1.70 \times 10^{10}$ Hz?
Suppose you wanted to excite only one TE mode; what range of frequencies could you use? What
are the corresponding wavelengths (in open space)?


Problem 9.31. Confirm that the energy in the TE$_{mn}$ mode travels at the group velocity. [Hint:
Find the time averaged Poynting vector $\langle\mathbf{S}\rangle$ and the energy density $\langle u\rangle$ (use Prob. 9.12 if you
wish). Integrate over the cross section of the wave guide to get the energy per unit time and per
unit length carried by the wave, and take their ratio.]


Problem 9.32. Work out the theory of TM modes for a rectangular wave guide. In particular, 
find the longitudinal electric field, the cutoff frequencies, and the wave and group velocities. 
Find the ratio of the lowest TM cutoff frequency to the lowest TE cutoff frequency, for a given 
wave guide. [Caution: What is the lowest TM mode?] 


9.5.3 The Coaxial Transmission Line 


In Section 9.5.1 I showed that a hollow wave guide cannot support TEM
waves. But a coaxial transmission line, consisting of a long straight wire
of radius $a$, surrounded by a cylindrical conducting sheath of radius $b$
(Fig. 9.26), does admit modes with $E_z = 0$ and $B_z = 0$. In this case Maxwell’s
equations (Eq. 9.181) yield

$$ k = \omega/c \tag{9.195} $$

(so the waves travel at speed $c$, and are nondispersive),

$$ cB_y = E_x \quad\text{and}\quad cB_x = -E_y \tag{9.196} $$

(so $\mathbf{E}$ and $\mathbf{B}$ are mutually perpendicular), and (together with $\nabla\cdot\mathbf{E} = 0$,
$\nabla\cdot\mathbf{B} = 0$)

$$ \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} = 0, \qquad \frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = 0, $$
$$ \frac{\partial B_x}{\partial x} + \frac{\partial B_y}{\partial y} = 0, \qquad \frac{\partial B_y}{\partial x} - \frac{\partial B_x}{\partial y} = 0. \tag{9.197} $$

[Fig. 9.26: coaxial transmission line.]

These are precisely the equations of electrostatics and magnetostatics, for
empty space, in two dimensions; the solution with cylindrical symmetry can
be borrowed directly from the case of an infinite line charge and an infinite
straight current, respectively:

$$ \mathbf{E}_0(s,\phi) = \frac{A}{s}\hat{\mathbf{s}}, \qquad \mathbf{B}_0(s,\phi) = \frac{A}{cs}\hat{\boldsymbol{\phi}}, \tag{9.198} $$

for some constant $A$. Substituting these into Eq. 9.178, and taking the
real part,

$$ \mathbf{E}(s,\phi,z,t) = \frac{A\cos(kz - \omega t)}{s}\hat{\mathbf{s}}, \qquad \mathbf{B}(s,\phi,z,t) = \frac{A\cos(kz - \omega t)}{cs}\hat{\boldsymbol{\phi}}. \tag{9.199} $$
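The claim that the borrowed static fields satisfy Eqs. 9.196 and 9.197 can be spot-checked numerically. The sketch below writes the transverse fields of Eq. 9.198 in Cartesian components, using $\hat{\mathbf{s}} = (x/s, y/s)$ and $\hat{\boldsymbol{\phi}} = (-y/s, x/s)$, and estimates the two-dimensional divergence and curl by central differences. The sample point, the choice $A = 1$, and the function names are arbitrary.

```python
C_LIGHT = 2.99792458e8   # speed of light (m/s)

def e0(x, y, amp=1.0):
    """Cartesian components of E0 = (A/s) s-hat, Eq. 9.198."""
    s2 = x * x + y * y
    return amp * x / s2, amp * y / s2

def b0(x, y, amp=1.0):
    """Cartesian components of B0 = (A/(c s)) phi-hat, Eq. 9.198."""
    s2 = x * x + y * y
    return -amp * y / (C_LIGHT * s2), amp * x / (C_LIGHT * s2)

def div_and_curl(field, x, y, h=1.0e-5):
    """Central-difference 2D divergence and z-component of the curl."""
    dfx_dx = (field(x + h, y)[0] - field(x - h, y)[0]) / (2.0 * h)
    dfy_dy = (field(x, y + h)[1] - field(x, y - h)[1]) / (2.0 * h)
    dfy_dx = (field(x + h, y)[1] - field(x - h, y)[1]) / (2.0 * h)
    dfx_dy = (field(x, y + h)[0] - field(x, y - h)[0]) / (2.0 * h)
    return dfx_dx + dfy_dy, dfy_dx - dfx_dy

x, y = 0.3, 0.4      # an arbitrary point between the conductors (arbitrary units)
print("E0: (div, curl) =", div_and_curl(e0, x, y))
print("B0: (div, curl) =", div_and_curl(b0, x, y))
ex, ey = e0(x, y)
bx, by = b0(x, y)
print("c*By - Ex =", C_LIGHT * by - ex, "  c*Bx + Ey =", C_LIGHT * bx + ey)
```

All six printed numbers should vanish to within finite-difference and rounding error, away from the axis $s = 0$ (which lies inside the inner conductor anyway).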


Problem 9.33. 


(a) Show directly that Eqs. 9.199 satisfy Maxwell's equations (Eq. 9.179) and the boundary 
conditions (Eq. 9.177). 
(b) Find the charge density, $\lambda(z,t)$, and the current, $I(z,t)$, on the inner conductor.


More Problems on Chapter 9 


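Problem 9.34. The “inversion theorem” for Fourier transforms states that

$$ \tilde\phi(z) = \int_{-\infty}^{\infty}\tilde\Phi(k)\,e^{ikz}\,dk \quad\Longleftrightarrow\quad \tilde\Phi(k) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\tilde\phi(z)\,e^{-ikz}\,dz. \tag{9.200} $$

Use this to determine $\tilde{A}(k)$, in Eq. 9.20, in terms of $f(z,0)$ and $\dot{f}(z,0)$.
[Answer: $(1/2\pi)\int_{-\infty}^{\infty}\left[f(z,0) + (i/\omega)\dot{f}(z,0)\right]e^{-ikz}\,dz$]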


Problem 9.35. 


(a) Equation 9.49 represents a plane wave polarized in the x direction and traveling (through 
vacuum) in the z direction. Write down the analogous expression for a wave polarized in 
the y direction and traveling in the z direction. (Note that E x B must point in the direction 
of propagation.) 

(b) From Eq. 9.49 and your solution to (a), construct the two circularly polarized waves propa- 
gating in the z direction (see Prob. 9.8). 

(c) Standing waves are produced when waves of the same amplitude and frequency but 
propagating in opposite directions are superimposed. Construct the plane-polarized stand- 
ing wave obtained from the fields in Eq. 9.49 (use trig identities to simplify the result). Find 
the energy density (u), momentum density (g), and Poynting vector S for this wave. Also 
determine the time averages: $\langle u\rangle$, $\langle\mathbf{g}\rangle$, and $\langle\mathbf{S}\rangle$.

(d) Construct the circularly polarized standing waves, obtained from your solution to (b), and 
find u, g, and S. 
Problem 9.36. [The naive explanation for the pressure of light offered in Section 9.2.3 has
its flaws, as you discovered if you worked Prob. 9.11. Here's another account, due originally
to Planck.³⁰] A plane wave traveling through vacuum in the z direction encounters a perfect
conductor occupying the region $z > 0$, and reflects back:

$$ \mathbf{E}(z,t) = E_0\left[\cos(kz - \omega t) - \cos(kz + \omega t)\right]\hat{\mathbf{x}} \quad (z < 0). $$

(a) Find the accompanying magnetic field (in the region $z < 0$).

(b) Assuming $\mathbf{B} = 0$ inside the conductor, find the current $\mathbf{K}$ on the surface $z = 0$, by invoking
the appropriate boundary condition.

(c) Find the magnetic force per unit area on the surface, and compare its time average with the 
expected radiation pressure (Eq. 9.65). 


Problem 9.37. Suppose

$$ \mathbf{E}(r,\theta,\phi,t) = A\,\frac{\sin\theta}{r}\left[\cos(kr - \omega t) - (1/kr)\sin(kr - \omega t)\right]\hat{\boldsymbol{\phi}}, \quad\text{with } \frac{\omega}{k} = c. $$

(This is, incidentally, the simplest possible spherical wave. For notational convenience, let
$(kr - \omega t) \equiv u$ in your calculations.)


(a) Show that $\mathbf{E}$ obeys all four of Maxwell's equations, in vacuum ($r \ne 0$), and find the
associated magnetic field.

(b) Calculate the Poynting vector. Average $\mathbf{S}$ over a full cycle to get the intensity vector $\mathbf{I}$. (Does
it point in the expected direction? Does it fall off like $r^{-2}$, as it should?)

(c) Integrate $\mathbf{I}\cdot d\mathbf{a}$ over a spherical surface to determine the total power radiated. [Answer:
$4\pi A^2/3\mu_0 c$]


Problem 9.38. Light of (angular) frequency $\omega$ passes from medium 1, through a slab (thickness
$d$) of medium 2, and into medium 3 (for instance, from water through glass into air, as in
Fig. 9.27). Show that the transmission coefficient for normal incidence is given by

$$ T^{-1} = \frac{1}{4n_1n_3}\left[(n_1 + n_3)^2 + \frac{(n_1^2 - n_2^2)(n_3^2 - n_2^2)}{n_2^2}\sin^2\!\left(\frac{n_2\omega d}{c}\right)\right]. \tag{9.201} $$

[Hint: To the left, there is an incident wave and a reflected wave; to the right, there is a
transmitted wave; inside the slab, there is a wave going to the right and a wave going to the
left. Express each of these in terms of its complex amplitude, and relate the amplitudes by
imposing suitable boundary conditions at the two interfaces. All three media are linear and
homogeneous; assume $\mu_1 = \mu_2 = \mu_3 = \mu_0$.]


Problem 9.39. A microwave antenna radiating at 10 GHz is to be protected from the envi- 
ronment by a plastic shield of dielectric constant 2.5. What is the minimum thickness of 
this shielding that will allow perfect transmission (assuming normal incidence)? [Hint: Use 
Eq. 9.201.] 


30 T. Rothman and S. Boughn, Am. J. Phys. 77, 122 (2009), Sect. IV.
[Fig. 9.27: Water, Glass, Air (left to right along $z$).]


Problem 9.40. Light from an aquarium (Fig. 9.27) goes from water ($n = \frac{4}{3}$) through a
plane of glass ($n = \frac{3}{2}$) into air ($n = 1$). Assuming it’s a monochromatic plane wave and that it
strikes the glass at normal incidence, find the minimum and maximum transmission coefficients
(Eq. 9.201). You can see the fish clearly; how well can it see you?


Problem 9.41. According to Snell’s law, when light passes from an optically dense medium
into a less dense one ($n_1 > n_2$), the propagation vector $\mathbf{k}$ bends away from the normal
(Fig. 9.28). In particular, if the light is incident at the critical angle

$$ \theta_c \equiv \sin^{-1}(n_2/n_1), \tag{9.202} $$

then $\theta_T = 90°$, and the transmitted ray just grazes the surface. If $\theta_I$ exceeds $\theta_c$, there is no
refracted ray at all, only a reflected one (this is the phenomenon of total internal reflection,
on which light pipes and fiber optics are based). But the fields are not zero in medium 2; what
we get is a so-called evanescent wave, which is rapidly attenuated and transports no energy
into medium 2.³¹

A quick way to construct the evanescent wave is simply to quote the results of Section 9.3.3,
with $k_T = \omega n_2/c$ and

$$ \mathbf{k}_T = k_T(\sin\theta_T\,\hat{\mathbf{x}} + \cos\theta_T\,\hat{\mathbf{z}}); $$

the only change is that

$$ \sin\theta_T = \frac{n_1}{n_2}\sin\theta_I $$

is now greater than 1, and

$$ \cos\theta_T = \sqrt{1 - \sin^2\theta_T} = i\sqrt{\sin^2\theta_T - 1} $$

is imaginary. (Obviously, $\theta_T$ can no longer be interpreted as an angle!)


31 The evanescent fields can be detected by placing a second interface a short distance to the
right of the first; in a close analog to quantum mechanical tunneling, the wave crosses the
gap and reassembles to the right. See F. Albiol, S. Navas, and M. V. Andres, Am. J. Phys.
61, 165 (1993).
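(a) Show that

$$\tilde{\mathbf{E}}_T(\mathbf{r}, t) = \tilde{\mathbf{E}}_{0_T}\, e^{-\kappa z} e^{i(kx - \omega t)}, \tag{9.203}$$

where

$$\kappa \equiv \frac{\omega}{c}\sqrt{(n_1\sin\theta_I)^2 - n_2^2} \quad\text{and}\quad k \equiv \frac{\omega n_1}{c}\sin\theta_I. \tag{9.204}$$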


This is a wave propagating in the x direction (parallel to the interface!), and attenuated in 
the z direction. 

(b) Noting that $\alpha$ (Eq. 9.109) is now imaginary, use Eq. 9.110 to calculate the reflection coefficient
for polarization parallel to the plane of incidence. [Notice that you get 100 percent
reflection, which is better than at a conducting surface (see, for example, Prob. 9.23).] 

(c) Do the same for polarization perpendicular to the plane of incidence (use the results of 
Prob. 9.17). 

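(d) In the case of polarization perpendicular to the plane of incidence, show that the (real)
evanescent fields are

$$\mathbf{E}(\mathbf{r}, t) = E_0 e^{-\kappa z}\cos(kx - \omega t)\,\hat{\mathbf{y}},$$

$$\mathbf{B}(\mathbf{r}, t) = \frac{E_0}{\omega} e^{-\kappa z}\left[\kappa\sin(kx - \omega t)\,\hat{\mathbf{x}} + k\cos(kx - \omega t)\,\hat{\mathbf{z}}\right]. \tag{9.205}$$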

(e) Check that the fields in (d) satisfy all of Maxwell's equations (Eq. 9.68). 


(f) For the fields in (d), construct the Poynting vector, and show that, on average, no energy is 
transmitted in the z direction. 


Problem 9.42. Consider the resonant cavity produced by closing off the two ends of a 
rectangular wave guide, at $z = 0$ and at $z = d$, making a perfectly conducting empty box.
Show that the resonant frequencies for both TE and TM modes are given by

$$\omega_{lmn} = c\pi\sqrt{(l/d)^2 + (m/a)^2 + (n/b)^2}, \tag{9.206}$$

for integers $l$, $m$, and $n$. Find the associated electric and magnetic fields.
Problem 9.43. Energy conservation at the interface (Eqs. 9.114–9.118) is a little more subtle
than I suggested in Section 9.3.3. Consider a volume V (with surface S), straddling the boundary.
The energy per unit time transported by the fields out of this volume is given by Poynting’s
theorem (Eq. 8.9):

$$\oint_{\mathcal{S}} \mathbf{S}\cdot d\mathbf{a} = 0$$

(at least, when time averaged over a cycle), since there is no energy source or sink in V.


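(a) Choose for V an infinitesimally thin disc extending just barely to either side of the interface.
Show that

$$I_I = I_R + I_T$$

(Eq. 9.118), where these terms are defined by Eqs. 9.114 and 9.115. Beware: In medium 1 there
are two superimposed fields:

$$\mathbf{S}_1 = \frac{1}{\mu_1}(\mathbf{E}_I + \mathbf{E}_R)\times(\mathbf{B}_I + \mathbf{B}_R) = \mathbf{S}_I + \mathbf{S}_R + \frac{1}{\mu_1}\left[(\mathbf{E}_I\times\mathbf{B}_R) + (\mathbf{E}_R\times\mathbf{B}_I)\right];$$

what happens to the cross terms?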

(b) In practice we don't use plane waves (which would extend to infinity) but narrow beams,
perhaps from a laser. Let’s pretend that the plane waves could somehow be confined
to cylindrical tubes, forming the beams.³² This time use a balloon-shaped volume that is
punctured in three places by the incident, reflected, and transmitted beams (in fact, you 
might as well distort the surface S so it lies perpendicular to each beam at the puncture). 
Again, confirm Eq. 9.118; you won't need to worry about overlapping fields, but you do need 
to figure out the relative cross-sectional areas of the three beams. 




Problem 9.44. (Optical tweezers) In Prob. 4.33 you showed that it is impossible to trap a 
neutral (but polarizable) particle in an electrostatic field. However, it is possible if you use time- 
dependent electromagnetic fields, such as a laser beam. Show that the force can be written in 
the form 


$$\mathbf{F} = \alpha\left[\frac{1}{2}\nabla(E^2) + \frac{\partial}{\partial t}(\mathbf{E}\times\mathbf{B})\right], \tag{9.207}$$

where $\alpha$ is the polarizability. [Assume that the particle as a whole is at rest (or at any rate moving
at much less than the speed of light), but its internal parts may be moving very rapidly. Treat it
as an electric dipole at point r, consisting of charges ±q separated by a displacement e. Thus
you may drop dr/dt but you must keep de/dt.]

In a light beam, for instance, the second term oscillates rapidly, and averages to zero, so the
effective force is

$$\mathbf{F} = \frac{\alpha}{2}\nabla(E^2), \tag{9.208}$$

just as in the static case (Prob. 4.33). But in the time-dependent regime $E^2$ can have a local
maximum (for example, at the focal point of a laser beam), and this allows for optical trapping
(“tweezers”).


32 That’s actually impossible; a real light beam is much more complicated. See J. Lekner, 
Theory of Electromagnetic Beams, Morgan and Claypool, San Rafael (2020). 
10.1 The Potential Formulation


10.1.1 Scalar and Vector Potentials 


In this chapter we seek the general solution to Maxwell’s equations, 


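$$\begin{aligned}
&\text{(i)}\;\; \nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho, &\qquad &\text{(iii)}\;\; \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},\\
&\text{(ii)}\;\; \nabla\cdot\mathbf{B} = 0, &\qquad &\text{(iv)}\;\; \nabla\times\mathbf{B} = \mu_0\mathbf{J} + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t}.
\end{aligned} \tag{10.1}$$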


Given the sources p(r, t) and J(r, t), what are the fields E(r, t) and B(r, t)? In 
the static case, Coulomb’s law and the Biot—Savart law provide the answer. 
What we're looking for, then, is the generalization of those laws to time- 
dependent configurations. 

This is not an easy problem, and it pays to begin by representing the fields 
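in terms of potentials. In electrostatics $\nabla\times\mathbf{E} = 0$ allowed us to write E as the
gradient of a scalar potential: $\mathbf{E} = -\nabla V$. In electrodynamics this is no longer
possible, because the curl of E is nonzero. But B remains divergenceless, so
we can still write

$$\mathbf{B} = \nabla\times\mathbf{A}, \tag{10.2}$$

as in magnetostatics. Putting this into Faraday’s law (iii) yields

$$\nabla\times\mathbf{E} = -\frac{\partial}{\partial t}(\nabla\times\mathbf{A}),$$

or

$$\nabla\times\left(\mathbf{E} + \frac{\partial\mathbf{A}}{\partial t}\right) = 0.$$

Here is a quantity, unlike E alone, whose curl does vanish; it can therefore
be written as the gradient of a scalar:

$$\mathbf{E} + \frac{\partial\mathbf{A}}{\partial t} = -\nabla V.$$

In terms of V and A, then,

$$\mathbf{E} = -\nabla V - \frac{\partial\mathbf{A}}{\partial t}. \tag{10.3}$$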


This reduces to the old form, of course, when A is constant. 
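
As a quick check (a sketch of the reasoning, not an additional result), one can confirm that potentials of the form in Eqs. 10.2 and 10.3 satisfy the two source-free Maxwell equations identically. The only inputs are the standard facts that the divergence of a curl and the curl of a gradient both vanish:

```latex
\begin{aligned}
\nabla\cdot\mathbf{B} &= \nabla\cdot(\nabla\times\mathbf{A}) = 0,\\
\nabla\times\mathbf{E} &= -\nabla\times(\nabla V) - \frac{\partial}{\partial t}(\nabla\times\mathbf{A})
  = -\frac{\partial\mathbf{B}}{\partial t}.
\end{aligned}
```

So only the two equations with sources, (i) and (iv), remain to constrain V and A.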
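The potential representation (Eqs. 10.2 and 10.3) automatically fulfills
the two homogeneous Maxwell equations, (ii) and (iii). How about Gauss’s
law (i) and the Ampère/Maxwell law (iv)? Putting Eq. 10.3 into (i), we
find that

$$\nabla^2 V + \frac{\partial}{\partial t}(\nabla\cdot\mathbf{A}) = -\frac{1}{\epsilon_0}\rho; \tag{10.4}$$

this replaces Poisson’s equation (to which it reduces in the static case).
Putting Eqs. 10.2 and 10.3 into (iv) yields

$$\nabla\times(\nabla\times\mathbf{A}) = \mu_0\mathbf{J} - \mu_0\epsilon_0\nabla\left(\frac{\partial V}{\partial t}\right) - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2},$$

or, using the vector identity $\nabla\times(\nabla\times\mathbf{A}) = \nabla(\nabla\cdot\mathbf{A}) - \nabla^2\mathbf{A}$, and rearranging
the terms a bit,

$$\left(\nabla^2\mathbf{A} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2}\right) - \nabla\left(\nabla\cdot\mathbf{A} + \mu_0\epsilon_0\frac{\partial V}{\partial t}\right) = -\mu_0\mathbf{J}. \tag{10.5}$$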


Equations 10.4 and 10.5 contain all the information in Maxwell’s equations. 


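Example 10.1. Find the charge and current distributions that would
give rise to the potentials

$$V = 0, \qquad \mathbf{A} = \begin{cases} \dfrac{\mu_0 k}{4c}(ct - |x|)^2\,\hat{\mathbf{z}} & \text{for } |x| < ct,\\[1ex] 0 & \text{for } |x| > ct, \end{cases}$$

where k is a constant, and (of course) $c = 1/\sqrt{\epsilon_0\mu_0}$.

Solution. First we'll determine the electric and magnetic fields, using
Eqs. 10.2 and 10.3:

$$\mathbf{E} = -\frac{\partial\mathbf{A}}{\partial t} = -\frac{\mu_0 k}{2}(ct - |x|)\,\hat{\mathbf{z}},$$

$$\mathbf{B} = \nabla\times\mathbf{A} = -\frac{\mu_0 k}{4c}\frac{\partial}{\partial x}(ct - |x|)^2\,\hat{\mathbf{y}} = \pm\frac{\mu_0 k}{2c}(ct - |x|)\,\hat{\mathbf{y}}$$

(plus, for x > 0; minus, for x < 0). These are for |x| < ct; when |x| > ct,
E = B = 0 (Fig. 10.1). Calculating every derivative in sight, I find

$$\nabla\cdot\mathbf{E} = 0; \quad \nabla\cdot\mathbf{B} = 0; \quad \nabla\times\mathbf{E} = \mp\frac{\mu_0 k}{2}\,\hat{\mathbf{y}}; \quad \nabla\times\mathbf{B} = -\frac{\mu_0 k}{2c}\,\hat{\mathbf{z}};$$

$$\frac{\partial\mathbf{E}}{\partial t} = -\frac{\mu_0 k c}{2}\,\hat{\mathbf{z}}; \quad \frac{\partial\mathbf{B}}{\partial t} = \pm\frac{\mu_0 k}{2}\,\hat{\mathbf{y}}.$$

As you can easily check, Maxwell’s equations are all satisfied, with ρ
and J both zero. Notice, however, that B has a discontinuity at x = 0,
and this signals the presence of a surface current K in the yz plane;
boundary condition (iv) in Eq. 7.73 gives

$$kt\,\hat{\mathbf{y}} = \mathbf{K}\times\hat{\mathbf{x}},$$

and hence

$$\mathbf{K} = kt\,\hat{\mathbf{z}}.$$

Evidently, we have here a uniform surface current flowing in the
z direction over the plane x = 0, which starts up at t = 0, and
increases in proportion to t. Notice that the “news” travels out (in
both directions) at the speed of light: for points |x| > ct the message
(“current is now flowing”) has not yet arrived, so the fields are zero.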
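
The results of Ex. 10.1 can be spot-checked numerically. The following is a minimal sketch, not part of the original example: it differentiates the given A by central differences and compares with the closed forms for E, B, and the surface current K. The function names, step sizes, and sample point are assumptions chosen for illustration; SI units with k = 1 are used.

```python
import math

MU0 = 4e-7 * math.pi        # vacuum permeability in SI units (classic value)
C = 299_792_458.0           # speed of light, m/s

def A_z(x, t, k=1.0):
    """z component of A from Ex. 10.1 (V = 0); zero outside the light front."""
    if abs(x) >= C * t:
        return 0.0
    return MU0 * k / (4 * C) * (C * t - abs(x)) ** 2

def E_z(x, t, k=1.0, dt=1e-12):
    """E = -dA/dt, estimated by a central difference in time."""
    return -(A_z(x, t + dt, k) - A_z(x, t - dt, k)) / (2 * dt)

def B_y(x, t, k=1.0, dx=1e-4):
    """B = curl A; since A = A_z(x, t) z-hat, the only component is B_y = -dA_z/dx."""
    return -(A_z(x + dx, t, k) - A_z(x - dx, t, k)) / (2 * dx)

def surface_current(t, k=1.0, x0=1e-3):
    """K_z from the jump in B_y across x = 0: B_y(0+) - B_y(0-) = mu0 * K_z."""
    return (B_y(x0, t, k) - B_y(-x0, t, k)) / MU0

if __name__ == "__main__":
    t, x = 1e-8, 1.0        # ct is about 3 m, so x = 1 m lies inside the front
    print(E_z(x, t), -MU0 / 2 * (C * t - x))          # numeric vs. closed form
    print(B_y(x, t), MU0 / (2 * C) * (C * t - x))     # numeric vs. closed form
    print(surface_current(t), 1.0 * t)                # jump in B vs. K = k t
```

Because the jump in B is sampled a small distance x0 from the sheet, the estimate of K agrees with kt only up to a correction of relative size x0/(ct).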


Problem 10.1. Show that the differential equations for V and A (Eqs. 10.4 and 10.5) can be 
written in the more symmetrical form 


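$$\Box^2 V + \frac{\partial L}{\partial t} = -\frac{1}{\epsilon_0}\rho, \qquad \Box^2\mathbf{A} - \nabla L = -\mu_0\mathbf{J}, \tag{10.6}$$

where

$$\Box^2 \equiv \nabla^2 - \mu_0\epsilon_0\frac{\partial^2}{\partial t^2} \quad\text{and}\quad L \equiv \nabla\cdot\mathbf{A} + \mu_0\epsilon_0\frac{\partial V}{\partial t}.$$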


Problem 10.2. For the configuration in Ex. 10.1, consider a rectangular box of length l, width
w, and height h, situated a distance d above the yz plane (Fig. 10.2).

(a) Find the energy in the box at time $t_1 = d/c$, and at $t_2 = (d + h)/c$.

(b) Find the Poynting vector, and determine the energy per unit time flowing into the box
during the interval $t_1 < t < t_2$.

(c) Integrate the result in (b) from $t_1$ to $t_2$, and confirm that the increase in energy (part (a))
equals the net influx.


10.1.2 Gauge Transformations 


Equations 10.4 and 10.5 are ugly, and you might be inclined to abandon the 
potential formulation altogether. However, we have succeeded in reducing 
six problems — finding E and B (three components each) — down to four: V 
(one component) and A (three more). Moreover, Eqs. 10.2 and 10.3 do not 
uniquely define the potentials; we are free to impose extra conditions on 
V and A, as long as nothing happens to E and B. Let’s work out precisely 
what this gauge freedom entails. 

Suppose we have two sets of potentials, (V,A) and (V’, A’), which 
correspond to the same electric and magnetic fields. By how much can they 
differ? Write 


$$\mathbf{A}' = \mathbf{A} + \boldsymbol{\alpha} \quad\text{and}\quad V' = V + \beta.$$

Since the two As give the same B, their curls must be equal, and hence

$$\nabla\times\boldsymbol{\alpha} = 0.$$

We can therefore write $\boldsymbol{\alpha}$ as the gradient of a scalar:

$$\boldsymbol{\alpha} = \nabla\lambda.$$

The two potentials also give the same E, so

$$\nabla\beta + \frac{\partial\boldsymbol{\alpha}}{\partial t} = 0,$$

or

$$\nabla\left(\beta + \frac{\partial\lambda}{\partial t}\right) = 0.$$

The term in parentheses is therefore independent of position (it could,
however, depend on time); call it k(t):

$$\beta = -\frac{\partial\lambda}{\partial t} + k(t).$$

Actually, we might as well absorb k(t) into λ, defining a new λ by adding
$\int_0^t k(t')\,dt'$ to the old one. This will not affect the gradient of λ; it just adds
k(t) to ∂λ/∂t. It follows that

$$\mathbf{A}' = \mathbf{A} + \nabla\lambda, \qquad V' = V - \frac{\partial\lambda}{\partial t}. \tag{10.7}$$

Conclusion: For any old scalar function λ(r, t), we can with impunity add
∇λ to A, provided we simultaneously subtract ∂λ/∂t from V. This will
not affect the physical quantities E and B. Such changes in V and A
are called gauge transformations. They can be exploited to adjust the 
divergence of A, with a view to simplifying the “ugly” equations 10.4 
and 10.5. In magnetostatics, it was best to choose $\nabla\cdot\mathbf{A} = 0$ (Eq. 5.63);
in electrodynamics, the situation is not so clear-cut, and the most con- 
venient gauge depends to some extent on the problem at hand. There 
are many famous gauges in the literature; I'll show you the two most 
popular ones. 
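
Gauge invariance is easy to test numerically. The sketch below is illustrative only: the potentials V, Ax, Ay and the gauge function λ = sin(x) y² t are hypothetical choices (none of them appear in the text), restricted to two space dimensions to keep the code short. It applies Eq. 10.7 and confirms that E and B, computed by central differences from Eqs. 10.2 and 10.3, come out the same before and after.

```python
import math

def partial(f, p, i, h=1e-5):
    """Central-difference derivative of f(x, y, t) with respect to argument i."""
    up, dn = list(p), list(p)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)

def fields(V, Ax, Ay, p):
    """(E_x, E_y, B_z) at p = (x, y, t) for potentials V and A = (Ax, Ay, 0)."""
    Ex = -partial(V, p, 0) - partial(Ax, p, 2)   # E = -grad V - dA/dt
    Ey = -partial(V, p, 1) - partial(Ay, p, 2)
    Bz = partial(Ay, p, 0) - partial(Ax, p, 1)   # B = curl A
    return Ex, Ey, Bz

# Hypothetical starting potentials (chosen only for illustration).
V = lambda x, y, t: x * y * math.cos(t)
Ax = lambda x, y, t: math.sin(x * t) + y
Ay = lambda x, y, t: x * x * t

# Hypothetical gauge function lambda = sin(x) y^2 t, differentiated by hand.
V2 = lambda x, y, t: V(x, y, t) - math.sin(x) * y * y          # V - d(lambda)/dt
Ax2 = lambda x, y, t: Ax(x, y, t) + math.cos(x) * y * y * t    # Ax + d(lambda)/dx
Ay2 = lambda x, y, t: Ay(x, y, t) + 2 * math.sin(x) * y * t    # Ay + d(lambda)/dy

p = (0.3, -0.7, 1.2)
before = fields(V, Ax, Ay, p)
after = fields(V2, Ax2, Ay2, p)

if __name__ == "__main__":
    print("fields before:", before)
    print("fields after: ", after)
```

The potentials themselves differ at the sample point, but the fields agree to within the finite-difference error.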


Problem 10.3. 


(a) Find the fields, and the charge and current distributions, corresponding to

$$V(\mathbf{r}, t) = 0, \qquad \mathbf{A}(\mathbf{r}, t) = -\frac{1}{4\pi\epsilon_0}\frac{qt}{r^2}\,\hat{\mathbf{r}}.$$

(b) Use the gauge function $\lambda = -(1/4\pi\epsilon_0)(qt/r)$ to transform the potentials, and comment
on the result.


Problem 10.4. Suppose $V = 0$ and $\mathbf{A} = A_0\sin(kx - \omega t)\,\hat{\mathbf{y}}$, where $A_0$, $\omega$, and $k$ are
constants. Find E and B, and check that they satisfy Maxwell's equations in vacuum. What
condition must you impose on $\omega$ and $k$?


10.1.3 Coulomb Gauge and Lorenz Gauge 


The Coulomb Gauge. As in magnetostatics, we pick 
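$$\nabla\cdot\mathbf{A} = 0. \tag{10.8}$$

With this, Eq. 10.4 becomes

$$\nabla^2 V = -\frac{1}{\epsilon_0}\rho. \tag{10.9}$$

This is Poisson’s equation, and we already know how to solve it: setting
V = 0 at infinity,

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t)}{𝓇}\,d\tau'. \tag{10.10}$$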


There is a very peculiar thing about the scalar potential in the Coulomb 
gauge: it is determined by the distribution of charge right now. If I move an 
electron in my laboratory, the potential V on the moon immediately records 
this change. That sounds particularly odd in the light of special relativity, 
which allows no message to travel faster than c. The point is that V by 
itself is not a physically measurable quantity — all the man in the moon can 
measure is E, and that involves A as well (Eq. 10.3). Somehow it is built into 
the vector potential (in the Coulomb gauge) that whereas V instantaneously 
reflects all changes in ρ, the combination $-\nabla V - (\partial\mathbf{A}/\partial t)$ does not; E will
change only after sufficient time has elapsed for the “news” to arrive.¹

The advantage of the Coulomb gauge is that the scalar potential is
particularly simple to calculate; the disadvantage (apart from the acausal
appearance of V) is that A is particularly difficult to calculate. The differential
equation for A (Eq. 10.5) in the Coulomb gauge reads

$$\nabla^2\mathbf{A} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2} = -\mu_0\mathbf{J} + \mu_0\epsilon_0\nabla\left(\frac{\partial V}{\partial t}\right). \tag{10.11}$$

The Lorenz gauge. In the Lorenz² gauge, we pick

$$\nabla\cdot\mathbf{A} = -\mu_0\epsilon_0\frac{\partial V}{\partial t}. \tag{10.12}$$


This is designed to eliminate the middle term in Eq. 10.5 (in the language 
of Prob. 10.1, it sets L = 0). With this, 


$$\nabla^2\mathbf{A} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2} = -\mu_0\mathbf{J}. \tag{10.13}$$

Meanwhile, the differential equation for V (Eq. 10.4) becomes

$$\nabla^2 V - \mu_0\epsilon_0\frac{\partial^2 V}{\partial t^2} = -\frac{1}{\epsilon_0}\rho. \tag{10.14}$$

The virtue of the Lorenz gauge is that it treats V and A on an equal
footing: the same differential operator

$$\nabla^2 - \mu_0\epsilon_0\frac{\partial^2}{\partial t^2} \equiv \Box^2 \tag{10.15}$$

(called the d’Alembertian) occurs in both equations:

$$\text{(i)}\;\; \Box^2 V = -\frac{1}{\epsilon_0}\rho, \qquad \text{(ii)}\;\; \Box^2\mathbf{A} = -\mu_0\mathbf{J}. \tag{10.16}$$

This democratic treatment of V and A is especially nice in the context of
special relativity, where the d’Alembertian is the natural generalization of
the Laplacian, and Eqs. 10.16 can be regarded as four-dimensional versions
of Poisson’s equation. In this same spirit, the homogeneous wave equation
$\Box^2 f = 0$ might be regarded as the four-dimensional version of Laplace’s
equation. In the Lorenz gauge, V and A satisfy the inhomogeneous wave 
equation, with a “source” term (in place of zero) on the right. From now on, 
I shall use the Lorenz gauge exclusively, and the whole of electrodynamics 


1 See O. L. Brill and B. Goodman, Am. J. Phys. 35, 832 (1967); J. D. Jackson, Am. J. Phys. 70,
917 (2001).

2 Until recently, it was spelled “Lorentz,” in honor of the Dutch physicist H. A. Lorentz,
but it is now attributed to L. V. Lorenz, the Dane. See J. Van Bladel, IEEE Antennas and 
Propagation Magazine 33(2), 69 (1991); J. D. Jackson and L. B. Okun, Rev. Mod. Phys. 73, 
663 (2001). 
reduces to the problem of solving the inhomogeneous wave equation for a
specified source. 
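
To see where Eqs. 10.13 and 10.14 come from, it may help to write out the substitution explicitly. What follows is just the Lorenz condition (Eq. 10.12) inserted into Eqs. 10.4 and 10.5; no new assumptions are involved:

```latex
\begin{aligned}
\nabla^2 V + \frac{\partial}{\partial t}\left(-\mu_0\epsilon_0\frac{\partial V}{\partial t}\right)
  &= \nabla^2 V - \mu_0\epsilon_0\frac{\partial^2 V}{\partial t^2} = -\frac{1}{\epsilon_0}\rho,\\
\left(\nabla^2\mathbf{A} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2}\right)
  - \nabla\underbrace{\left(\nabla\cdot\mathbf{A} + \mu_0\epsilon_0\frac{\partial V}{\partial t}\right)}_{=\,0}
  &= \nabla^2\mathbf{A} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2} = -\mu_0\mathbf{J}.
\end{aligned}
```

The four equations decouple: each Cartesian component of A, and V itself, obeys the same inhomogeneous wave equation with its own source.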


Problem 10.5. Which of the potentials in Ex. 10.1, Prob. 10.3, and Prob. 10.4 are in the Coulomb 
gauge? Which are in the Lorenz gauge? (Notice that these gauges are not mutually exclusive.) 


Problem 10.6. In Chapter 5, I showed that it is always possible to pick a vector potential whose
divergence is zero (the Coulomb gauge). Show that it is always possible to choose
$\nabla\cdot\mathbf{A} = -\mu_0\epsilon_0(\partial V/\partial t)$, as required for the Lorenz gauge, assuming you know how to solve the
inhomogeneous wave equation (Eq. 10.16). Is it always possible to pick V = 0? How about
A = 0?


Problem 10.7. A time-dependent point charge q(t) at the origin, $\rho(\mathbf{r}, t) = q(t)\,\delta^3(\mathbf{r})$, is
fed by a current $\mathbf{J}(\mathbf{r}, t) = -(1/4\pi)(\dot{q}/r^2)\,\hat{\mathbf{r}}$, where $\dot{q} \equiv dq/dt$.


(a) Check that charge is conserved, by confirming that the continuity equation is obeyed.

(b) Find the scalar and vector potentials in the Coulomb gauge. If you get stuck, try working on
(c) first.

(c) Find the fields, and check that they satisfy all of Maxwell's equations.³


10.1.4 Lorentz Force Law in Potential Form 


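It⁴ is illuminating to express the Lorentz force law in terms of potentials:

$$\mathbf{F} = \frac{d\mathbf{p}}{dt} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}) = q\left[-\nabla V - \frac{\partial\mathbf{A}}{\partial t} + \mathbf{v}\times(\nabla\times\mathbf{A})\right], \tag{10.17}$$

where p = mv is the momentum of the particle. Now, product rule 4 says

$$\nabla(\mathbf{v}\cdot\mathbf{A}) = \mathbf{v}\times(\nabla\times\mathbf{A}) + (\mathbf{v}\cdot\nabla)\mathbf{A}$$

(v, the velocity of the particle, is a function of time, but not of position).
Thus

$$\frac{d\mathbf{p}}{dt} = -q\left[\frac{\partial\mathbf{A}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{A} + \nabla(V - \mathbf{v}\cdot\mathbf{A})\right]. \tag{10.18}$$

The combination

$$\left[\frac{\partial\mathbf{A}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{A}\right]$$

is called the convective derivative of A, and written dA/dt (total derivative).
It represents the time rate of change of A at the (moving) location of the
particle. For suppose that at time t the particle is at point r, where the
potential is A(r, t); a moment dt later it is at r + v dt, where the potential
is A(r + v dt, t + dt). The change in A, then, is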


3 P. R. Berman, Am. J. Phys. 76, 48 (2008).
4 This section can be skipped without loss of continuity. 
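$$d\mathbf{A} = \mathbf{A}(\mathbf{r} + \mathbf{v}\,dt, t + dt) - \mathbf{A}(\mathbf{r}, t)
= \left(\frac{\partial\mathbf{A}}{\partial x}\right)(v_x\,dt) + \left(\frac{\partial\mathbf{A}}{\partial y}\right)(v_y\,dt) + \left(\frac{\partial\mathbf{A}}{\partial z}\right)(v_z\,dt) + \left(\frac{\partial\mathbf{A}}{\partial t}\right)dt,$$

so

$$\frac{d\mathbf{A}}{dt} = \frac{\partial\mathbf{A}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{A}. \tag{10.19}$$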


As the particle moves, the potential it “feels” changes for two distinct 
reasons: first, because the potential varies with time, and second, because 
it is now in a new location, where A is different because of its variation in 
space. Hence the two terms in Eq. 10.19. 
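
The two-term structure of Eq. 10.19 can be illustrated numerically. In this minimal sketch the vector potential A is a made-up smooth function (an assumption for illustration, not a field from the text). One routine assembles ∂A/∂t + (v·∇)A term by term; the other simply samples A slightly ahead of and slightly behind the particle along its straight-line path. The two should agree.

```python
import math

def A(x, y, z, t):
    """A hypothetical smooth vector potential, chosen only for illustration."""
    return (math.sin(y) * t, x * z + t ** 2, math.cos(x) * y)

def convective_derivative(r, v, t, h=1e-5):
    """dA/dt = (partial A / partial t) + (v . grad) A, each term by central differences."""
    x, y, z = r

    def diff(plus, minus):
        return [(p - m) / (2 * h) for p, m in zip(plus, minus)]

    dA_dt = diff(A(x, y, z, t + h), A(x, y, z, t - h))
    dA_dx = diff(A(x + h, y, z, t), A(x - h, y, z, t))
    dA_dy = diff(A(x, y + h, z, t), A(x, y - h, z, t))
    dA_dz = diff(A(x, y, z + h, t), A(x, y, z - h, t))
    return [dA_dt[i] + v[0] * dA_dx[i] + v[1] * dA_dy[i] + v[2] * dA_dz[i]
            for i in range(3)]

def rate_along_path(r, v, t, h=1e-5):
    """Rate of change of A actually seen by a particle at r moving with velocity v."""
    ahead = A(*(ri + vi * h for ri, vi in zip(r, v)), t + h)
    behind = A(*(ri - vi * h for ri, vi in zip(r, v)), t - h)
    return [(a - b) / (2 * h) for a, b in zip(ahead, behind)]

if __name__ == "__main__":
    r, v, t = (0.4, -1.1, 0.8), (0.5, -1.0, 2.0), 0.3
    print(convective_derivative(r, v, t))
    print(rate_along_path(r, v, t))
```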

With the aid of the convective derivative, the Lorentz force law reads: 


$$\frac{d}{dt}(\mathbf{p} + q\mathbf{A}) = -\nabla[q(V - \mathbf{v}\cdot\mathbf{A})]. \tag{10.20}$$

This is reminiscent of the standard formula from mechanics, for the motion
of a particle whose potential energy U is a specified function of position:

$$\frac{d\mathbf{p}}{dt} = -\nabla U.$$

Playing the role of p is the so-called canonical momentum,

$$\mathbf{p}_{\text{can}} = \mathbf{p} + q\mathbf{A}, \tag{10.21}$$

while the part of U is taken by the velocity-dependent quantity

$$U_{\text{vel}} = q(V - \mathbf{v}\cdot\mathbf{A}). \tag{10.22}$$

A similar argument (Prob. 10.9) gives the rate of change of the particle’s
energy:

$$\frac{d}{dt}(T + qV) = \frac{\partial}{\partial t}[q(V - \mathbf{v}\cdot\mathbf{A})], \tag{10.23}$$

where $T = \frac{1}{2}mv^2$ is its kinetic energy and qV is its potential energy (the
derivative on the right acts only on V and A, not on v). Curiously, the
same quantity⁵ $U_{\text{vel}}$ appears on the right-hand side of both equations. The
parallel between Eq. 10.20 and Eq. 10.23 invites us to interpret A as a kind
of “potential momentum” per unit charge, just as V is potential energy per
unit charge.⁶


Problem 10.8. The vector potential for a uniform magnetostatic field is $\mathbf{A} = -\frac{1}{2}(\mathbf{r}\times\mathbf{B})$
(Prob. 5.27). Show that $d\mathbf{A}/dt = -\frac{1}{2}(\mathbf{v}\times\mathbf{B})$, in this case, and confirm that Eq. 10.20 yields
the correct equation of motion. 


Problem 10.9. Derive Eq. 10.23. [Hint: Start by dotting v into Eq. 10.17.] 


5 I don’t know what to call $U_{\text{vel}}$; it’s not potential energy, exactly (that would be qV).

6 There are other arguments for this interpretation, which Maxwell himself favored, and many
modern authors advocate. For a fascinating discussion, see M. D. Semon and J. R. Taylor,
Am. J. Phys. 64, 1361 (1996). Incidentally, it is the canonical angular momentum (derived
from $\mathbf{p}_{\text{can}}$), not the mechanical portion alone, that is quantized; see R. H. Young, Am. J.
Phys. 66, 1043 (1998).
10.2 Continuous Distributions


10.2.1 Retarded Potentials


In the static case, Eq. 10.16 reduces to (four copies of) Poisson’s equation,

$$\nabla^2 V = -\frac{1}{\epsilon_0}\rho, \qquad \nabla^2\mathbf{A} = -\mu_0\mathbf{J},$$

with the familiar solutions

$$V(\mathbf{r}) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}')}{𝓇}\,d\tau', \qquad \mathbf{A}(\mathbf{r}) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}')}{𝓇}\,d\tau', \tag{10.24}$$

where 𝓇, as always, is the distance from the source point r′ to the field point
r (Fig. 10.3). Now, electromagnetic “news” travels at the speed of light. In
the nonstatic case, therefore, it’s not the status of the source right now that
matters, but rather its condition at some earlier time $t_r$ (called the retarded
time) when the “message” left. Since this message must travel a distance 𝓇,
the delay is 𝓇/c:

$$t_r \equiv t - \frac{𝓇}{c}. \tag{10.25}$$

The natural generalization of Eq. 10.24 for nonstatic sources is therefore

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t_r)}{𝓇}\,d\tau', \qquad \mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t_r)}{𝓇}\,d\tau'. \tag{10.26}$$

Here $\rho(\mathbf{r}', t_r)$ is the charge density that prevailed at point r′ at the retarded
time $t_r$. Because the integrands are evaluated at the retarded time, these are
called retarded potentials. (I speak of “the” retarded time, but of course 
more distant parts of the charge distribution have earlier retarded times 
than nearby ones. It’s just like the night sky: the light we see now left each 
star at the retarded time corresponding to that star’s distance from the 
Earth.) Note that the retarded potentials reduce properly to Eq. 10.24 in 
the static case, for which p and J are independent of time. 
Well, that all sounds reasonable, and surprisingly simple. But are we sure
it’s right? I didn’t actually derive the formulas for V and A (Eq. 10.26); all 
I did was invoke a heuristic argument (“electromagnetic news travels at the 
speed of light”) to make them seem plausible. To prove them, I must show 
that they satisfy the inhomogeneous wave equation (Eq. 10.16) and meet 
the Lorenz condition (Eq. 10.12). In case you think I’m being fussy, let me 
warn you that if you apply the same logic to the fields you'll get entirely the 
wrong answer (see Prob. 10.36 for the correct argument): 


$$\mathbf{E}(\mathbf{r}, t) \neq \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t_r)}{𝓇^2}\,\hat{𝓻}\,d\tau', \qquad \mathbf{B}(\mathbf{r}, t) \neq \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t_r)\times\hat{𝓻}}{𝓇^2}\,d\tau'.$$

Let’s stop and check, then, that the retarded scalar potential satisfies
Eq. 10.16; essentially the same argument would serve for the vector
potential.⁷ I shall leave it for you (Prob. 10.10) to show that the retarded potentials
obey the Lorenz condition. 


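Proof. In calculating the Laplacian of V(r, t), the crucial point to
notice is that the integrand (in Eq. 10.26) depends on r in two places:
explicitly, in the denominator (𝓇 = |r − r′|), and implicitly, through
$t_r = t - 𝓇/c$, in the numerator. Thus

$$\nabla V = \frac{1}{4\pi\epsilon_0}\int\left[(\nabla\rho)\frac{1}{𝓇} + \rho\,\nabla\left(\frac{1}{𝓇}\right)\right]d\tau', \tag{10.27}$$

and

$$\nabla\rho = \dot{\rho}\,\nabla t_r = -\frac{1}{c}\dot{\rho}\,\nabla𝓇. \tag{10.28}$$

(The dot denotes differentiation with respect to time.)⁸ Now $\nabla𝓇 = \hat{𝓻}$
and $\nabla(1/𝓇) = -\hat{𝓻}/𝓇^2$ (Prob. 1.13), so

$$\nabla V = \frac{1}{4\pi\epsilon_0}\int\left[-\frac{\dot{\rho}}{c}\frac{\hat{𝓻}}{𝓇} - \rho\frac{\hat{𝓻}}{𝓇^2}\right]d\tau'. \tag{10.29}$$

Taking the divergence,

$$\nabla^2 V = \frac{1}{4\pi\epsilon_0}\int\left\{-\frac{1}{c}\left[\frac{\hat{𝓻}}{𝓇}\cdot(\nabla\dot{\rho}) + \dot{\rho}\,\nabla\cdot\left(\frac{\hat{𝓻}}{𝓇}\right)\right] - \left[\frac{\hat{𝓻}}{𝓇^2}\cdot(\nabla\rho) + \rho\,\nabla\cdot\left(\frac{\hat{𝓻}}{𝓇^2}\right)\right]\right\}d\tau'.$$

But

$$\nabla\dot{\rho} = -\frac{1}{c}\ddot{\rho}\,\nabla𝓇 = -\frac{1}{c}\ddot{\rho}\,\hat{𝓻},$$

as in Eq. 10.28, and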

7 I’ll give you the straightforward but cumbersome proof; for a clever indirect argument see
M. A. Heald and J. B. Marion, Classical Electromagnetic Radiation, 3rd ed., Saunders,
Orlando, FL (1995), Sect. 8.1.
8 Note that $\partial/\partial t_r = \partial/\partial t$, since $t_r = t - 𝓇/c$ and 𝓇 is independent of t.
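$$\nabla\cdot\left(\frac{\hat{𝓻}}{𝓇}\right) = \frac{1}{𝓇^2}$$

(Prob. 1.63), whereas

$$\nabla\cdot\left(\frac{\hat{𝓻}}{𝓇^2}\right) = 4\pi\delta^3(𝓻)$$

(Eq. 1.100). So

$$\nabla^2 V = \frac{1}{4\pi\epsilon_0}\int\left[\frac{1}{c^2}\frac{\ddot{\rho}}{𝓇} - 4\pi\rho\,\delta^3(𝓻)\right]d\tau' = \frac{1}{c^2}\frac{\partial^2 V}{\partial t^2} - \frac{1}{\epsilon_0}\rho(\mathbf{r}, t),$$

confirming that the retarded potential (Eq. 10.26) satisfies the
inhomogeneous wave equation (Eq. 10.16). ∎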


Incidentally, this proof applies equally well to the advanced potentials, 

$$V_a(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t_a)}{𝓇}\,d\tau', \qquad \mathbf{A}_a(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t_a)}{𝓇}\,d\tau', \tag{10.30}$$

in which the charge and the current densities are evaluated at the
advanced time

$$t_a \equiv t + \frac{𝓇}{c}. \tag{10.31}$$

A few signs are changed, but the final result is unaffected. Although the
advanced potentials are entirely consistent with Maxwell’s equations, they
violate the most sacred tenet in all of physics: the principle of causality.
They suggest that the potentials now depend on what the charge and the
current distribution will be at some time in the future; the effect, in other
words, precedes the cause. Although the advanced potentials are of some
theoretical interest, they have no direct physical significance.⁹


Example 10.2. An infinite straight wire carries the current

$$I(t) = \begin{cases} 0 & \text{for } t \le 0,\\ I_0 & \text{for } t > 0. \end{cases}$$

That is, a constant current $I_0$ is turned on abruptly at t = 0. Find the
resulting electric and magnetic fields.


Solution. The wire is presumably electrically neutral, so the scalar
potential is zero. Let the wire lie along the z-axis (Fig. 10.4); the
retarded vector potential at point P is

$$\mathbf{A}(s, t) = \frac{\mu_0}{4\pi}\,\hat{\mathbf{z}}\int_{-\infty}^{\infty}\frac{I(t_r)}{𝓇}\,dz.$$

For t < s/c, the “news” has not yet reached P, and the potential is zero.
For t > s/c, only the segment


9 Because the d’Alembertian involves $t^2$ (as opposed to $t$), the theory itself is time-reversal
invariant, and does not distinguish “past” from “future.” Time asymmetry is introduced
when we select the retarded potentials in preference to the advanced ones, reflecting the (not
unreasonable!) belief that electromagnetic influences propagate forward, not backward, in
time.
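$$|z| \le \sqrt{(ct)^2 - s^2} \tag{10.32}$$

contributes (outside this range $t_r$ is negative, so $I(t_r) = 0$); thus

$$\mathbf{A}(s, t) = \left(\frac{\mu_0 I_0}{4\pi}\,\hat{\mathbf{z}}\right) 2\int_0^{\sqrt{(ct)^2 - s^2}}\frac{dz}{\sqrt{s^2 + z^2}}$$

$$= \frac{\mu_0 I_0}{2\pi}\,\hat{\mathbf{z}}\,\ln\left(\sqrt{s^2 + z^2} + z\right)\Big|_0^{\sqrt{(ct)^2 - s^2}} = \frac{\mu_0 I_0}{2\pi}\ln\left(\frac{ct + \sqrt{(ct)^2 - s^2}}{s}\right)\hat{\mathbf{z}}.$$

The electric field is

$$\mathbf{E}(s, t) = -\frac{\partial\mathbf{A}}{\partial t} = -\frac{\mu_0 I_0 c}{2\pi\sqrt{(ct)^2 - s^2}}\,\hat{\mathbf{z}},$$

and the magnetic field is

$$\mathbf{B}(s, t) = \nabla\times\mathbf{A} = -\frac{\partial A_z}{\partial s}\,\hat{\boldsymbol{\phi}} = \frac{\mu_0 I_0}{2\pi s}\frac{ct}{\sqrt{(ct)^2 - s^2}}\,\hat{\boldsymbol{\phi}}.$$

Notice that as t → ∞ we recover the static case: E = 0, $\mathbf{B} = (\mu_0 I_0/2\pi s)\,\hat{\boldsymbol{\phi}}$.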
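
The integral in Ex. 10.2 is simple enough to verify by brute force. The sketch below is illustrative rather than definitive (the function names and the number of Simpson steps are arbitrary choices): it integrates the retarded potential over the segment of wire that has "reported in," compares the result with the logarithmic closed form, and checks the static limit of B.

```python
import math

MU0 = 4e-7 * math.pi        # vacuum permeability in SI units (classic value)
C = 299_792_458.0           # speed of light, m/s

def A_numeric(s, t, I0=1.0, n=20000):
    """Retarded A_z at distance s from the wire, by Simpson's rule (n must be even)."""
    if C * t <= s:
        return 0.0                       # the news has not yet arrived
    zmax = math.sqrt((C * t) ** 2 - s ** 2)
    h = zmax / n
    f = lambda z: 1.0 / math.sqrt(s * s + z * z)
    total = f(0.0) + f(zmax)
    total += sum((4 if i % 2 else 2) * f(i * h) for i in range(1, n))
    integral = total * h / 3
    return MU0 * I0 / (4 * math.pi) * 2 * integral   # factor 2: both halves of the wire

def A_closed(s, t, I0=1.0):
    """Closed-form A_z from the example."""
    if C * t <= s:
        return 0.0
    root = math.sqrt((C * t) ** 2 - s ** 2)
    return MU0 * I0 / (2 * math.pi) * math.log((C * t + root) / s)

def B_phi(s, t, I0=1.0):
    """Azimuthal magnetic field from the example, valid for ct > s."""
    root = math.sqrt((C * t) ** 2 - s ** 2)
    return MU0 * I0 / (2 * math.pi * s) * C * t / root

if __name__ == "__main__":
    s, t = 1.0, 3.0 / C                  # one metre from the wire, ct = 3 m
    print(A_numeric(s, t), A_closed(s, t))
    print(B_phi(s, 1.0), MU0 / (2 * math.pi * s))    # late time vs. static value
```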


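! Problem 10.10. Confirm that the retarded potentials satisfy the Lorenz gauge condition. [Hint:
First show that

$$\nabla\cdot\left(\frac{\mathbf{J}}{𝓇}\right) = \frac{1}{𝓇}(\nabla\cdot\mathbf{J}) + \frac{1}{𝓇}(\nabla'\cdot\mathbf{J}) - \nabla'\cdot\left(\frac{\mathbf{J}}{𝓇}\right),$$

where ∇ denotes derivatives with respect to r, and ∇′ denotes derivatives with respect to r′.
Next, noting that J(r′, t − 𝓇/c) depends on r′ both explicitly and through 𝓇, whereas it
depends on r only through 𝓇, confirm that

$$\nabla\cdot\mathbf{J} = -\frac{1}{c}\dot{\mathbf{J}}\cdot(\nabla𝓇), \qquad \nabla'\cdot\mathbf{J} = -\dot{\rho} - \frac{1}{c}\dot{\mathbf{J}}\cdot(\nabla'𝓇).$$

Use this to calculate the divergence of A (Eq. 10.26).]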


! Problem 10.11. 
(a) Suppose the wire in Ex. 10.2 carries a linearly increasing current 
I(t) = kt, 


for t > 0. Find the electric and magnetic fields generated.

(b) Do the same for the case of a sudden burst of current:

$$I(t) = q_0\,\delta(t).$$


Problem 10.12. A piece of wire bent into a loop, as shown in Fig. 10.5, carries a current that 
increases linearly with time: 


$$I(t) = kt \qquad (-\infty < t < \infty).$$


Calculate the retarded vector potential A at the center. Find the electric field at the center. Why 
does this (neutral) wire produce an electric field? (Why can't you determine the magnetic field 
from this expression for A?) 


Fig. 10.5 


10.2.2 Jefimenko’s Equations 


Given the retarded potentials 


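$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t_r)}{𝓇}\,d\tau', \qquad \mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t_r)}{𝓇}\,d\tau', \tag{10.33}$$

it is, in principle, a straightforward matter to determine the fields:

$$\mathbf{E} = -\nabla V - \frac{\partial\mathbf{A}}{\partial t}, \qquad \mathbf{B} = \nabla\times\mathbf{A}. \tag{10.34}$$

But the details are not entirely trivial because, as I mentioned earlier,
the integrands depend on r both explicitly, through 𝓇 = |r − r′| in the
denominator, and implicitly, through the retarded time $t_r = t - 𝓇/c$ in the
argument of the numerator.

I already calculated the gradient of V (Eq. 10.29); the time derivative of
A is easy:

$$\frac{\partial\mathbf{A}}{\partial t} = \frac{\mu_0}{4\pi}\int\frac{\dot{\mathbf{J}}}{𝓇}\,d\tau'. \tag{10.35}$$

Putting them together (and using $c^2 = 1/\mu_0\epsilon_0$),

$$\mathbf{E}(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\left[\frac{\rho(\mathbf{r}', t_r)}{𝓇^2}\,\hat{𝓻} + \frac{\dot{\rho}(\mathbf{r}', t_r)}{c𝓇}\,\hat{𝓻} - \frac{\dot{\mathbf{J}}(\mathbf{r}', t_r)}{c^2𝓇}\right]d\tau'. \tag{10.36}$$

This is the time-dependent generalization of Coulomb’s law, to which it
reduces in the static case (where the second and third terms drop out and
the first term loses its dependence on $t_r$).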




As for B, the curl of A contains two terms: 


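$$\nabla\times\mathbf{A} = \frac{\mu_0}{4\pi}\int\left[\frac{1}{𝓇}(\nabla\times\mathbf{J}) - \mathbf{J}\times\nabla\left(\frac{1}{𝓇}\right)\right]d\tau'.$$

Now

$$(\nabla\times\mathbf{J})_x = \frac{\partial J_z}{\partial y} - \frac{\partial J_y}{\partial z},$$

and

$$\frac{\partial J_z}{\partial y} = \dot{J}_z\frac{\partial t_r}{\partial y} = -\frac{1}{c}\dot{J}_z\frac{\partial𝓇}{\partial y},$$

so

$$(\nabla\times\mathbf{J})_x = -\frac{1}{c}\left(\dot{J}_z\frac{\partial𝓇}{\partial y} - \dot{J}_y\frac{\partial𝓇}{\partial z}\right) = \frac{1}{c}\left[\dot{\mathbf{J}}\times(\nabla𝓇)\right]_x.$$

But $\nabla𝓇 = \hat{𝓻}$ (Prob. 1.13), so

$$\nabla\times\mathbf{J} = \frac{1}{c}\dot{\mathbf{J}}\times\hat{𝓻}. \tag{10.37}$$

Meanwhile, $\nabla(1/𝓇) = -\hat{𝓻}/𝓇^2$ (again, Prob. 1.13), and hence

$$\mathbf{B}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int\left[\frac{\mathbf{J}(\mathbf{r}', t_r)}{𝓇^2} + \frac{\dot{\mathbf{J}}(\mathbf{r}', t_r)}{c𝓇}\right]\times\hat{𝓻}\,d\tau'. \tag{10.38}$$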


This is the time-dependent generalization of the Biot-Savart law, to which 
it reduces in the static case. 

Equations 10.36 and 10.38 are the (causal) solutions to Maxwell’s 
equations. For some reason, they do not seem to have been published 
until quite recently; the earliest explicit statement of which I am aware
was by Oleg Jefimenko, in 1966.¹⁰ In practice Jefimenko’s equations are
of limited utility, since it is typically easier to calculate the retarded 
potentials and differentiate them, rather than going directly to the fields. 
Nevertheless, they provide a satisfying sense of closure to the theory. 
They also help to clarify an observation I made in the previous section: 
to get to the retarded potentials, all you do is replace t by t, in the 
electrostatic and magnetostatic formulas, but in the case of the fields 
not only is time replaced by retarded time, but completely new terms 
(involving derivatives of p and J) appear. And Jefimenko’s equations 
provide surprisingly strong support for the quasistatic approximation (see 
Prob. 10.14). 


10 O. D. Jefimenko, Electricity and Magnetism, Appleton-Century-Crofts, New York (1966),
Sect. 15.7. Related expressions appear in G. A. Schott, Electromagnetic Radiation, Cam- 
bridge University Press, Cambridge, UK (1912), Chap. 2; W. K. H. Panofsky and M. 
Phillips, Classical Electricity and Magnetism, Addison-Wesley, Reading, MA (1962), 
Sect. 14.3, and elsewhere. See K. T. McDonald, Am. J. Phys. 65, 1074 (1997) for illuminating 
commentary and references. 
Problem 10.13. Suppose J(r) is constant in time, so (Prob. 7.63)
$\rho(\mathbf{r}, t) = \rho(\mathbf{r}, 0) + \dot{\rho}(\mathbf{r}, 0)\,t$. Show that

$$\mathbf{E}(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t)}{𝓇^2}\,\hat{𝓻}\,d\tau';$$


that is, Coulomb's law holds, with the charge density evaluated at the nonretarded time. 


Problem 10.14. Suppose the current density changes slowly enough that we can (to good 
approximation) ignore all higher derivatives in the Taylor expansion 


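$$\mathbf{J}(t_r) = \mathbf{J}(t) + (t_r - t)\,\dot{\mathbf{J}}(t) + \cdots$$

(for clarity, I suppress the r-dependence, which is not at issue). Show that a fortuitous cancelation
in Eq. 10.38 yields

$$\mathbf{B}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t)\times\hat{𝓻}}{𝓇^2}\,d\tau'.$$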

That is, the Biot—Savart law holds, with J evaluated at the nonretarded time. This means that 
the quasistatic approximation is actually much better than we had any right to expect: the two 
errors involved (neglecting retardation and dropping the second term in Eq. 10.38) cancel one 
another, to first order. 


! Problem 10.15. Obtain the fields for Ex. 10.2 directly from Jefimenko’s equations.¹¹


10.3 Point Charges 


10.3.1 Liénard—Wiechert Potentials 
My next project is to calculate the (retarded) potentials, V(r, t) and A(r, t),
of a point charge q that is moving on a specified trajectory

$$\mathbf{w}(t) \equiv \text{position of } q \text{ at time } t. \tag{10.39}$$

A naive reading of the formula (Eq. 10.26)

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t_r)}{𝓇}\,d\tau' \tag{10.40}$$

might suggest to you that the potential is simply

$$\frac{1}{4\pi\epsilon_0}\frac{q}{𝓇}$$

(the same as in the static case, with the understanding that 𝓇 is the distance
to the retarded position of the charge). But this is wrong, for a very subtle


11 D. V. Redžić and V. Hnizdo, Eur. J. Phys. 34, 495 (2013).
reason: it is true that for a point source the denominator 𝓇 comes outside
the integral,¹² but what remains,

$$\int\rho(\mathbf{r}', t_r)\,d\tau', \tag{10.41}$$

is not equal to the charge of the particle (and depends, through $t_r$, on the
location of the point r). To calculate the total charge of a configuration, you
must integrate ρ over the entire distribution at one instant of time, but here
the retardation, $t_r = t - 𝓇/c$, obliges us to evaluate ρ at different times for
different parts of the configuration. If the source is moving, the result will
not be the total charge. You might think that this problem would disappear
for point charges, but it doesn’t. In Maxwell’s electrodynamics, formulated
as it is in terms of charge and current densities, a point charge must be
regarded as the limit of an extended charge, when the size goes to zero. And
for an extended particle, no matter how small, the retardation in Eq. 10.41
throws in a factor $(1 - \hat{𝓻}\cdot\mathbf{v}/c)^{-1}$, where v is the velocity of the charge at the
retarded time:

$$\int\rho(\mathbf{r}', t_r)\,d\tau' = \frac{q}{1 - \hat{𝓻}\cdot\mathbf{v}/c}. \tag{10.42}$$

Proof. This is a purely geometrical effect, and it may help to tell 

the story in a less abstract context. You will not have noticed it, for 

obvious reasons, but the fact is that a train coming toward you looks 

a little longer than it really is, because the light you receive from the 

caboose left earlier than the light you receive (at the same instant) 

from the engine, and at that earlier time the train was farther away 

(Fig. 10.6). In the interval it takes light from the caboose to travel 

that longer distance L’, the train itself moves a distance L’ — L: 

$$\frac{L'}{c} = \frac{L' - L}{v}, \quad\text{or}\quad L' = \frac{L}{1 - v/c}.$$

So approaching trains appear longer, by a factor $(1 - v/c)^{-1}$. By
contrast, a train going away from you looks shorter,¹³ by a factor
$(1 + v/c)^{-1}$. In general, if the train’s velocity makes an angle θ with

12 There is, however, an implicit change in its functional dependence: before the integration,
𝓇 = |r − r′| is a function of r and r′; after the integration, which fixes $\mathbf{r}' = \mathbf{w}(t_r)$, $𝓇 = |\mathbf{r} - \mathbf{w}(t_r)|$
is (like $t_r$) a function of r and t.

13 Please note that this has nothing whatever to do with special relativity or Lorentz
contraction; L is the length of the moving train, and its rest length is not at issue. The argument
is somewhat reminiscent of the Doppler effect.


[Figure: train geometry, with an arrow labeled "To observer".]

your line of sight,¹⁴ the extra distance light from the caboose must
cover is $L'\cos\theta$ (Fig. 10.7). In the time $L'\cos\theta/c$, then, the train
moves a distance $(L' - L)$:

$$\frac{L'\cos\theta}{c} = \frac{L' - L}{v}, \quad\text{or}\quad L' = \frac{L}{1 - v\cos\theta/c}.$$

Notice that this effect does not distort the dimensions perpendicular
to the motion (the height and width of the train). Never mind
that the light from the far side is delayed in reaching you (relative to
light from the near side); since there's no motion in that direction,
they'll still look the same distance apart. The apparent volume $\tau'$ of
the train, then, is related to the actual volume $\tau$ by

$$\tau' = \frac{\tau}{1 - \hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{v}/c}, \tag{10.43}$$

where $\hat{\boldsymbol{\mathscr{r}}}$ is a unit vector from the train to the observer.

In case the connection between moving trains and retarded 
potentials eludes you, the point is this: whenever you do an integral 
of the type in Eq. 10.41, in which the integrand is evaluated at the 
retarded time, the effective volume is not the same as the actual 
volume; rather, it is the latter modified by the factor in Eq. 10.43, 
just as the apparent volume of the train was. Because this correction 
factor makes no reference to the size of the particle, it is every bit as 
significant for a point charge as for an extended charge. ∎
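The train argument can be checked numerically. The sketch below is only an illustration under stated assumptions: units with c = 1, an observer at the origin, a train on the x-axis, and hypothetical helper names (`emission_time`, `apparent_length`) that do not come from the text. It finds, by bisection, when the light now arriving left the engine and the caboose, and reads off the apparent length.

```python
def emission_time(x_of_t, t_obs, c=1.0):
    """Time at which a point at x_of_t(t) > 0 emitted light reaching x = 0 at t_obs."""
    # f(t) = 0 when the light-travel distance equals the distance to the point.
    f = lambda t: x_of_t(t) - c * (t_obs - t)
    hi = t_obs                      # f(hi) = x(t_obs) > 0 by assumption
    step = 1.0
    lo = t_obs - step
    while f(lo) > 0.0:              # walk back until the root is bracketed
        step *= 2.0
        lo = t_obs - step
    for _ in range(200):            # plain bisection
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def apparent_length(L, v, D=10.0, c=1.0):
    """Apparent length of a train of true length L; v > 0 means approaching.

    D is the (assumed) engine distance at t = 0, which is also the observation time.
    """
    engine = lambda t: D - v * t
    caboose = lambda t: D + L - v * t
    t_e = emission_time(engine, 0.0, c)
    t_c = emission_time(caboose, 0.0, c)
    # Where each end *appears* to be is where it was when its light left.
    return caboose(t_c) - engine(t_e)


if __name__ == "__main__":
    print(apparent_length(1.0, 0.5))    # compare with L / (1 - v/c)
    print(apparent_length(1.0, -0.5))   # compare with L / (1 + v/c)
```

For head-on motion the two printed values can be compared directly with the factors $(1 \mp v/c)^{-1}$ derived above.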


Meanwhile, for a point charge the retarded time is determined implicitly
by the equation

$$|\mathbf{r} - \mathbf{w}(t_r)| = c(t - t_r). \tag{10.44}$$

The left-hand side is the distance the "news" must travel, and $(t - t_r)$ is the
time it takes to make the trip (Fig. 10.8); $\boldsymbol{\mathscr{r}}$ is the vector from the retarded
position to the field point r:

$$\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{w}(t_r). \tag{10.45}$$


14 I assume the train is far enough away or (more to the point) short enough so that rays from
the caboose and engine can be considered parallel.
[Fig. 10.8. Labels: particle trajectory, retarded position, present position, q.]


It is important to note that at most one point on the trajectory is "in
communication" with r at any particular time t. For suppose there were
two such points, with retarded times $t_1$ and $t_2$:

$$\mathscr{r}_1 = c(t - t_1) \quad\text{and}\quad \mathscr{r}_2 = c(t - t_2).$$

Then $\mathscr{r}_1 - \mathscr{r}_2 = c(t_2 - t_1)$, so the average speed of the particle in the direction
of the point r would have to be c, and that's not counting whatever velocity
the charge might have in other directions. Since no charged particle can
travel at the speed of light, it follows that only one retarded point contributes
to the potentials, at any given moment.¹⁵

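It follows, then, that

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\frac{qc}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})}, \tag{10.46}$$

where v is the velocity of the charge at the retarded time, and $\boldsymbol{\mathscr{r}}$ is the vector
from the retarded position to the field point r. Moreover, since the current
density is $\rho\mathbf{v}$ (Eq. 5.26), the vector potential is

$$\mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\int \frac{\rho(\mathbf{r}', t_r)\,\mathbf{v}(t_r)}{\mathscr{r}}\, d\tau' = \frac{\mu_0}{4\pi}\frac{\mathbf{v}}{\mathscr{r}}\int \rho(\mathbf{r}', t_r)\, d\tau',$$

or

$$\mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\frac{qc\mathbf{v}}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})} = \frac{\mathbf{v}}{c^2}V(\mathbf{r}, t). \tag{10.47}$$

Equations 10.46 and 10.47 are the famous Liénard–Wiechert potentials for
a moving point charge.¹⁶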
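For a trajectory with no closed-form retarded time, Eqs. 10.44, 10.46, and 10.47 can still be evaluated numerically. What follows is a minimal sketch, not a definitive implementation. It assumes units in which c = 1 and 4πε₀ = 1 (so μ₀/4π = 1/c²), a subluminal trajectory supplied as a Python function `w(t)`, and a finite-difference velocity; the names `retarded_time` and `lienard_wiechert` are made up for this illustration.

```python
import numpy as np


def retarded_time(r, t, w, c=1.0, max_step=1e12):
    """Solve |r - w(t_r)| = c (t - t_r) (Eq. 10.44) for t_r by bisection."""
    g = lambda tr: c * (t - tr) - np.linalg.norm(r - w(tr))
    hi = t                          # g(t) = -|r - w(t)| <= 0
    step = 1.0
    lo = t - step
    while g(lo) < 0.0:              # walk back until the signal could have arrived
        step *= 2.0
        if step > max_step:
            raise ValueError("no retarded point found for this field point")
        lo = t - step
    for _ in range(200):            # g decreases monotonically when |v| < c
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def lienard_wiechert(r, t, w, q=1.0, c=1.0, h=1e-6):
    """Return (V, A) from Eqs. 10.46 and 10.47, with 4*pi*eps0 = 1."""
    tr = retarded_time(r, t, w, c)
    sep = r - w(tr)                              # script-r vector (Eq. 10.45)
    v = (w(tr + h) - w(tr - h)) / (2.0 * h)      # velocity at the retarded time
    denom = np.linalg.norm(sep) * c - sep @ v
    V = q * c / denom
    A = v / c**2 * V
    return V, A


if __name__ == "__main__":
    # Uniform motion along x at 0.6 c, chosen only as a simple test trajectory.
    w = lambda t: np.array([0.6 * t, 0.0, 0.0])
    print(lienard_wiechert(np.array([1.0, 2.0, 0.0]), 0.5, w))
```

Bisection is used because, for speeds below c, the function being solved changes sign exactly once, which mirrors the uniqueness argument given above.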


15 For the same reason, an observer at r sees the particle in only one place at a time. By 
contrast, it is possible to hear an object in two places at once. Consider a bear who growls 
at you and then runs toward you at the speed of sound and growls again; you hear both 
growls at the same time, coming from two different locations, but there’s only one bear. 

16 There are many ways to obtain the Liénard–Wiechert potentials. I have tried to emphasize
the geometrical origin of the factor $(1 - \hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{v}/c)^{-1}$; for illuminating commentary, see W. K. H.
Panofsky and M. Phillips, Classical Electricity and Magnetism, 2nd ed., Addison-Wesley,
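Example 10.3. Find the potentials of a point charge moving with
constant velocity.

Solution. For convenience, let's say the particle passes through the
origin at time t = 0, so that

$$\mathbf{w}(t) = \mathbf{v}t.$$

We first compute the retarded time, using Eq. 10.44:

$$|\mathbf{r} - \mathbf{v}t_r| = c(t - t_r),$$

or, squaring:

$$r^2 - 2\,\mathbf{r}\cdot\mathbf{v}\,t_r + v^2t_r^2 = c^2(t^2 - 2tt_r + t_r^2).$$

Solving for $t_r$ by the quadratic formula, we find that

$$t_r = \frac{(c^2t - \mathbf{r}\cdot\mathbf{v}) \pm \sqrt{(c^2t - \mathbf{r}\cdot\mathbf{v})^2 + (c^2 - v^2)(r^2 - c^2t^2)}}{c^2 - v^2}. \tag{10.48}$$

To fix the sign, consider the limit v = 0:

$$t_r = t \pm \frac{r}{c}.$$

In this case the charge is at rest at the origin, and the retarded time
should be $(t - r/c)$; evidently we want the minus sign.
Now, from Eqs. 10.44 and 10.45,

$$\mathscr{r} = c(t - t_r), \quad\text{and}\quad \hat{\boldsymbol{\mathscr{r}}} = \frac{\mathbf{r} - \mathbf{v}t_r}{c(t - t_r)},$$

so

$$\mathscr{r}\left(1 - \hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{v}/c\right) = c(t - t_r)\left[1 - \frac{\mathbf{v}}{c}\cdot\frac{(\mathbf{r} - \mathbf{v}t_r)}{c(t - t_r)}\right] = c(t - t_r) - \frac{\mathbf{v}\cdot\mathbf{r}}{c} + \frac{v^2}{c}t_r$$

$$= \frac{1}{c}\left[(c^2t - \mathbf{r}\cdot\mathbf{v}) - (c^2 - v^2)t_r\right]$$

$$= \frac{1}{c}\sqrt{(c^2t - \mathbf{r}\cdot\mathbf{v})^2 + (c^2 - v^2)(r^2 - c^2t^2)}$$

(I used Eq. 10.48, with the minus sign, in the last step). Therefore,

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\frac{qc}{\sqrt{(c^2t - \mathbf{r}\cdot\mathbf{v})^2 + (c^2 - v^2)(r^2 - c^2t^2)}}, \tag{10.49}$$

and (Eq. 10.47)

$$\mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\frac{qc\mathbf{v}}{\sqrt{(c^2t - \mathbf{r}\cdot\mathbf{v})^2 + (c^2 - v^2)(r^2 - c^2t^2)}}. \tag{10.50}$$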
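As a quick sanity check on Eq. 10.48, the closed-form retarded time (with the minus sign) can be substituted back into Eq. 10.44. The sketch below assumes units with c = 1 by default and uses a hypothetical function name; it is meant only to illustrate the check, using the standard library alone.

```python
import math


def retarded_time_const_v(r, v, t, c=1.0):
    """Eq. 10.48 with the minus sign; r and v are 3-component sequences."""
    r_dot_v = sum(ri * vi for ri, vi in zip(r, v))
    r2 = sum(ri * ri for ri in r)
    v2 = sum(vi * vi for vi in v)
    b = c * c * t - r_dot_v
    disc = b * b + (c * c - v2) * (r2 - c * c * t * t)
    return (b - math.sqrt(disc)) / (c * c - v2)


def residual(r, v, t, c=1.0):
    """|r - v t_r| - c (t - t_r); should vanish if Eq. 10.44 is satisfied."""
    tr = retarded_time_const_v(r, v, t, c)
    dist = math.sqrt(sum((ri - vi * tr) ** 2 for ri, vi in zip(r, v)))
    return dist - c * (t - tr)


if __name__ == "__main__":
    print(retarded_time_const_v((3.0, 4.0, 0.0), (0.0, 0.0, 0.0), 10.0))  # charge at rest
    print(residual((1.0, 2.0, 0.0), (0.6, 0.0, 0.0), 0.5))
```

The first call is the v = 0 limit used above to fix the sign, so it should reproduce t − r/c.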


Reading, MA (1962), pp. 342-343. A more rigorous derivation is provided by J. R. Reitz, 


F. J. Milford, and R. W. Christy, Foundations of Electromagnetic Theory, 3rd ed., Addison- 
Wesley, Reading, MA (1979), Sect. 21.1, or M. A. Heald and J. B. Marion, Classical 
Electromagnetic Radiation, 3rd ed., Saunders, Orlando, FL (1995), Sect. 8.3. 
Problem 10.16. A particle of charge q moves in a circle of radius a at constant angular velocity
$\omega$. (Assume that the circle lies in the xy plane, centered at the origin, and at time t = 0 the
charge is at (a, 0), on the positive x-axis.) Find the Liénard–Wiechert potentials for points on
the z-axis.


Problem 10.17. Show that the scalar potential of a point charge moving with constant velocity
(Eq. 10.49) can be written more simply as

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\frac{q}{R\sqrt{1 - v^2\sin^2\theta/c^2}}, \tag{10.51}$$

where $\mathbf{R} \equiv \mathbf{r} - \mathbf{v}t$ is the vector from the present(!) position of the particle to the field point
r, and $\theta$ is the angle between R and v (Fig. 10.9). Note that for nonrelativistic velocities
($v^2 \ll c^2$),

$$V(\mathbf{r}, t) \approx \frac{1}{4\pi\epsilon_0}\frac{q}{R}.$$

[Fig. 10.9. Labels: q, v, R.]


Problem 10.18. I showed that at most one point on the particle trajectory communicates with
r at any given time. In some cases there may be no such point (an observer at r would not see the
particle; in the colorful language of general relativity, it is "over the horizon"). As an example,
consider a particle in hyperbolic motion along the x-axis:

$$\mathbf{w}(t) = \sqrt{b^2 + (ct)^2}\,\hat{\mathbf{x}} \quad (-\infty < t < \infty). \tag{10.52}$$

(In special relativity, this is the trajectory of a particle subject to a constant force $F = mc^2/b$.)
Sketch the graph of w versus t. At four or five representative points on the curve, draw the
trajectory of a light signal emitted by the particle at that point, both in the plus x direction
and in the minus x direction. What region on your graph corresponds to points and times (x, t)
from which the particle cannot be seen? At what time does someone at point x first see the
particle? (Prior to this the potential at x is zero.) Is it possible for a particle, once seen, to
disappear from view?


Problem 10.19. Determine the Liénard–Wiechert potentials for a charge in hyperbolic motion
(Eq. 10.52). Assume the point r is on the x-axis and to the right of the charge ($x > w(t_r)$).¹⁷


17 The fields of a point charge in hyperbolic motion are notoriously tricky. Indeed, a 


straightforward application of the Liénard—Wiechert potentials yields an electric field in 
violation of Gauss’s law. This paradox was resolved by Bondi and Gold in 1955. For a 
history of the problem, see E. Eriksen and Ø. Grøn, Ann. Phys. 286, 320 (2000). 
10.3.2 The Fields of a Moving Point Charge


We are now in a position to calculate the electric and magnetic fields of a
point charge in arbitrary motion, using the Liénard–Wiechert potentials:¹⁸

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\frac{qc}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})}, \qquad \mathbf{A}(\mathbf{r}, t) = \frac{\mathbf{v}}{c^2}V(\mathbf{r}, t), \tag{10.53}$$

and the equations for E and B:

$$\mathbf{E} = -\nabla V - \frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = \nabla\times\mathbf{A}.$$

The differentiation is tricky, however, because

$$\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{w}(t_r) \quad\text{and}\quad \mathbf{v} = \dot{\mathbf{w}}(t_r) \tag{10.54}$$

are both evaluated at the retarded time, and $t_r$, defined implicitly by the
equation

$$|\mathbf{r} - \mathbf{w}(t_r)| = c(t - t_r), \tag{10.55}$$

is itself a function of r and t.¹⁹ So hang on: the next two pages are rough
going ... but the answer is worth the effort.
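Let's begin with the gradient of V:

$$\nabla V = \frac{qc}{4\pi\epsilon_0}\frac{-1}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})^2}\nabla(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v}). \tag{10.56}$$

Since $\mathscr{r} = c(t - t_r)$,²⁰

$$\nabla\mathscr{r} = -c\nabla t_r. \tag{10.57}$$

As for the second term, product rule 4 gives

$$\nabla(\boldsymbol{\mathscr{r}}\cdot\mathbf{v}) = (\boldsymbol{\mathscr{r}}\cdot\nabla)\mathbf{v} + (\mathbf{v}\cdot\nabla)\boldsymbol{\mathscr{r}} + \boldsymbol{\mathscr{r}}\times(\nabla\times\mathbf{v}) + \mathbf{v}\times(\nabla\times\boldsymbol{\mathscr{r}}). \tag{10.58}$$

Evaluating these terms one at a time,

$$(\boldsymbol{\mathscr{r}}\cdot\nabla)\mathbf{v} = \left(\mathscr{r}_x\frac{\partial}{\partial x} + \mathscr{r}_y\frac{\partial}{\partial y} + \mathscr{r}_z\frac{\partial}{\partial z}\right)\mathbf{v}(t_r)$$

$$= \mathscr{r}_x\frac{d\mathbf{v}}{dt_r}\frac{\partial t_r}{\partial x} + \mathscr{r}_y\frac{d\mathbf{v}}{dt_r}\frac{\partial t_r}{\partial y} + \mathscr{r}_z\frac{d\mathbf{v}}{dt_r}\frac{\partial t_r}{\partial z}$$

$$= \mathbf{a}(\boldsymbol{\mathscr{r}}\cdot\nabla t_r), \tag{10.59}$$

where $\mathbf{a} \equiv \dot{\mathbf{v}}$ is the acceleration of the particle at the retarded time. Now

$$(\mathbf{v}\cdot\nabla)\boldsymbol{\mathscr{r}} = (\mathbf{v}\cdot\nabla)\mathbf{r} - (\mathbf{v}\cdot\nabla)\mathbf{w}, \tag{10.60}$$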


18 You can get the fields directly from Jefimenko’s equations, but it’s not easy. See, for example, 
M. A. Heald and J. B. Marion, Classical Electromagnetic Radiation, 3rd ed., Saunders, 
Orlando, FL (1995), Sect. 8.4. 

19 The following calculation is done by the most direct, “brute-force” method. For a more 
clever and efficient approach, see J. D. Jackson, Classical Electrodynamics, 3rd ed., John 
Wiley, New York (1999), Sect. 14.1. 

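20 Remember that $\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{w}(t_r)$ (Fig. 10.8), and $t_r$ is itself a function of r. Contrast Prob. 1.13
(and Section 10.2), where $\boldsymbol{\mathscr{r}} = \mathbf{r} - \mathbf{r}'$ (Fig. 10.3), and r′ was an independent variable. In that
case $\nabla\mathscr{r} = \hat{\boldsymbol{\mathscr{r}}}$, but here we have a more complicated problem on our hands.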
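and

$$(\mathbf{v}\cdot\nabla)\mathbf{r} = \left(v_x\frac{\partial}{\partial x} + v_y\frac{\partial}{\partial y} + v_z\frac{\partial}{\partial z}\right)(x\,\hat{\mathbf{x}} + y\,\hat{\mathbf{y}} + z\,\hat{\mathbf{z}}) = v_x\hat{\mathbf{x}} + v_y\hat{\mathbf{y}} + v_z\hat{\mathbf{z}} = \mathbf{v}, \tag{10.61}$$

while

$$(\mathbf{v}\cdot\nabla)\mathbf{w} = \mathbf{v}(\mathbf{v}\cdot\nabla t_r)$$

(same reasoning as Eq. 10.59). Moving on to the third term in Eq. 10.58,

$$\nabla\times\mathbf{v} = \left(\frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z}\right)\hat{\mathbf{x}} + \left(\frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x}\right)\hat{\mathbf{y}} + \left(\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}\right)\hat{\mathbf{z}}$$

$$= \left(\frac{dv_z}{dt_r}\frac{\partial t_r}{\partial y} - \frac{dv_y}{dt_r}\frac{\partial t_r}{\partial z}\right)\hat{\mathbf{x}} + \left(\frac{dv_x}{dt_r}\frac{\partial t_r}{\partial z} - \frac{dv_z}{dt_r}\frac{\partial t_r}{\partial x}\right)\hat{\mathbf{y}} + \left(\frac{dv_y}{dt_r}\frac{\partial t_r}{\partial x} - \frac{dv_x}{dt_r}\frac{\partial t_r}{\partial y}\right)\hat{\mathbf{z}}$$

$$= -\mathbf{a}\times\nabla t_r. \tag{10.62}$$

Finally,

$$\nabla\times\boldsymbol{\mathscr{r}} = \nabla\times\mathbf{r} - \nabla\times\mathbf{w}, \tag{10.63}$$

but $\nabla\times\mathbf{r} = \mathbf{0}$, while, by the same argument as Eq. 10.62,

$$\nabla\times\mathbf{w} = -\mathbf{v}\times\nabla t_r. \tag{10.64}$$

Putting all this back into Eq. 10.58, and using the "BAC-CAB" rule to
reduce the triple cross products,

$$\nabla(\boldsymbol{\mathscr{r}}\cdot\mathbf{v}) = \mathbf{a}(\boldsymbol{\mathscr{r}}\cdot\nabla t_r) + \mathbf{v} - \mathbf{v}(\mathbf{v}\cdot\nabla t_r) - \boldsymbol{\mathscr{r}}\times(\mathbf{a}\times\nabla t_r) + \mathbf{v}\times(\mathbf{v}\times\nabla t_r)$$

$$= \mathbf{v} + (\boldsymbol{\mathscr{r}}\cdot\mathbf{a} - v^2)\nabla t_r. \tag{10.65}$$

Collecting Eqs. 10.57 and 10.65, we have

$$\nabla V = \frac{qc}{4\pi\epsilon_0}\frac{1}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})^2}\left[\mathbf{v} + (c^2 - v^2 + \boldsymbol{\mathscr{r}}\cdot\mathbf{a})\nabla t_r\right]. \tag{10.66}$$

To complete the calculation, we need to know $\nabla t_r$. This can be found by
taking the gradient of the defining equation (Eq. 10.55), which we have
already done in Eq. 10.57, and expanding $\nabla\mathscr{r}$:

$$-c\nabla t_r = \nabla\mathscr{r} = \nabla\sqrt{\boldsymbol{\mathscr{r}}\cdot\boldsymbol{\mathscr{r}}} = \frac{1}{2\sqrt{\boldsymbol{\mathscr{r}}\cdot\boldsymbol{\mathscr{r}}}}\nabla(\boldsymbol{\mathscr{r}}\cdot\boldsymbol{\mathscr{r}})$$

$$= \frac{1}{\mathscr{r}}\left[(\boldsymbol{\mathscr{r}}\cdot\nabla)\boldsymbol{\mathscr{r}} + \boldsymbol{\mathscr{r}}\times(\nabla\times\boldsymbol{\mathscr{r}})\right]. \tag{10.67}$$

But

$$(\boldsymbol{\mathscr{r}}\cdot\nabla)\boldsymbol{\mathscr{r}} = \boldsymbol{\mathscr{r}} - \mathbf{v}(\boldsymbol{\mathscr{r}}\cdot\nabla t_r)$$

(same idea as Eq. 10.60), while (from Eqs. 10.63 and 10.64)

$$\nabla\times\boldsymbol{\mathscr{r}} = (\mathbf{v}\times\nabla t_r).$$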
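Thus

$$-c\nabla t_r = \frac{1}{\mathscr{r}}\left[\boldsymbol{\mathscr{r}} - \mathbf{v}(\boldsymbol{\mathscr{r}}\cdot\nabla t_r) + \boldsymbol{\mathscr{r}}\times(\mathbf{v}\times\nabla t_r)\right] = \frac{1}{\mathscr{r}}\left[\boldsymbol{\mathscr{r}} - (\boldsymbol{\mathscr{r}}\cdot\mathbf{v})\nabla t_r\right],$$

and hence

$$\nabla t_r = \frac{-\boldsymbol{\mathscr{r}}}{\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v}}. \tag{10.68}$$

Incorporating this result into Eq. 10.66, I conclude that

$$\nabla V = \frac{1}{4\pi\epsilon_0}\frac{qc}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})^3}\left[(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})\mathbf{v} - (c^2 - v^2 + \boldsymbol{\mathscr{r}}\cdot\mathbf{a})\boldsymbol{\mathscr{r}}\right]. \tag{10.69}$$

A similar calculation, which I shall leave for you (Prob. 10.20), yields

$$\frac{\partial\mathbf{A}}{\partial t} = \frac{1}{4\pi\epsilon_0}\frac{qc}{(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})^3}\left[(\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v})(-\mathbf{v} + \mathscr{r}\mathbf{a}/c) + \frac{\mathscr{r}}{c}(c^2 - v^2 + \boldsymbol{\mathscr{r}}\cdot\mathbf{a})\mathbf{v}\right]. \tag{10.70}$$

Combining these results, and introducing the vector

$$\mathbf{u} \equiv c\,\hat{\boldsymbol{\mathscr{r}}} - \mathbf{v}, \tag{10.71}$$

I find

$$\mathbf{E}(\mathbf{r}, t) = \frac{q}{4\pi\epsilon_0}\frac{\mathscr{r}}{(\boldsymbol{\mathscr{r}}\cdot\mathbf{u})^3}\left[(c^2 - v^2)\mathbf{u} + \boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})\right]. \tag{10.72}$$

Meanwhile,

$$\nabla\times\mathbf{A} = \frac{1}{c^2}\nabla\times(V\mathbf{v}) = \frac{1}{c^2}\left[V(\nabla\times\mathbf{v}) - \mathbf{v}\times(\nabla V)\right].$$

We have already calculated $\nabla\times\mathbf{v}$ (Eq. 10.62) and $\nabla V$ (Eq. 10.69). Putting
these together,

$$\nabla\times\mathbf{A} = -\frac{1}{c}\frac{q}{4\pi\epsilon_0}\frac{1}{(\mathbf{u}\cdot\boldsymbol{\mathscr{r}})^3}\,\boldsymbol{\mathscr{r}}\times\left[(c^2 - v^2)\mathbf{v} + (\boldsymbol{\mathscr{r}}\cdot\mathbf{a})\mathbf{v} + (\boldsymbol{\mathscr{r}}\cdot\mathbf{u})\mathbf{a}\right].$$

The quantity in brackets is strikingly similar to the one in Eq. 10.72, which
can be written, using the BAC-CAB rule, as $[(c^2 - v^2)\mathbf{u} + (\boldsymbol{\mathscr{r}}\cdot\mathbf{a})\mathbf{u} - (\boldsymbol{\mathscr{r}}\cdot\mathbf{u})\mathbf{a}]$;
the main difference is that we have v's instead of u's in the first two terms.
In fact, since it's all crossed into $\boldsymbol{\mathscr{r}}$ anyway, we can with impunity change
these v's into −u's; the extra term proportional to $\hat{\boldsymbol{\mathscr{r}}}$ disappears in the cross
product. It follows that

$$\mathbf{B}(\mathbf{r}, t) = \frac{1}{c}\,\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{E}(\mathbf{r}, t). \tag{10.73}$$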


Evidently, the magnetic field of a point charge is always perpendicular to the 
electric field, and to the vector from the retarded point. 
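Equations 10.72 and 10.73 translate almost line by line into code once the retarded quantities are known. The following is a small sketch under assumptions that are not in the text: units with 4πε₀ = 1 and c = 1 by default, a hypothetical function name, and inputs (separation vector, velocity, acceleration) already evaluated at the retarded time.

```python
import numpy as np


def point_charge_fields(sep, v, a, q=1.0, c=1.0):
    """E and B of a point charge from Eqs. 10.72 and 10.73 (4*pi*eps0 = 1).

    sep  : vector from the retarded position to the field point
    v, a : velocity and acceleration of the charge at the retarded time
    """
    s = np.linalg.norm(sep)
    s_hat = sep / s
    u = c * s_hat - v                                   # Eq. 10.71
    velocity_part = (c**2 - v @ v) * u                  # generalized Coulomb term
    acceleration_part = np.cross(sep, np.cross(u, a))   # radiation term
    E = q * s / (sep @ u) ** 3 * (velocity_part + acceleration_part)
    B = np.cross(s_hat, E) / c                          # Eq. 10.73
    return E, B


if __name__ == "__main__":
    # A charge at rest should give the Coulomb field q / s^2 and no magnetic field.
    E, B = point_charge_fields(np.array([0.0, 0.0, 2.0]), np.zeros(3), np.zeros(3))
    print(E, B)
```

Because B is built as a cross product with E, the perpendicularity noted above holds automatically in this form.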

The first term in E (the one involving $(c^2 - v^2)\mathbf{u}$) falls off as the inverse
square of the distance from the particle. If the velocity and acceleration are
both zero, this term alone survives and reduces to the old electrostatic result

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{q}{\mathscr{r}^2}\,\hat{\boldsymbol{\mathscr{r}}}.$$
For this reason, the first term in E is sometimes called the generalized
Coulomb field. (Because it does not depend on the acceleration, it is also
known as the velocity field.) The second term (the one involving $\boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})$)
falls off as the inverse first power of $\mathscr{r}$ and is therefore dominant at large
distances. As we shall see in Chapter 11, it is this term that is responsible
for electromagnetic radiation; accordingly, it is called the radiation field
or, since it is proportional to a, the acceleration field. The same terminology
applies to the magnetic field.

Back in Chapter 2, I remarked that if we could write down the formula
for the force that one point charge exerts on another, we would be done with
electrodynamics, in principle. That, together with the superposition principle,
would tell us the force exerted on a test charge Q by any configuration
whatsoever. Well, here we are: Eqs. 10.72 and 10.73 give us the fields, and
the Lorentz force law determines the resulting force:

$$\mathbf{F} = \frac{qQ}{4\pi\epsilon_0}\frac{\mathscr{r}}{(\boldsymbol{\mathscr{r}}\cdot\mathbf{u})^3}\left\{\left[(c^2 - v^2)\mathbf{u} + \boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})\right] + \frac{\mathbf{V}}{c}\times\left[\hat{\boldsymbol{\mathscr{r}}}\times\left[(c^2 - v^2)\mathbf{u} + \boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})\right]\right]\right\}, \tag{10.74}$$

where V is the velocity of Q, and $\boldsymbol{\mathscr{r}}$, u, v, and a are all evaluated at the
retarded time. The entire theory of classical electrodynamics is contained in
that equation ... but you see why I preferred to start out with Coulomb's law.


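Example 10.4. Calculate the electric and magnetic fields of a point
charge moving with constant velocity.

Solution. Putting a = 0 in Eq. 10.72,

$$\mathbf{E} = \frac{q}{4\pi\epsilon_0}\frac{(c^2 - v^2)\mathscr{r}}{(\boldsymbol{\mathscr{r}}\cdot\mathbf{u})^3}\,\mathbf{u}.$$

In this case, using $\mathbf{w} = \mathbf{v}t$,

$$\mathscr{r}\mathbf{u} = c\boldsymbol{\mathscr{r}} - \mathscr{r}\mathbf{v} = c(\mathbf{r} - \mathbf{v}t_r) - c(t - t_r)\mathbf{v} = c(\mathbf{r} - \mathbf{v}t).$$

In Ex. 10.3 we found that

$$\mathscr{r}c - \boldsymbol{\mathscr{r}}\cdot\mathbf{v} = \boldsymbol{\mathscr{r}}\cdot\mathbf{u} = \sqrt{(c^2t - \mathbf{r}\cdot\mathbf{v})^2 + (c^2 - v^2)(r^2 - c^2t^2)}.$$

In Prob. 10.17 you showed that this radical could be written as

$$Rc\sqrt{1 - v^2\sin^2\theta/c^2},$$

where

$$\mathbf{R} \equiv \mathbf{r} - \mathbf{v}t$$

is the vector from the present location of the particle to r, and $\theta$ is the
angle between R and v (Fig. 10.9). Thus

$$\mathbf{E}(\mathbf{r}, t) = \frac{q}{4\pi\epsilon_0}\frac{1 - v^2/c^2}{\left(1 - v^2\sin^2\theta/c^2\right)^{3/2}}\frac{\hat{\mathbf{R}}}{R^2}. \tag{10.75}$$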
Notice that E points along the line from the present position of the
particle. This is an extraordinary coincidence, since the "message" came
from the retarded position. Because of the $\sin^2\theta$ in the denominator,
the field of a fast-moving charge is flattened out like a pancake in the
direction perpendicular to the motion (Fig. 10.10). In the forward and
backward directions E is reduced by a factor $(1 - v^2/c^2)$ relative to the
field of a charge at rest; in the perpendicular direction it is enhanced
by a factor $1/\sqrt{1 - v^2/c^2}$.

As for B, we have

$$\hat{\boldsymbol{\mathscr{r}}} = \frac{\mathbf{r} - \mathbf{v}t_r}{\mathscr{r}} = \frac{(\mathbf{r} - \mathbf{v}t) + (t - t_r)\mathbf{v}}{\mathscr{r}} = \frac{\mathbf{R}}{\mathscr{r}} + \frac{\mathbf{v}}{c},$$

and therefore

$$\mathbf{B} = \frac{1}{c}(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{E}) = \frac{1}{c^2}(\mathbf{v}\times\mathbf{E}). \tag{10.76}$$

Lines of B circle around the charge, as shown in Fig. 10.11.

The fields of a point charge moving at constant velocity (Eqs. 10.75
and 10.76) were first obtained by Oliver Heaviside in 1888.²¹ When
$v^2 \ll c^2$ they reduce to

$$\mathbf{E}(\mathbf{r}, t) \approx \frac{1}{4\pi\epsilon_0}\frac{q}{R^2}\hat{\mathbf{R}}, \qquad \mathbf{B}(\mathbf{r}, t) \approx \frac{\mu_0}{4\pi}\frac{q}{R^2}(\mathbf{v}\times\hat{\mathbf{R}}). \tag{10.77}$$

The first is essentially Coulomb's law, and the second is the "Biot–Savart
law for a point charge" I warned you about in Chapter 5
(Eq. 5.43).

21 For history and references, see O. J. Jefimenko, Am. J. Phys. 62, 79 (1994). 
Example 10.5. (The Thomson kink model) A point charge is at rest at
the origin when, at time t = 0, it is given a kick in the x direction, and
flies off at constant speed v. Sketch the electric field at some later time.
How about the magnetic field?


Solution. Let's say the "kick" lasts a short time $\tau$ (starting at t = 0),
during which the particle accelerates up to speed v. Outside a sphere
of radius $r_2 = ct$, "news" of the kick has not yet arrived, and the field
is that of a particle at rest at the origin (Fig. 10.12). Inside a sphere
of radius $r = c(t - \tau)$, the news has arrived, and E is the "flattened"
field of a particle moving at constant velocity (Fig. 10.10). Within the
shell between these spheres we don't know precisely what the field looks
like (because we haven't been told exactly how the charge accelerates
during the kick); but this much we do know: the field lines must join up.
Within the shell (which, by the way, is expanding at the speed of light)
the electric field has a "kink," and in particular it acquires a significant
tangential component.

Meanwhile, the magnetic field is zero outside the outer sphere,
and circles around the x-axis inside the inner sphere (Fig. 10.11). It
is everywhere perpendicular to E (Eq. 10.73). Within the expanding
shell the Poynting vector has a substantial radial component: energy is
"radiating" away from the charge at the speed of light.²³


Problem 10.20. Derive Eq. 10.70. First show that

$$\frac{\partial t_r}{\partial t} = \frac{\mathscr{r}c}{\boldsymbol{\mathscr{r}}\cdot\mathbf{u}}. \tag{10.78}$$


22 The inner sphere is centered at the location of the charge at time t = T. 

23 This lovely qualitative model for the electromagnetic radiation from an accelerating charge 
is due originally to J. J. Thomson, Electricity and Matter, Yale University Press, New Haven, 
CT (1904), p. 55. See also E. M. Purcell and D. J. Morin, Electricity and Magnetism, 3rd 
ed., Cambridge University Press (2013), Sect. 5.7 and App. H; R. Y Tsien, Am. J. Phys. 40, 
46 (1972); H. C. Ohanian, Am. J. Phys. 48, 170 (1980). 
Problem 10.21.


(a) Suppose a point charge is constrained to move along the x-axis. Show that the fields at
points on the axis to the right of the charge are given by

$$\mathbf{E} = \frac{q}{4\pi\epsilon_0}\frac{1}{\mathscr{r}^2}\left(\frac{c + v}{c - v}\right)\hat{\mathbf{x}}, \qquad \mathbf{B} = \mathbf{0}.$$

(Do not assume v is constant!) What are the fields on the axis to the left of the charge?

(b) In Ex. 10.4 I noted that E is reduced in the forward and backward directions by a factor of
$(1 - (v/c)^2)$, in the case of constant velocity. This might appear to contradict your result
in (a). Resolve this "paradox."


Problem 10.22. For a point charge moving at constant velocity, calculate the flux integral
$\oint \mathbf{E}\cdot d\mathbf{a}$ (using Eq. 10.75), over the surface of a sphere centered at the present location of
the charge.²⁴


Problem 10.23. 


(a) Use Eq. 10.75 to calculate the electric field a distance d from an infinite straight wire carrying
a uniform line charge $\lambda$, moving at a constant speed v down the wire.
(b) Use Eq. 10.76 to find the magnetic field of this wire.


Problem 10.24. For the configuration in Prob. 10.16, find the electric and magnetic fields at
the center. From your formula for B, determine the magnetic field at the center of a circular
loop carrying a steady current I, and compare your answer with the result of Ex. 5.6.


More Problems on Chapter 10 


Problem 10.25. Suppose you take a plastic ring of radius a and glue charge on it, so that
the line charge density is $\lambda_0|\sin(\theta/2)|$. Then you spin the loop about its axis at an angular
velocity $\omega$. Find the (exact) scalar and vector potentials at the center of the ring. [Answer:
$\mathbf{A} = (\mu_0\lambda_0\omega a/3\pi)\{\sin[\omega(t - a/c)]\,\hat{\mathbf{x}} - \cos[\omega(t - a/c)]\,\hat{\mathbf{y}}\}$.]


Problem 10.26. Figure 2.35 summarizes the laws of electrostatics in a "triangle diagram"
relating the source ($\rho$), the field (E), and the potential (V). Figure 5.51 does the same for
magnetostatics, where the source is J, the field is B, and the potential is A. Construct the analogous
diagram for electrodynamics, with sources $\rho$ and J (constrained by the continuity equation),
fields E and B, and potentials V and A (constrained by the Lorenz gauge condition). Do not
include formulas for V and A in terms of E and B.


24 Wheeler's First Moral Principle says "Never make a calculation until you know the answer"
(E. F. Taylor and J. A. Wheeler, Spacetime Physics, Freeman (1966), p. 60). It doesn’t always 
work, but this is a good problem to try it on. 
Problem 10.27. An expanding sphere, radius R(t) = vt (t > 0, constant v) carries a charge
Q, uniformly distributed over its volume. Evaluate the integral

$$Q_{\text{eff}} = \int \rho(\mathbf{r}, t_r)\, d\tau$$

with respect to the center. Show that $Q_{\text{eff}} \approx Q\left(1 - \frac{3v}{4c}\right)$, if $v \ll c$.

Problem 10.28. Check that the potentials of a point charge moving at constant velocity 
(Eqs. 10.49 and 10.50) satisfy the Lorenz gauge condition (Eq. 10.12). 


Problem 10.29. One particle, of charge $q_1$, is held at rest at the origin. Another particle, of
charge $q_2$, approaches along the x-axis, in hyperbolic motion:

$$x(t) = \sqrt{b^2 + (ct)^2};$$

it reaches the closest point, b, at time t = 0, and then returns out to infinity.


(a) What is the force $F_2$ on $q_2$ (due to $q_1$) at time t?

(b) What total impulse ($I_2 = \int_{-\infty}^{\infty} F_2\, dt$) is delivered to $q_2$ by $q_1$?

(c) What is the force $F_1$ on $q_1$ (due to $q_2$) at time t?

(d) What total impulse ($I_1 = \int_{-\infty}^{\infty} F_1\, dt$) is delivered to $q_1$ by $q_2$? [Hint: It might help to
review Prob. 10.18 before doing this integral. Answer: $I_2 = -I_1 = q_1q_2/4\epsilon_0 bc$.]


Problem 10.30. We are now in a position to treat the example in Section 8.2.1 quantitatively.
Suppose $q_1$ is at $x = -vt$ and $q_2$ is at $y = -vt$ (Fig. 8.3, with t < 0). Find the electric and
magnetic forces on $q_1$ and $q_2$. Is Newton's third law obeyed?


Problem 10.31. A uniformly charged rod (length L, charge density $\lambda$) slides out the x-axis
at constant speed v. At time t = 0 the back end passes the origin (so its position as a function
of time is x = vt, while the front end is at x = vt + L). Find the retarded scalar potential at
the origin, as a function of time, for t > 0. [First determine the retarded time $t_1$ for the back
end, the retarded time $t_2$ for the front end, and the corresponding retarded positions $x_1$ and
$x_2$.] Is your answer consistent with the Liénard–Wiechert potential, in the point charge limit
($L \ll vt$, with $\lambda L = q$)? Do not assume $v \ll c$.


Problem 10.32. A particle of charge q is traveling at constant speed v along the x-axis.
Calculate the total power passing through the plane x = a, at the moment the particle itself
is at the origin. [Answer: $q^2v/32\pi\epsilon_0a^2$]


Problem 10.33. A particle of charge $q_1$ is at rest at the origin. A second particle, of charge
$q_2$, moves along the z-axis at constant velocity v.


(a) Find the force $\mathbf{F}_{12}(t)$ of $q_1$ on $q_2$, at time t (when $q_2$ is at z = vt).
(b) Find the force $\mathbf{F}_{21}(t)$ of $q_2$ on $q_1$, at time t. Does Newton's third law hold, in this case?


25 See J. J. G. Scanio, Am. J. Phys. 43, 258 (1975). 
(c) Calculate the linear momentum $\mathbf{p}(t)$ in the electromagnetic fields, at time t. (Don't bother
with any terms that are constant in time, since you won't need them in part (d)). [Answer:
$(\mu_0q_1q_2/4\pi t)\,\hat{\mathbf{z}}$.]

(d) Show that the sum of the forces is equal to minus the rate of change of the momentum in
the fields, and interpret this result physically.


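Problem 10.34. Develop the potential formulation for electrodynamics with magnetic charge
(Eq. 7.53). [Hint: You'll need two scalar potentials and two vector potentials. Use the Lorenz
gauge. Find the retarded potentials (generalizing Eqs. 10.26), and give the formulas for E and
B in terms of the potentials (generalizing Eqs. 10.2 and 10.3).]


Problem 10.35. Find the (Lorenz gauge) potentials and fields of a time-dependent ideal
electric dipole $\mathbf{p}(t)$ at the origin.²⁶ (It is stationary, but its magnitude and/or direction are
changing with time.) Don't bother with the contact term. [Answer:

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\frac{\hat{\mathbf{r}}}{r^2}\cdot\left[\mathbf{p} + (r/c)\dot{\mathbf{p}}\right],$$

$$\mathbf{A}(\mathbf{r}, t) = \frac{\mu_0}{4\pi}\frac{\dot{\mathbf{p}}}{r},$$

$$\mathbf{E}(\mathbf{r}, t) = -\frac{\mu_0}{4\pi}\left\{\frac{\ddot{\mathbf{p}} - \hat{\mathbf{r}}(\hat{\mathbf{r}}\cdot\ddot{\mathbf{p}})}{r} + c^2\,\frac{\left[\mathbf{p} + (r/c)\dot{\mathbf{p}}\right] - 3\hat{\mathbf{r}}\left(\hat{\mathbf{r}}\cdot\left[\mathbf{p} + (r/c)\dot{\mathbf{p}}\right]\right)}{r^3}\right\},$$

$$\mathbf{B}(\mathbf{r}, t) = -\frac{\mu_0}{4\pi}\left\{\frac{\hat{\mathbf{r}}\times\left[\dot{\mathbf{p}} + (r/c)\ddot{\mathbf{p}}\right]}{r^2}\right\}, \tag{10.79}$$

where all the derivatives of p are evaluated at the retarded time.]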


Problem 10.36. In Section 10.2.1 I noted that the naive "derivation" of the retarded potentials
fails for the fields. The reason is subtle, but it can be fixed.²⁷ In the Lorenz gauge the potentials
satisfy the inhomogeneous wave equation (Eq. 10.16):

$$\Box^2 f(\mathbf{r}, t) = -s(\mathbf{r}, t), \tag{10.80}$$

where f stands for V or A, and s for the corresponding source ($\rho/\epsilon_0$ or $\mu_0\mathbf{J}$). The solution
(for localized sources) is

$$f(\mathbf{r}, t) = \frac{1}{4\pi}\int \frac{s(\mathbf{r}', t_r)}{\mathscr{r}}\, d\tau' \tag{10.81}$$

(Eq. 10.26), as we guessed, and then confirmed.


(a) Show (from Maxwell's equations) that

$$\Box^2\mathbf{E} = \frac{1}{\epsilon_0}\nabla\rho + \mu_0\frac{\partial\mathbf{J}}{\partial t}, \qquad \Box^2\mathbf{B} = -\mu_0(\nabla\times\mathbf{J}). \tag{10.82}$$


26 W. J. M. Kort-Kamp and C. Farina, Am. J. Phys. 79, 111 (2011); D. J. Griffiths, Am. J. Phys.
79, 867 (2011).

27 N. A. Lemos, Rev. Bras. Ensino Fis., 27, 385 (2005); available in English at
arXiv:physics/0508047v1 [physics.ed-ph].
(b) Write down the solutions for E(r, t) and B(r, t).
(c) Using integration by parts, recover Jefimenko's equations (10.36 and 10.38).


Problem 10.37. Two charges ±q approach the origin at constant velocity from opposite
directions along the x-axis. They collide and stick together, forming a neutral particle at rest.
Sketch the electric field before and shortly after the collision. How would you interpret the field
after the collision, physically?


Problem 10.38. Imagine an ideal electric dipole at the origin, pointing in the z direction,
whose dipole moment is increasing linearly with time:

$$\mathbf{p}(t) = kt\,\hat{\mathbf{z}} \tag{10.83}$$

(for some constant k). Evidently, there is a steady current flowing from the negative end to the
positive end:

$$\mathbf{J}(\mathbf{r}) = \alpha\,\delta^3(\mathbf{r})\,\hat{\mathbf{z}}, \tag{10.84}$$

where $\alpha$ is a constant to be determined.


(a) From the continuity equation, find the charge density $\rho(\mathbf{r}, t)$. Stipulate that there is no
static background charge: $\rho(\mathbf{r}, 0) = 0$. [Express your answer as a derivative of the delta
function.]

(b) From the charge density, calculate the electric dipole moment, and, comparing Eq. 10.83,
determine $\alpha$ (in terms of k). [Hint: Use integration by parts.]

(c) When $\rho(\mathbf{r}, t)$ is linear in t, and J(r) is constant, a lucky cancelation means that the
electric field satisfies Coulomb's law, and the magnetic field satisfies the Biot–Savart law,
with no retardation (see Probs. 10.13 and 10.14). In our case, then, E(r, t) is given by the
electrostatic formula (Eq. 3.106), with p the present dipole moment (Eq. 10.83). What is the
magnetic field, B(r, t)?

(d) Now suppose there is a second such dipole, located on the z-axis at z = a and pointing in
the x direction. What is the net force of $\mathbf{p}_1$ on $\mathbf{p}_2$, and of $\mathbf{p}_2$ on $\mathbf{p}_1$? Is Newton's third law
obeyed?

(e) Now the hard part: calculate the field momentum (Eq. 8.29) associated with their
interaction,

$$\mathbf{p}_{\text{em}}(t) = \epsilon_0\int\left[(\mathbf{E}_1\times\mathbf{B}_2) + (\mathbf{E}_2\times\mathbf{B}_1)\right] d\tau, \tag{10.85}$$

and show that its rate of change exactly balances the net force on the two dipoles, so the
total momentum is conserved.²⁸ [Use a computer to help with the integrals.]


28 This is a simplified version of O. Narayan, Am. J. Phys. 89, 1033 (2021), due to V. Onoochin. 
(The self-forces and self-field momenta are all zero.) 
11.1 Dipole Radiation


11.1.1 What Is Radiation?


When charges accelerate, their fields can transport energy irreversibly out to
infinity, a process we call radiation.¹ Let us assume the source is localized²
near the origin; we would like to calculate the energy it radiates at time $t_0$.
Imagine a gigantic sphere, out at radius r (Fig. 11.1). The power passing
through its surface is the integral of the Poynting vector:

$$P(r, t) = \oint \mathbf{S}\cdot d\mathbf{a} = \frac{1}{\mu_0}\oint(\mathbf{E}\times\mathbf{B})\cdot d\mathbf{a}. \tag{11.1}$$

Because electromagnetic "news" travels at the speed of light,³ this energy
actually left the source at the earlier time $t_0 = t - r/c$, so the power radiated is

$$P_{\text{rad}}(t_0) = \lim_{r\to\infty} P\left(r, t_0 + \frac{r}{c}\right) \tag{11.2}$$

(with $t_0$ held constant). This is energy (per unit time) that is carried away
and never comes back.

Now, the area of the sphere is $4\pi r^2$, so for radiation to occur the Poynting
vector must decrease (at large r) no faster than $1/r^2$ (if it went like $1/r^3$, for
example, then P would go like 1/r, and $P_{\text{rad}}$ would be zero). According to
Coulomb's law, electrostatic fields fall off like $1/r^2$ (or even faster, if the
total charge is zero), and the Biot–Savart law says that magnetostatic fields
go like $1/r^2$ (or faster), which means that $S \sim 1/r^4$, for static configurations.
So static sources do not radiate. But Jefimenko's equations (Eqs. 10.36 and
10.38) indicate that time-dependent fields include terms (involving $\dot{\rho}$ and $\dot{\mathbf{J}}$)
that go like 1/r; these are the terms that are responsible for electromagnetic
radiation.


1 In this chapter, the word "radiation" is used in a restricted technical sense; it might better be
called "radiation to infinity." In everyday language the word has a broader connotation. We
speak, for example, of radiation from a heat lamp or an X-ray machine. In this more general
sense, electromagnetic "radiation" applies to any fields that transport energy, which is to
say, fields whose Poynting vector is nonzero. There is nothing wrong with that language, but
it is not how I am using the term here.

2 For nonlocalized configurations, such as infinite planes, wires, or solenoids, the concept of
"radiation" must be reformulated (Prob. 11.28).

3 More precisely, the fields depend on the status of the source at the retarded time.

[Fig. 11.1. Label: Source.]

The study of radiation, then, involves picking out the parts of E and B
that go like 1/r at large distances from the source, constructing from them
the $1/r^2$ term in S, integrating over a large spherical⁴ surface, and taking the
limit as $r \to \infty$. I'll carry through this procedure first for oscillating electric
and magnetic dipoles; then, in Section 11.2, we'll consider the more difficult
case of radiation from an accelerating point charge.


11.1.2 Electric Dipole Radiation 


Picture two tiny metal spheres separated by a distance d and connected by
a fine wire (Fig. 11.2); at time t the charge on the upper sphere is q(t), and
the charge on the lower sphere is −q(t). Suppose that we drive the charge
back and forth through the wire, from one end to the other, at an angular
frequency $\omega$:

$$q(t) = q_0\cos(\omega t). \tag{11.3}$$

The result is an oscillating electric dipole:⁵

$$\mathbf{p}(t) = p_0\cos(\omega t)\,\hat{\mathbf{z}}, \tag{11.4}$$

where

$$p_0 \equiv q_0 d$$

is the maximum value of the dipole moment.
The retarded potential (Eq. 10.26) is

$$V(\mathbf{r}, t) = \frac{1}{4\pi\epsilon_0}\left\{\frac{q_0\cos[\omega(t - \mathscr{r}_+/c)]}{\mathscr{r}_+} - \frac{q_0\cos[\omega(t - \mathscr{r}_-/c)]}{\mathscr{r}_-}\right\}, \tag{11.5}$$

where, by the law of cosines,

$$\mathscr{r}_\pm = \sqrt{r^2 \mp rd\cos\theta + (d/2)^2}. \tag{11.6}$$


4 It doesn't have to be a sphere, of course, but this makes the calculations a lot easier.

5 It might occur to you that a more natural model would consist of equal and opposite charges
mounted on a spring, say, so that q is constant while d oscillates, instead of the other way
around. Such a model would lead to the same result, but moving point charges are hard to
work with, and the proposed formulation is much simpler.
Now, to make this physical dipole into a perfect dipole, we want the
separation distance to be extremely small:

$$\text{approximation 1:}\quad d \ll r. \tag{11.7}$$

Of course, if d is zero we get no potential at all; what we want is an
expansion carried to first order in d. Thus

$$\mathscr{r}_\pm \approx r\left(1 \mp \frac{d}{2r}\cos\theta\right). \tag{11.8}$$

It follows that

$$\frac{1}{\mathscr{r}_\pm} \approx \frac{1}{r}\left(1 \pm \frac{d}{2r}\cos\theta\right), \tag{11.9}$$

and

$$\cos[\omega(t - \mathscr{r}_\pm/c)] \approx \cos\left[\omega(t - r/c) \pm \frac{\omega d}{2c}\cos\theta\right]$$

$$= \cos[\omega(t - r/c)]\cos\left(\frac{\omega d}{2c}\cos\theta\right) \mp \sin[\omega(t - r/c)]\sin\left(\frac{\omega d}{2c}\cos\theta\right).$$

In the perfect dipole limit we have, further,

$$\text{approximation 2:}\quad d \ll \frac{c}{\omega}. \tag{11.10}$$

(Since waves of frequency $\omega$ have a wavelength $\lambda = 2\pi c/\omega$, this amounts
to the requirement $d \ll \lambda$.) Under these conditions,

$$\cos[\omega(t - \mathscr{r}_\pm/c)] \approx \cos[\omega(t - r/c)] \mp \frac{\omega d}{2c}\cos\theta\,\sin[\omega(t - r/c)]. \tag{11.11}$$

Putting Eqs. 11.9 and 11.11 into Eq. 11.5, we obtain the potential of an
oscillating perfect dipole:

$$V(r, \theta, t) = \frac{p_0\cos\theta}{4\pi\epsilon_0 r}\left\{-\frac{\omega}{c}\sin[\omega(t - r/c)] + \frac{1}{r}\cos[\omega(t - r/c)]\right\}. \tag{11.12}$$
In the static limit ($\omega \to 0$) the second term reproduces the old formula for
the potential of a stationary dipole (Eq. 3.102):

$$V = \frac{p_0\cos\theta}{4\pi\epsilon_0 r^2}.$$

This is not, however, the term that concerns us now; we are interested in
the fields that survive at large distances from the source, in the so-called
radiation zone:⁶

$$\text{approximation 3:}\quad r \gg \frac{c}{\omega} \tag{11.13}$$

(or, in terms of the wavelength, $r \gg \lambda$). In this region the potential
reduces to⁷

$$V(r, \theta, t) = -\frac{p_0\omega}{4\pi\epsilon_0 c}\left(\frac{\cos\theta}{r}\right)\sin[\omega(t - r/c)]. \tag{11.14}$$


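Meanwhile, the vector potential is determined by the current flowing in
the wire:

$$\mathbf{I}(t) = \frac{dq}{dt}\,\hat{\mathbf{z}} = -q_0\omega\sin(\omega t)\,\hat{\mathbf{z}}. \tag{11.15}$$

Referring to Fig. 11.3 (and Eq. 10.26),

$$\mathbf{A}(\mathbf{r},t) = \frac{\mu_0}{4\pi}\int_{-d/2}^{d/2}\frac{-q_0\omega\sin[\omega(t-\mathscr{r}/c)]\,\hat{\mathbf{z}}}{\mathscr{r}}\,dz. \tag{11.16}$$

Because the integration itself introduces a factor of $d$, we can, to first order,
replace the integrand by its value at the center:

$$\mathbf{A}(r,\theta,t) = -\frac{\mu_0 p_0\omega}{4\pi r}\sin[\omega(t-r/c)]\,\hat{\mathbf{z}}. \tag{11.17}$$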


6 Note that approximations 2 and 3 subsume approximation 1; all together, we have
$d \ll \lambda \ll r$.

7 I am dropping the fastidious $\cong$ in favor of $=$; it’s not the exact potential, but it is the relevant
part of that potential, if you’re talking about radiation.
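(Notice that whereas I implicitly used approximations 1 and 2, in keeping
only the first order in $d$, Eq. 11.17 is not subject to approximation 3.)
From the potentials, it is a straightforward matter to compute the fields:

$$\nabla V = \frac{\partial V}{\partial r}\,\hat{\mathbf{r}} + \frac{1}{r}\frac{\partial V}{\partial\theta}\,\hat{\boldsymbol{\theta}}$$
$$= -\frac{p_0\omega}{4\pi\epsilon_0 c}\left\{\cos\theta\left(-\frac{1}{r^2}\sin[\omega(t-r/c)] - \frac{\omega}{rc}\cos[\omega(t-r/c)]\right)\hat{\mathbf{r}} - \frac{\sin\theta}{r^2}\sin[\omega(t-r/c)]\,\hat{\boldsymbol{\theta}}\right\}$$
$$\cong \frac{p_0\omega^2}{4\pi\epsilon_0 c^2}\left(\frac{\cos\theta}{r}\right)\cos[\omega(t-r/c)]\,\hat{\mathbf{r}}.$$

(I dropped the first and last terms, in accordance with approximation 3.)
Likewise,

$$\frac{\partial\mathbf{A}}{\partial t} = -\frac{\mu_0 p_0\omega^2}{4\pi r}\cos[\omega(t-r/c)]\,(\cos\theta\,\hat{\mathbf{r}} - \sin\theta\,\hat{\boldsymbol{\theta}}),$$

and therefore

$$\mathbf{E} = -\nabla V - \frac{\partial\mathbf{A}}{\partial t} = -\frac{\mu_0 p_0\omega^2}{4\pi}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\,\hat{\boldsymbol{\theta}}. \tag{11.18}$$

Meanwhile,

$$\nabla\times\mathbf{A} = \frac{1}{r}\left[\frac{\partial}{\partial r}(rA_\theta) - \frac{\partial A_r}{\partial\theta}\right]\hat{\boldsymbol{\phi}}$$
$$= -\frac{\mu_0 p_0\omega}{4\pi r}\left\{\frac{\omega}{c}\sin\theta\cos[\omega(t-r/c)] + \frac{\sin\theta}{r}\sin[\omega(t-r/c)]\right\}\hat{\boldsymbol{\phi}}.$$

The second term is again eliminated by approximation 3, so

$$\mathbf{B} = \nabla\times\mathbf{A} = -\frac{\mu_0 p_0\omega^2}{4\pi c}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\,\hat{\boldsymbol{\phi}}. \tag{11.19}$$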


Equations 11.18 and 11.19 represent monochromatic waves of frequency
$\omega$ traveling in the radial direction at the speed of light. The fields are in
phase, mutually perpendicular, and transverse; the ratio of their amplitudes
is $E_0/B_0 = c$. All of this is precisely what we expect for electromagnetic waves
in free space. (These are actually spherical waves, not plane waves, and 
their amplitude decreases like 1/r as they progress. But for large r, they are 
approximately plane over small regions — just as the surface of the Earth is 
reasonably flat, locally.) 

The energy radiated by an oscillating electric dipole is determined by the 
Poynting vector: 


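$$\mathbf{S}(\mathbf{r},t) = \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}) = \frac{\mu_0}{c}\left\{\frac{p_0\omega^2}{4\pi}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\right\}^2\hat{\mathbf{r}}. \tag{11.20}$$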
The intensity is obtained by averaging (in time) over a complete cycle:

$$\langle\mathbf{S}\rangle = \left(\frac{\mu_0 p_0^2\omega^4}{32\pi^2 c}\right)\frac{\sin^2\theta}{r^2}\,\hat{\mathbf{r}}. \tag{11.21}$$

Notice that there is no radiation along the axis of the dipole (here $\sin\theta = 0$);
the intensity profile⁸ takes the form of a donut, with its maximum in
the equatorial plane (Fig. 11.4). The total power radiated is found by
integrating $\langle\mathbf{S}\rangle$ over a sphere of radius $r$:

$$\langle P\rangle = \int\langle\mathbf{S}\rangle\cdot d\mathbf{a} = \frac{\mu_0 p_0^2\omega^4}{32\pi^2 c}\int\frac{\sin^2\theta}{r^2}\,r^2\sin\theta\,d\theta\,d\phi = \frac{\mu_0 p_0^2\omega^4}{12\pi c}. \tag{11.22}$$
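The step from Eq. 11.21 to Eq. 11.22 rests on the angular integral of sin²θ over the sphere, which equals 8π/3. A minimal Python sketch (a plain midpoint rule; the function name and grid size are arbitrary choices, not anything from the text) confirms the number and the resulting coefficient 1/12π:

```python
import math

def angular_integral(n_theta=20000):
    """Midpoint-rule estimate of the integral of sin^2(theta) * sin(theta)
    over theta in [0, pi], times the trivial phi integral (2*pi)."""
    dtheta = math.pi / n_theta
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * dtheta
        total += math.sin(theta) ** 3 * dtheta
    return 2 * math.pi * total

numeric = angular_integral()
exact = 8 * math.pi / 3

# Coefficient multiplying mu0 * p0^2 * omega^4 / c in Eq. 11.22
coefficient = numeric / (32 * math.pi**2)
print(numeric, exact, coefficient, 1 / (12 * math.pi))
```

The r² in the area element cancels the 1/r² in the intensity, which is why the radius of the sphere never enters the code.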


Example 11.1. The strong frequency dependence of the power 
formula is what accounts for the blueness of the sky. Sunlight 
passing through the atmosphere stimulates atoms to oscillate as 
tiny dipoles. The incident solar radiation covers a broad range of 
frequencies (white light), but the energy absorbed and reradiated 
by the atmospheric dipoles is stronger at the higher frequencies 
because of the $\omega^4$ in Eq. 11.22. It is more intense in the blue, then,
than in the red. It is this reradiated light that you see when you 
look up in the sky — unless, of course, you’re staring directly at 
the Sun. 

Because electromagnetic waves are transverse, the dipoles oscillate 
in a plane orthogonal to the Sun’s rays. In the celestial arc perpendic- 
ular to these rays, where the blueness is most pronounced, the dipoles 
oscillating along the line of sight send no radiation to the observer 
(because of the $\sin^2\theta$ in Eq. 11.21); light received at this angle is
therefore polarized perpendicular to the Sun’s rays (Fig. 11.5). 

The redness of sunset is the other side of the same coin: Sunlight 
coming in along a tangent to the Earth’s surface must pass through 
a much longer stretch of atmosphere than sunlight coming from 
overhead (Fig. 11.6). Accordingly, much of the blue has been removed 
by scattering, and what’s left is red. 
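To put a rough number on the ω⁴ dependence, the sketch below compares the reradiated power at two wavelengths using only the scaling P ∝ ω⁴ ∝ 1/λ⁴ from Eq. 11.22. The wavelengths (450 nm for blue, 650 nm for red) are assumed representative values, and the estimate further assumes the same dipole amplitude p₀ at both frequencies; it ignores the solar spectrum and the response of the eye.

```python
def scattering_ratio(lambda_short, lambda_long):
    """Ratio of radiated power at the shorter wavelength to that at the
    longer one, using P proportional to omega^4, i.e. to 1/lambda^4."""
    return (lambda_long / lambda_short) ** 4

# Assumed representative wavelengths, in meters
blue, red = 450e-9, 650e-9
ratio = scattering_ratio(blue, red)
print(round(ratio, 2))  # about 4.35
```

Under these assumptions the blue light is reradiated roughly four times more strongly than the red.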


8 The radial coordinate in Fig. 11.4 represents the magnitude of $\langle\mathbf{S}\rangle$ in that direction.

[Fig. 11.5: Sun’s rays; one dipole labeled “This dipole does not radiate to the observer,” the other “This dipole radiates to the observer.”]

[Fig. 11.6: Atmosphere (thickness grossly exaggerated).]


Problem 11.1. Check that the retarded potentials of an oscillating dipole (Eqs. 11.12 and 11.17) 
satisfy the Lorenz gauge condition. Do not use approximation 3. 


Problem 11.2. Equation 11.14 can be expressed in “coordinate-free” form by writing
$p_0\cos\theta = \mathbf{p}_0\cdot\hat{\mathbf{r}}$. Do so, and likewise for Eqs. 11.17, 11.18, 11.19, and 11.21.


Problem 11.3. Find the radiation resistance of the wire joining the two ends of the dipole. 
(This is the resistance that would give the same average power loss — to heat — as the oscillating 
dipole in fact puts out in the form of radiation.) Show that $R = 790\,(d/\lambda)^2\ \Omega$, where $\lambda$ is
the wavelength of the radiation. For the wires in an ordinary radio (say, d = 5 cm), should
you worry about the radiative contribution to the total resistance? 


Problem 11.4. A rotating electric dipole can be thought of as the superposition of two oscillat- 
ing dipoles, one along the x-axis and the other along the y-axis (Fig. 11.7), with the latter out 
of phase by 90°: 

$$\mathbf{p} = p_0[\cos(\omega t)\,\hat{\mathbf{x}} + \sin(\omega t)\,\hat{\mathbf{y}}].$$

Using the principle of superposition and Eqs. 11.18 and 11.19 (perhaps in the form suggested by
Prob. 11.2), find the fields of a rotating dipole. Also find the Poynting vector and the intensity of
the radiation. Sketch the intensity profile as a function of the polar angle $\theta$, and calculate the
total power radiated. Does the answer seem reasonable? (Note that power, being quadratic in
the fields, does not satisfy the superposition principle. In this instance, however, it seems to. How
do you account for this? You might want to refer to the “caution” at the end of Section 8.2.4.)


Fig. 11.7


11.1.3 Magnetic Dipole Radiation 


Suppose now that we have a wire loop of radius b (Fig. 11.8), around which 
we drive an alternating current: 


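$$I(t) = I_0\cos(\omega t). \tag{11.23}$$

This is a model for an oscillating magnetic dipole,

$$\mathbf{m}(t) = \pi b^2 I(t)\,\hat{\mathbf{z}} = m_0\cos(\omega t)\,\hat{\mathbf{z}}, \tag{11.24}$$

where

$$m_0 \equiv \pi b^2 I_0 \tag{11.25}$$

is the maximum value of the magnetic dipole moment.
The loop is uncharged, so the scalar potential is zero. The retarded vector
potential is

$$\mathbf{A}(\mathbf{r},t) = \frac{\mu_0}{4\pi}\int\frac{I_0\cos[\omega(t-\mathscr{r}/c)]}{\mathscr{r}}\,d\mathbf{l}'. \tag{11.26}$$

For a point $\mathbf{r}$ directly above the x-axis (Fig. 11.8), $\mathbf{A}$ must aim in the
y direction, since the x components from symmetrically placed points on
either side of the x-axis will cancel. Thus

$$\mathbf{A}(\mathbf{r},t) = \frac{\mu_0 I_0 b}{4\pi}\,\hat{\mathbf{y}}\int_0^{2\pi}\frac{\cos[\omega(t-\mathscr{r}/c)]}{\mathscr{r}}\cos\phi'\,d\phi' \tag{11.27}$$

($\cos\phi'$ serves to pick out the y-component of $d\mathbf{l}'$). By the law of cosines,

$$\mathscr{r} = \sqrt{r^2 + b^2 - 2rb\cos\psi},$$

where $\psi$ is the angle between the vectors $\mathbf{r}$ and $\mathbf{b}$:

$$\mathbf{r} = r\sin\theta\,\hat{\mathbf{x}} + r\cos\theta\,\hat{\mathbf{z}}, \qquad \mathbf{b} = b\cos\phi'\,\hat{\mathbf{x}} + b\sin\phi'\,\hat{\mathbf{y}}.$$

So $rb\cos\psi = \mathbf{r}\cdot\mathbf{b} = rb\sin\theta\cos\phi'$, and therefore

$$\mathscr{r} = \sqrt{r^2 + b^2 - 2rb\sin\theta\cos\phi'}. \tag{11.28}$$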
For a “perfect” dipole, we want the loop to be extremely small:

$$\text{approximation 1:}\quad b \ll r. \tag{11.29}$$

To first order in $b$, then,

$$\mathscr{r} \cong r\left(1 - \frac{b}{r}\sin\theta\cos\phi'\right),$$

so

$$\frac{1}{\mathscr{r}} \cong \frac{1}{r}\left(1 + \frac{b}{r}\sin\theta\cos\phi'\right) \tag{11.30}$$

and

$$\cos[\omega(t-\mathscr{r}/c)] \cong \cos\left[\omega(t-r/c) + \frac{\omega b}{c}\sin\theta\cos\phi'\right]$$
$$= \cos[\omega(t-r/c)]\cos\left(\frac{\omega b}{c}\sin\theta\cos\phi'\right) - \sin[\omega(t-r/c)]\sin\left(\frac{\omega b}{c}\sin\theta\cos\phi'\right).$$

As before, we also assume the size of the dipole is small compared to the
wavelength of the radiation:

$$\text{approximation 2:}\quad b \ll \frac{c}{\omega}. \tag{11.31}$$

In that case,

$$\cos[\omega(t-\mathscr{r}/c)] \cong \cos[\omega(t-r/c)] - \frac{\omega b}{c}\sin\theta\cos\phi'\,\sin[\omega(t-r/c)]. \tag{11.32}$$

Inserting Eqs. 11.30 and 11.32 into Eq. 11.27, and dropping the second-order term,

$$\mathbf{A}(\mathbf{r},t) \cong \frac{\mu_0 I_0 b}{4\pi r}\,\hat{\mathbf{y}}\int_0^{2\pi}\left\{\cos[\omega(t-r/c)] + b\sin\theta\cos\phi'\left(\frac{1}{r}\cos[\omega(t-r/c)] - \frac{\omega}{c}\sin[\omega(t-r/c)]\right)\right\}\cos\phi'\,d\phi'.$$

The first term integrates to zero:

$$\int_0^{2\pi}\cos\phi'\,d\phi' = 0.$$

The second term involves the integral of cosine squared:

$$\int_0^{2\pi}\cos^2\phi'\,d\phi' = \pi.$$


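Putting this in, and noting that in general $\mathbf{A}$ points in the $\hat{\boldsymbol{\phi}}$ direction,
I conclude that the vector potential of an oscillating perfect magnetic
dipole is

$$\mathbf{A}(r,\theta,t) = \frac{\mu_0 m_0}{4\pi}\left(\frac{\sin\theta}{r}\right)\left\{\frac{1}{r}\cos[\omega(t-r/c)] - \frac{\omega}{c}\sin[\omega(t-r/c)]\right\}\hat{\boldsymbol{\phi}}. \tag{11.33}$$

In the static limit ($\omega = 0$) we recover the familiar formula for the potential
of a magnetic dipole (Eq. 5.87):

$$\mathbf{A}(r,\theta) = \frac{\mu_0}{4\pi}\frac{m_0\sin\theta}{r^2}\,\hat{\boldsymbol{\phi}}.$$

In the radiation zone,

$$\text{approximation 3:}\quad r \gg \frac{c}{\omega}, \tag{11.34}$$

the first term in $\mathbf{A}$ is negligible, so

$$\mathbf{A}(r,\theta,t) = -\frac{\mu_0 m_0\omega}{4\pi c}\left(\frac{\sin\theta}{r}\right)\sin[\omega(t-r/c)]\,\hat{\boldsymbol{\phi}}. \tag{11.35}$$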
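To see how quickly approximation 3 takes hold, one can compare the amplitudes of the two terms in Eq. 11.33: the 1/r² (static-like) term and the (ω/c)/r (radiation) term. Their ratio is c/(ωr) = λ/(2πr). The short Python sketch below tabulates it for a few assumed distances, measured in wavelengths; the function name is an illustrative choice.

```python
import math

def near_to_radiation_ratio(r_over_lambda):
    """Amplitude ratio of the (1/r) cos term to the (omega/c) sin term
    inside the braces of Eq. 11.33, which equals lambda / (2*pi*r)."""
    return 1.0 / (2 * math.pi * r_over_lambda)

# Assumed sample distances, in units of the wavelength
ratios = {x: near_to_radiation_ratio(x) for x in (0.1, 1.0, 10.0)}
for x, value in ratios.items():
    print(f"r = {x} wavelengths: ratio = {value:.4f}")
```

At a tenth of a wavelength the static-like term dominates; at ten wavelengths it is under two percent of the radiation term.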


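From $\mathbf{A}$ we obtain the fields at large $r$:

$$\mathbf{E} = -\frac{\partial\mathbf{A}}{\partial t} = \frac{\mu_0 m_0\omega^2}{4\pi c}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\,\hat{\boldsymbol{\phi}}, \tag{11.36}$$

and

$$\mathbf{B} = \nabla\times\mathbf{A} = -\frac{\mu_0 m_0\omega^2}{4\pi c^2}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\,\hat{\boldsymbol{\theta}}. \tag{11.37}$$

(I used approximation 3 in calculating $\mathbf{B}$.) These fields are in phase, mutually
perpendicular, and transverse to the direction of propagation ($\hat{\mathbf{r}}$), and
the ratio of their amplitudes is $E_0/B_0 = c$, all of which is as expected for
electromagnetic waves. They are, in fact, remarkably similar in structure to
the fields of an oscillating electric dipole (Eqs. 11.18 and 11.19), only this
time it is $\mathbf{B}$ that points in the $\hat{\boldsymbol{\theta}}$ direction and $\mathbf{E}$ in the $\hat{\boldsymbol{\phi}}$ direction, whereas
for electric dipoles it’s the other way around.
The energy flux for magnetic dipole radiation is

$$\mathbf{S}(\mathbf{r},t) = \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}) = \frac{\mu_0}{c}\left\{\frac{m_0\omega^2}{4\pi c}\left(\frac{\sin\theta}{r}\right)\cos[\omega(t-r/c)]\right\}^2\hat{\mathbf{r}}, \tag{11.38}$$

the intensity is

$$\langle\mathbf{S}\rangle = \left(\frac{\mu_0 m_0^2\omega^4}{32\pi^2 c^3}\right)\frac{\sin^2\theta}{r^2}\,\hat{\mathbf{r}}, \tag{11.39}$$

and the total radiated power is

$$\langle P\rangle = \frac{\mu_0 m_0^2\omega^4}{12\pi c^3}. \tag{11.40}$$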


Once again, the intensity profile has the shape of a donut (Fig. 11.4),
and the power radiated goes like $\omega^4$. There is, however, one important
difference between electric and magnetic dipole radiation: for configurations
with comparable dimensions, the power radiated electrically is enormously
greater. Comparing Eqs. 11.22 and 11.40,

$$\frac{P_{\text{magnetic}}}{P_{\text{electric}}} = \left(\frac{m_0}{p_0 c}\right)^2, \tag{11.41}$$

where (remember) $m_0 = \pi b^2 I_0$, and $p_0 = q_0 d$. The amplitude of the current
in the electrical case was $I_0 = q_0\omega$ (Eq. 11.15). Setting $d = \pi b$, for the sake
of comparison, I get

$$\frac{P_{\text{magnetic}}}{P_{\text{electric}}} = \left(\frac{\omega b}{c}\right)^2. \tag{11.42}$$

But $\omega b/c$ is precisely the quantity we assumed was very small (approximation 2),
and here it appears squared. Ordinarily, then, one should expect
electric dipole radiation to dominate. Only when the system is carefully 
contrived to exclude any electric contribution (as in the case just treated) 
will the magnetic dipole radiation reveal itself. 
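For a sense of scale, Eq. 11.42 can be evaluated for a specific loop. The values below (a 1 cm loop driven at 100 MHz) are assumptions picked purely for illustration, as is the function name.

```python
import math

C = 299792458.0  # speed of light (m/s)

def magnetic_to_electric_ratio(freq_hz, b):
    """Power ratio of Eq. 11.42, (omega*b/c)^2, for a loop of radius b
    compared with an electric dipole of length d = pi*b."""
    omega = 2 * math.pi * freq_hz
    return (omega * b / C) ** 2

# Assumed example: 100 MHz drive, loop radius 1 cm
ratio = magnetic_to_electric_ratio(1e8, 0.01)
print(f"{ratio:.2e}")
```

For these numbers ωb/c is about 0.02, so the magnetic dipole power comes out a few parts in ten thousand of the electric dipole power.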


Problem 11.5. Calculate the electric and magnetic fields of an oscillating magnetic dipole 
without using approximation 3. [Do they look familiar? Compare Prob. 9.37.] Find the Poynting 
vector, and show that the intensity of the radiation is exactly the same as we got using 
approximation 3. 


Problem 11.6. Find the radiation resistance (Prob. 11.3) for the oscillating magnetic dipole in 
Fig. 11.8. Express your answer in terms of $\lambda$ and $b$, and compare the radiation resistance of the
electric dipole. [Answer: $3\times 10^5\,(b/\lambda)^4\ \Omega$.]


Problem 11.7. Use the “duality” transformation Prob. 7.68, together with the fields of an 
oscillating electric dipole (Eqs. 11.18 and 11.19), to determine the fields that would be produced 
by an oscillating “Gilbert” magnetic dipole (composed of equal and opposite magnetic charges, 
instead of an electric current loop). Compare Eqs. 11.36 and 11.37, and comment on the result. 


11.1.4 Radiation from an Arbitrary Source 


In the previous sections, we studied the radiation produced by two specific 
systems: oscillating electric dipoles and oscillating magnetic dipoles. Now 
I want to apply the same reasoning to a configuration of charge and current 
that is entirely arbitrary, except that it is localized within some finite volume 
near the origin (Fig. 11.9). The retarded scalar potential is 
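$$V(\mathbf{r},t) = \frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}', t-\mathscr{r}/c)}{\mathscr{r}}\,d\tau', \tag{11.43}$$

where

$$\mathscr{r} = \sqrt{r^2 + r'^2 - 2\mathbf{r}\cdot\mathbf{r}'}. \tag{11.44}$$

As before, we shall assume that the field point $\mathbf{r}$ is far away, in comparison
to the dimensions of the source:

$$\text{approximation 1:}\quad r' \ll r. \tag{11.45}$$

(Actually, $r'$ is a variable of integration; approximation 1 means that the
maximum value of $r'$, as it ranges over the source, is much less than $r$.) On
this assumption,

$$\mathscr{r} \cong r\left(1 - \frac{\mathbf{r}\cdot\mathbf{r}'}{r^2}\right), \tag{11.46}$$

so

$$\frac{1}{\mathscr{r}} \cong \frac{1}{r}\left(1 + \frac{\mathbf{r}\cdot\mathbf{r}'}{r^2}\right) \tag{11.47}$$

and

$$\rho(\mathbf{r}', t-\mathscr{r}/c) \cong \rho\left(\mathbf{r}', t - \frac{r}{c} + \frac{\hat{\mathbf{r}}\cdot\mathbf{r}'}{c}\right).$$

Expanding $\rho$ as a Taylor series in $t$ about the retarded time at the origin,

$$t_0 \equiv t - \frac{r}{c}, \tag{11.48}$$

we have

$$\rho(\mathbf{r}', t-\mathscr{r}/c) \cong \rho(\mathbf{r}', t_0) + \dot{\rho}(\mathbf{r}', t_0)\left(\frac{\hat{\mathbf{r}}\cdot\mathbf{r}'}{c}\right) + \cdots \tag{11.49}$$

where the dot signifies differentiation with respect to time. The next terms
in the series would be

$$\frac{1}{2}\ddot{\rho}\left(\frac{\hat{\mathbf{r}}\cdot\mathbf{r}'}{c}\right)^2, \quad \frac{1}{3!}\dddot{\rho}\left(\frac{\hat{\mathbf{r}}\cdot\mathbf{r}'}{c}\right)^3, \quad \ldots$$

We can afford to drop them, provided

$$\text{approximation 2:}\quad r' \ll \frac{c}{|\ddot{\rho}/\dot{\rho}|},\ \frac{c}{|\dddot{\rho}/\dot{\rho}|^{1/2}},\ \frac{c}{|\ddddot{\rho}/\dot{\rho}|^{1/3}},\ \ldots \tag{11.50}$$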
For an oscillating system, each of these ratios is $c/\omega$, and we recover the
old approximation 2. In the general case it’s more difficult to interpret
Eq. 11.50, but as a procedural matter approximations 1 and 2 amount to
keeping only the first-order terms in $r'$.

Putting Eqs. 11.47 and 11.49 into the formula for $V$ (Eq. 11.43), and
again discarding the second-order term,

$$V(\mathbf{r},t) \cong \frac{1}{4\pi\epsilon_0 r}\left[\int\rho(\mathbf{r}',t_0)\,d\tau' + \frac{\hat{\mathbf{r}}}{r}\cdot\int\mathbf{r}'\rho(\mathbf{r}',t_0)\,d\tau' + \frac{\hat{\mathbf{r}}}{c}\cdot\frac{d}{dt}\int\mathbf{r}'\rho(\mathbf{r}',t_0)\,d\tau'\right].$$

The first integral is simply the total charge, $Q$, at time $t_0$. Because charge is
conserved, $Q$ is independent of time. The other two integrals represent the
electric dipole moment at time $t_0$. Thus

$$V(\mathbf{r},t) \cong \frac{1}{4\pi\epsilon_0}\left[\frac{Q}{r} + \frac{\hat{\mathbf{r}}\cdot\mathbf{p}(t_0)}{r^2} + \frac{\hat{\mathbf{r}}\cdot\dot{\mathbf{p}}(t_0)}{rc}\right]. \tag{11.51}$$


In the static case, the first two terms are the monopole and dipole contri- 
butions to the multipole expansion for V; the third term, of course, would 
not be present. 

Meanwhile, the vector potential is 


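$$\mathbf{A}(\mathbf{r},t) = \frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}', t-\mathscr{r}/c)}{\mathscr{r}}\,d\tau'. \tag{11.52}$$

As you’ll see in a moment, to first order in $r'$ it suffices to replace $\mathscr{r}$ by $r$ in
the integrand:

$$\mathbf{A}(\mathbf{r},t) \cong \frac{\mu_0}{4\pi r}\int\mathbf{J}(\mathbf{r}', t_0)\,d\tau'. \tag{11.53}$$

According to Prob. 5.7, the integral of $\mathbf{J}$ is the time derivative of the dipole
moment, so

$$\mathbf{A}(\mathbf{r},t) \cong \frac{\mu_0}{4\pi}\frac{\dot{\mathbf{p}}(t_0)}{r}. \tag{11.54}$$

Now you see why it was unnecessary to carry the approximation of $\mathscr{r}$
beyond the zeroth order ($\mathscr{r} \cong r$): $\dot{\mathbf{p}}$ is already first order in $r'$, and any
refinements would be corrections of second order (or higher).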

Next we must calculate the fields. Once again, we are interested in the 
radiation zone (that is, in the fields that survive at large distances from the 
source), so we keep only those terms that go like 1/r: 


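$$\text{approximation 3:}\quad \text{discard } 1/r^2 \text{ terms in } \mathbf{E} \text{ and } \mathbf{B}. \tag{11.55}$$

For instance, the Coulomb field,

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{Q}{r^2}\,\hat{\mathbf{r}},$$

coming from the first term in Eq. 11.51, does not contribute to the
electromagnetic radiation. In fact, the radiation comes entirely from those terms
in which we differentiate the argument $t_0$. From Eq. 11.48 it follows that

$$\nabla t_0 = -\frac{1}{c}\nabla r = -\frac{1}{c}\,\hat{\mathbf{r}},$$

and hence

$$\nabla V \cong \nabla\left[\frac{1}{4\pi\epsilon_0}\frac{\hat{\mathbf{r}}\cdot\dot{\mathbf{p}}(t_0)}{rc}\right] \cong \frac{1}{4\pi\epsilon_0}\left[\frac{\hat{\mathbf{r}}\cdot\ddot{\mathbf{p}}(t_0)}{rc}\right]\nabla t_0 = -\frac{1}{4\pi\epsilon_0 c^2}\frac{[\hat{\mathbf{r}}\cdot\ddot{\mathbf{p}}(t_0)]}{r}\,\hat{\mathbf{r}}.$$

Similarly,

$$\nabla\times\mathbf{A} \cong \frac{\mu_0}{4\pi r}[\nabla\times\dot{\mathbf{p}}(t_0)] = \frac{\mu_0}{4\pi r}[(\nabla t_0)\times\ddot{\mathbf{p}}(t_0)] = -\frac{\mu_0}{4\pi rc}[\hat{\mathbf{r}}\times\ddot{\mathbf{p}}(t_0)],$$

while

$$\frac{\partial\mathbf{A}}{\partial t} \cong \frac{\mu_0}{4\pi}\frac{\ddot{\mathbf{p}}(t_0)}{r}.$$

So

$$\mathbf{E}(\mathbf{r},t) \cong \frac{\mu_0}{4\pi r}[(\hat{\mathbf{r}}\cdot\ddot{\mathbf{p}})\hat{\mathbf{r}} - \ddot{\mathbf{p}}] = \frac{\mu_0}{4\pi r}[\hat{\mathbf{r}}\times(\hat{\mathbf{r}}\times\ddot{\mathbf{p}})], \tag{11.56}$$

where $\ddot{\mathbf{p}}$ is evaluated at time $t_0 = t - r/c$, and

$$\mathbf{B}(\mathbf{r},t) \cong -\frac{\mu_0}{4\pi rc}[\hat{\mathbf{r}}\times\ddot{\mathbf{p}}]. \tag{11.57}$$

In particular, if we use spherical polar coordinates, with the z-axis in the
direction of $\ddot{\mathbf{p}}(t_0)$, then

$$\mathbf{E}(r,\theta,t) \cong \frac{\mu_0\ddot{p}(t_0)}{4\pi}\left(\frac{\sin\theta}{r}\right)\hat{\boldsymbol{\theta}}, \qquad \mathbf{B}(r,\theta,t) \cong \frac{\mu_0\ddot{p}(t_0)}{4\pi c}\left(\frac{\sin\theta}{r}\right)\hat{\boldsymbol{\phi}}. \tag{11.58}$$

The Poynting vector is

$$\mathbf{S}(\mathbf{r},t) \cong \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}) = \frac{\mu_0}{16\pi^2 c}[\ddot{p}(t_0)]^2\left(\frac{\sin^2\theta}{r^2}\right)\hat{\mathbf{r}}, \tag{11.59}$$

the power passing through a giant spherical surface at radius $r$ is

$$P(r,t) = \oint\mathbf{S}(\mathbf{r},t)\cdot d\mathbf{a} = \frac{\mu_0}{6\pi c}\left[\ddot{p}\left(t - \frac{r}{c}\right)\right]^2,$$

and the total radiated power (Eq. 11.2) is

$$P_{\text{rad}}(t_0) \cong \frac{\mu_0}{6\pi c}[\ddot{p}(t_0)]^2. \tag{11.60}$$

Notice that $\mathbf{E}$ and $\mathbf{B}$ are mutually perpendicular, transverse to the direction
of propagation ($\hat{\mathbf{r}}$), and in the ratio $E/B = c$, as always for radiation fields.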


Example 11.2. 


(a) In the case of an oscillating electric dipole,

$$p(t) = p_0\cos(\omega t), \qquad \ddot{p}(t) = -\omega^2 p_0\cos(\omega t),$$

and we recover the results of Section 11.1.2.
(b) For a single point charge $q$, the dipole moment is

$$\mathbf{p}(t) = q\,\mathbf{d}(t),$$

where $\mathbf{d}$ is the position of $q$ with respect to the origin. Accordingly,

$$\ddot{\mathbf{p}}(t) = q\,\mathbf{a}(t),$$

where $\mathbf{a}$ is the acceleration of the charge. In this case the power
radiated (Eq. 11.60) is

$$P = \frac{\mu_0 q^2 a^2}{6\pi c}. \tag{11.61}$$


This is the famous Larmor formula; I'll derive it again, by rather 
different means, in the next section. Notice that the power radiated 
by a point charge is proportional to the square of its acceleration. 
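As a small illustration of Eq. 11.61, here is a Python sketch that evaluates the Larmor power for an electron. The acceleration (10²⁰ m/s²) is an arbitrary assumed value, the function name is illustrative, and the formula is only meaningful for speeds well below c.

```python
import math

MU0 = 1.25663706212e-6      # vacuum permeability (N/A^2)
C = 299792458.0             # speed of light (m/s)
E_CHARGE = 1.602176634e-19  # elementary charge (C)

def larmor_power(q, a):
    """Power radiated by a slowly moving point charge, Eq. 11.61."""
    return MU0 * q**2 * a**2 / (6 * math.pi * C)

# Assumed example: an electron with acceleration 1e20 m/s^2
power = larmor_power(E_CHARGE, 1e20)
power_doubled = larmor_power(E_CHARGE, 2e20)
print(f"{power:.2e} W")
```

Doubling the acceleration quadruples the radiated power, which is the quadratic dependence noted above.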


What I have done in this section amounts to a multipole expansion of 
the retarded potentials, carried to the lowest order in r’ that is capable 
of producing electromagnetic radiation (fields that go like 1/r). This turns 
out to be the electric dipole term. Because charge is conserved, an electric 
monopole does not radiate — if charge were not conserved, the first term in 
Eq. 11.51 would read 

$$V_{\text{mono}} = \frac{1}{4\pi\epsilon_0}\frac{Q(t_0)}{r},$$

and we would get a monopole field proportional to $1/r$:

$$\mathbf{E}_{\text{mono}} = \frac{1}{4\pi\epsilon_0 c}\frac{\dot{Q}(t_0)}{r}\,\hat{\mathbf{r}}.$$

You might think that a charged sphere whose radius oscillates in and out
would radiate, but it doesn’t: the field outside, according to Gauss’s law, is
exactly $(Q/4\pi\epsilon_0 r^2)\,\hat{\mathbf{r}}$, regardless of the fluctuations in size. (In the acoustical
analog, by the way, monopoles do radiate: witness the croak of a bullfrog.)

If the electric dipole moment should happen to vanish (or, at any rate, if 
its second time derivative is zero), then there is no electric dipole radiation, 
and one must look to the next term: the one of second order in r’. As 
it happens, this term can be separated into two parts, one of which is 
related to the magnetic dipole moment of the source, the other to its electric 
quadrupole moment. (The former is a generalization of the magnetic dipole 
radiation we considered in Section 11.1.3.) If the magnetic dipole and 
electric quadrupole contributions vanish, the $(r')^3$ term must be considered.
This yields magnetic quadrupole and electric octopole radiation ... and so 
it goes. 


Problem 11.8. A parallel-plate capacitor C, with plate separation d, is given an initial charge
$(\pm)Q_0$. It is then connected to a resistor R, and discharges, $Q(t) = Q_0 e^{-t/RC}$.
(a) What fraction of its initial energy ($Q_0^2/2C$) does it radiate away?
(b) If C = 1 pF, R = 1000 Ω, and d = 0.1 mm, what is the actual number? In electronics
we don’t ordinarily worry about radiative losses; does that seem reasonable, in this case?


Problem 11.9. Apply Eqs. 11.59 and 11.60 to the rotating dipole of Prob. 11.4. Explain any 
apparent discrepancies with your previous answer. 


Problem 11.10. An insulating circular ring (radius b) lies in the xy plane, centered at the 
origin. It carries a linear charge density $\lambda = \lambda_0\sin\phi$, where $\lambda_0$ is constant and $\phi$ is the
usual azimuthal angle. The ring is now set spinning at a constant angular velocity $\omega$ about the
z-axis. Calculate the power radiated. 


Problem 11.11. A current $I(t)$ flows around the circular ring in Fig. 11.8. Derive the general
formula for the power radiated (analogous to Eq. 11.60), expressing your answer in terms of the
magnetic dipole moment, $m(t)$, of the loop. [Answer: $P = \mu_0\ddot{m}^2/6\pi c^3$]


11.2 Power Radiated by a Point Charge 


11.2.1 The Larmor Formula 


In Chapter 10 we derived the fields of a point charge q in arbitrary motion 
(Eqs. 10.72 and 10.73): 


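$$\mathbf{E}(\mathbf{r},t) = \frac{q}{4\pi\epsilon_0}\frac{\mathscr{r}}{(\boldsymbol{\mathscr{r}}\cdot\mathbf{u})^3}\left[(c^2 - v^2)\mathbf{u} + \boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})\right], \tag{11.62}$$

where $\mathbf{u} \equiv c\,\hat{\boldsymbol{\mathscr{r}}} - \mathbf{v}$, and

$$\mathbf{B}(\mathbf{r},t) = \frac{1}{c}\,\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{E}(\mathbf{r},t). \tag{11.63}$$

The first term in Eq. 11.62 is the velocity field, and the second one (with the
triple cross product) is the acceleration field.
The Poynting vector is

$$\mathbf{S} = \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B}) = \frac{1}{\mu_0 c}[\mathbf{E}\times(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{E})] = \frac{1}{\mu_0 c}[E^2\,\hat{\boldsymbol{\mathscr{r}}} - (\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{E})\mathbf{E}]. \tag{11.64}$$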
However, not all of this energy flux constitutes radiation; some of it is just 
field energy carried along by the particle as it moves. The radiated energy is 
the stuff that, in effect, detaches itself from the charge and propagates off 
to infinity. (It’s like flies breeding on a garbage truck: some of them hover 
around the truck as it makes its rounds; others fly away and never come 
back.) To calculate the total power radiated by the particle at time $t$, we
draw a huge sphere of radius $\mathscr{r}$ (Fig. 11.10), centered at the position of the
particle (at time $t$). At the later time


a 
tp=t+ 2 (11.65) 
the radiation reaches the sphere, and at that moment we integrate the
Poynting vector over the surface.⁹ Notice that for all points on the sphere
(at time ¢;), £ = t, — 2/c is precisely the retarded time (Eq. 10.25), so in 
evaluating the fields we'll need the position, velocity, and acceleration of 
the charge at time t, when the radiation left the source. 

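Now, the area of the sphere is proportional to $\mathscr{r}^2$, so any term in $\mathbf{S}$ that
goes like $1/\mathscr{r}^2$ will yield a finite answer, but terms like $1/\mathscr{r}^3$ or $1/\mathscr{r}^4$ will
contribute nothing in the limit $\mathscr{r}\to\infty$. For this reason, only the
acceleration fields represent true radiation (hence their other name, radiation
fields):

$$\mathbf{E}_{\text{rad}} = \frac{q}{4\pi\epsilon_0}\frac{\mathscr{r}}{(\boldsymbol{\mathscr{r}}\cdot\mathbf{u})^3}[\boldsymbol{\mathscr{r}}\times(\mathbf{u}\times\mathbf{a})]. \tag{11.66}$$

The velocity fields carry energy, to be sure, and as the charge moves this
energy is dragged along, but it’s not radiation. (It’s like the flies that stay
with the garbage truck.) Now $\mathbf{E}_{\text{rad}}$ is perpendicular to $\hat{\boldsymbol{\mathscr{r}}}$, so the second term
in Eq. 11.64 vanishes:

$$\mathbf{S}_{\text{rad}} = \frac{1}{\mu_0 c}E_{\text{rad}}^2\,\hat{\boldsymbol{\mathscr{r}}}. \tag{11.67}$$

If the charge is instantaneously at rest (at time $t$), then $\mathbf{u} = c\,\hat{\boldsymbol{\mathscr{r}}}$, and

$$\mathbf{E}_{\text{rad}} = \frac{q}{4\pi\epsilon_0 c^2\mathscr{r}}[\hat{\boldsymbol{\mathscr{r}}}\times(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{a})] = \frac{\mu_0 q}{4\pi\mathscr{r}}[(\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{a})\hat{\boldsymbol{\mathscr{r}}} - \mathbf{a}]. \tag{11.68}$$

In that case,

$$\mathbf{S}_{\text{rad}} = \frac{1}{\mu_0 c}\left(\frac{\mu_0 q}{4\pi\mathscr{r}}\right)^2[a^2 - (\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{a})^2]\,\hat{\boldsymbol{\mathscr{r}}} = \frac{\mu_0 q^2 a^2}{16\pi^2 c}\left(\frac{\sin^2\theta}{\mathscr{r}^2}\right)\hat{\boldsymbol{\mathscr{r}}}, \tag{11.69}$$

where $\theta$ is the angle between $\hat{\boldsymbol{\mathscr{r}}}$ and $\mathbf{a}$. No power is radiated in the forward
or backward direction; rather, it is emitted in a donut about the direction
of instantaneous acceleration (Fig. 11.11). The total power radiated (at
time $t$) is

$$P = \oint\mathbf{S}_{\text{rad}}\cdot d\mathbf{a} = \frac{\mu_0 q^2 a^2}{16\pi^2 c}\int\frac{\sin^2\theta}{\mathscr{r}^2}\,\mathscr{r}^2\sin\theta\,d\theta\,d\phi$$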


9 There is a subtle change in strategy here: in Section 11.1 we worked from a fixed point (the
origin), but here it is more appropriate to use the (moving) location of the charge. The 
implications of this change in perspective will become clearer in a moment. 
(where $a$ is the acceleration at time $t$) or

$$P = \frac{\mu_0 q^2 a^2}{6\pi c}. \tag{11.70}$$

This, again, is the Larmor formula, which we obtained earlier by a quite
different route (Eq. 11.61). Although I derived Eqs. 11.69 and 11.70 on the
assumption that $v = 0$, they actually hold to good approximation as long
as $v \ll c$.¹⁰

I’ll treat the general case ($v \neq 0$) in a moment, but first I want to call
your attention to the connection between radiation and acceleration, as 
spotlighted by the Larmor formula. A particle at rest does not radiate; nor 
does a particle moving at constant velocity ... but if it’s accelerating, then 
it radiates. Wait a minute! What about a uniformly charged circular ring 
(radius $R$) rotating at a constant angular velocity ($\omega$) about its axis? That’s
a steady current, the electric and magnetic fields are static, and it certainly
doesn’t radiate. And yet, every point on the ring is accelerating ($a = \omega^2 R$).
Or consider a uniformly charged spherical shell (radius $R$) that “breathes”
in and out sinusoidally ($R(t) = R_0\cos\omega t$); for $r > R_0$ the electric field is
Coulombic, and the magnetic field is zero everywhere. No radiation ... and
yet every point on the shell is accelerating. Doesn’t the Larmor formula
say that acceleration generates radiation? Yes: but the Larmor formula
applies to point charges, whereas these are extended objects. Remember,
the Poynting vector does not satisfy the superposition principle: the total
power radiated is not the sum of the powers that would be radiated by each
electron separately. If you like, what we have in these examples is perfect
destructive interference in every direction.¹¹
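A partial numerical illustration of this cancellation: for N equal charges evenly spaced on a rotating ring, the second derivative of the total dipole moment is the sum of q times each charge's acceleration, and it vanishes for N ≥ 2. The Python sketch below checks only this electric dipole term (the quantity entering Eq. 11.60); for finite N the higher multipoles need not vanish, and only the continuous uniform ring is strictly static. All names and sample values are assumptions for illustration.

```python
import math

def net_dipole_acceleration(n_charges, radius=1.0, omega=1.0, q=1.0, t=0.0):
    """Return the (x, y) components of the sum over charges of q * a_i,
    for n_charges equally spaced on a ring rotating at angular velocity omega.
    Each charge has centripetal acceleration omega^2 * radius toward the center."""
    sum_x = 0.0
    sum_y = 0.0
    for k in range(n_charges):
        angle = omega * t + 2 * math.pi * k / n_charges
        sum_x += -q * omega**2 * radius * math.cos(angle)
        sum_y += -q * omega**2 * radius * math.sin(angle)
    return sum_x, sum_y

single = net_dipole_acceleration(1)   # one charge: nonzero, so it radiates
ring = net_dipole_acceleration(12)    # twelve charges: the sum cancels
print(single, ring)
```

A single orbiting charge has a nonzero result, while the twelve contributions sum to zero (up to rounding), even though each charge is individually accelerating.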


11.2.2 The Liénard Formula 


The exact treatment of the general case ($v \neq 0$) is harder, both for the
obvious reason that $\mathbf{E}_{\text{rad}}$ is more complicated, and also for the subtler reason


10 In the context of special relativity, the condition v = 0 simply represents an astute choice
of reference frame, with no essential loss of generality. If you can figure out how P
transforms, you can deduce the general (Liénard) result from the v = 0 (Larmor) formula
(see Prob. 12.74).

11 T. A. Abbott and D. J. Griffiths, Am. J. Phys. 53, 1203 (1985).
that $\mathbf{S}_{\text{rad}}$, the rate at which energy passes through the sphere (Fig. 11.10), is
not the same as the rate at which that energy left the particle. To understand
why, imagine someone is firing a stream of bullets out the window of a
moving car (Fig. 11.12). The rate $N_t$ at which the bullets strike a stationary
target is not the same as the rate $N_g$ at which they left the gun, because of
the motion of the car. In fact, you can easily check that $N_g = (1 - v/c)N_t$,
if the car is moving toward the target, and

$$N_g = \left(1 - \frac{\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{v}}{c}\right)N_t$$

for arbitrary directions (here $\mathbf{v}$ is the velocity of the car, $c$ is the speed of the
bullets relative to the ground, and $\hat{\boldsymbol{\mathscr{r}}}$ is a unit vector from car to target).
More formally, if dW/dt, is the rate at which energy passes through the 
sphere at radius 7, then the rate at which energy left the charge was 


dt dt; ðt \ ac’ dt,’ 
(I used Eq. 10.78 to get 07/dt,.) But 


au, hey 
a eS 
which is precisely the ratio of $N_g$ to $N_t$; it’s a purely geometrical factor (the
same as in the Doppler effect).
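The power radiated by the particle into a patch of area $\mathscr{r}^2\sin\theta\,d\theta\,d\phi =
\mathscr{r}^2\,d\Omega$ on the sphere is therefore given by

$$\frac{dP}{d\Omega} = \left(\frac{\boldsymbol{\mathscr{r}}\cdot\mathbf{u}}{\mathscr{r}c}\right)\frac{1}{\mu_0 c}E_{\text{rad}}^2\,\mathscr{r}^2 = \frac{q^2}{16\pi^2\epsilon_0}\frac{|\hat{\boldsymbol{\mathscr{r}}}\times(\mathbf{u}\times\mathbf{a})|^2}{(\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{u})^5}, \tag{11.72}$$

where $d\Omega = \sin\theta\,d\theta\,d\phi$ is the solid angle into which this power is radiated.
Integrating over $\theta$ and $\phi$ to get the total power radiated is no picnic, and
for now I shall simply quote the answer:

$$P = \int\frac{dP}{d\Omega}\sin\theta\,d\theta\,d\phi = \frac{\mu_0 q^2\gamma^6}{6\pi c}\left(a^2 - \left|\frac{\mathbf{v}\times\mathbf{a}}{c}\right|^2\right), \tag{11.73}$$

where $\gamma \equiv 1/\sqrt{1 - v^2/c^2}$. This is Liénard’s generalization of the Larmor
formula (to which it reduces when $v \ll c$). The factor $\gamma^6$ means that the
radiated power increases enormously as the particle velocity approaches
the speed of light.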
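The Liénard formula is easy to evaluate numerically. The sketch below codes Eq. 11.73 for arbitrary velocity and acceleration vectors and checks three limits: v = 0 (Larmor), v parallel to a (a factor γ⁶), and v perpendicular to a (a factor γ⁴, since a² − v²a²/c² = a²/γ²). The helper names, the unit charge, and the choice β = 0.6 are assumptions made for illustration.

```python
import math

MU0 = 1.25663706212e-6  # vacuum permeability (N/A^2)
C = 299792458.0         # speed of light (m/s)

def dot(u, w):
    return sum(ui * wi for ui, wi in zip(u, w))

def cross(u, w):
    return (u[1] * w[2] - u[2] * w[1],
            u[2] * w[0] - u[0] * w[2],
            u[0] * w[1] - u[1] * w[0])

def lienard_power(q, v, a):
    """Total power radiated by a point charge, Eq. 11.73 (v and a are 3-tuples)."""
    gamma = 1.0 / math.sqrt(1.0 - dot(v, v) / C**2)
    v_cross_a = cross(v, a)
    bracket = dot(a, a) - dot(v_cross_a, v_cross_a) / C**2
    return MU0 * q**2 * gamma**6 / (6 * math.pi * C) * bracket

q = 1.0                 # assumed unit charge; it cancels in the ratios below
a = (1.0, 0.0, 0.0)     # assumed acceleration along x
beta = 0.6              # assumed speed, so gamma = 1.25

larmor = lienard_power(q, (0.0, 0.0, 0.0), a)
ratio_parallel = lienard_power(q, (beta * C, 0.0, 0.0), a) / larmor
ratio_perpendicular = lienard_power(q, (0.0, 0.0, beta * C), a) / larmor
print(ratio_parallel, ratio_perpendicular)
```

For β = 0.6 the collinear case radiates about 3.8 times the Larmor power and the perpendicular case about 2.4 times, for the same magnitude of acceleration.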
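Example 11.3. Suppose that $\mathbf{v}$ and $\mathbf{a}$ are instantaneously collinear (at
time $t$), as (for example) in straight-line motion. Find the angular
distribution of the radiation (Eq. 11.72) and the total power emitted.


Solution. In this case $(\mathbf{u}\times\mathbf{a}) = c(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{a})$, so

$$\frac{dP}{d\Omega} = \frac{q^2 c^2}{16\pi^2\epsilon_0}\frac{|\hat{\boldsymbol{\mathscr{r}}}\times(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{a})|^2}{(c - \hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{v})^5}.$$

Now

$$\hat{\boldsymbol{\mathscr{r}}}\times(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{a}) = (\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{a})\hat{\boldsymbol{\mathscr{r}}} - \mathbf{a}, \quad\text{so}\quad |\hat{\boldsymbol{\mathscr{r}}}\times(\hat{\boldsymbol{\mathscr{r}}}\times\mathbf{a})|^2 = a^2 - (\hat{\boldsymbol{\mathscr{r}}}\cdot\mathbf{a})^2.$$

In particular, if we let the z-axis point along $\mathbf{v}$, then

$$\frac{dP}{d\Omega} = \frac{\mu_0 q^2 a^2}{16\pi^2 c}\frac{\sin^2\theta}{(1 - \beta\cos\theta)^5}, \tag{11.74}$$

where $\beta \equiv v/c$. This is consistent, of course, with Eq. 11.69, in
the case $v = 0$. However, for very large $v$ ($\beta \approx 1$) the donut of
radiation (Fig. 11.11) is stretched out and pushed forward by the factor
$(1 - \beta\cos\theta)^{-5}$, as indicated in Fig. 11.13. Although there is still no
radiation in precisely the forward direction, most of it is concentrated
within an increasingly narrow cone about the forward direction (see
Prob. 11.15).
The total power emitted is found by integrating Eq. 11.74 over all
angles:

$$P = \int\frac{dP}{d\Omega}\,d\Omega = \frac{\mu_0 q^2 a^2}{16\pi^2 c}\int\frac{\sin^2\theta}{(1 - \beta\cos\theta)^5}\sin\theta\,d\theta\,d\phi.$$

The $\phi$ integral is $2\pi$; the $\theta$ integral is simplified by the substitution
$x \equiv \cos\theta$:

$$P = \frac{\mu_0 q^2 a^2}{8\pi c}\int_{-1}^{+1}\frac{(1 - x^2)}{(1 - \beta x)^5}\,dx.$$

Integration by parts yields $\frac{4}{3}(1 - \beta^2)^{-3}$, and I conclude that

$$P = \frac{\mu_0 q^2 a^2\gamma^6}{6\pi c}. \tag{11.75}$$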


This result is consistent with the Liénard formula (Eq. 11.73), for 
the case of collinear v and a. Notice that the angular distribution 
of the radiation is the same whether the particle is accelerating or 
decelerating; it only depends on the square of a, and is concentrated 
in the forward direction (with respect to the velocity) in either case. 
When a high speed electron hits a metal target it rapidly decelerates, 
giving off what is called bremsstrahlung, or “braking radiation.” What 
I have described in this example is essentially the classical theory of 
bremsstrahlung. 
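The integral quoted in Example 11.3 can be checked numerically. The sketch below estimates the integral of (1 − x²)/(1 − βx)⁵ from −1 to +1 with a midpoint rule and compares it with the closed form (4/3)(1 − β²)⁻³; the function name, the grid size, and the sample value β = 0.5 are arbitrary assumptions.

```python
def angular_factor(beta, n=100000):
    """Midpoint-rule estimate of the integral of (1 - x^2) / (1 - beta*x)^5
    over x in [-1, 1], the theta integral of Example 11.3 with x = cos(theta)."""
    dx = 2.0 / n
    total = 0.0
    for i in range(n):
        x = -1.0 + (i + 0.5) * dx
        total += (1.0 - x * x) / (1.0 - beta * x) ** 5 * dx
    return total

beta = 0.5  # assumed sample speed, v = c/2
numeric = angular_factor(beta)
closed_form = (4.0 / 3.0) * (1.0 - beta**2) ** -3
print(numeric, closed_form)
```

At β = 0 the integral reduces to 4/3, recovering the Larmor result; as β approaches 1 the integrand becomes sharply peaked near x = 1 and a finer grid would be needed.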


Problem 11.12. An electron is released from rest and falls under the influence of gravity. In the 
first centimeter, what fraction of the potential energy lost is radiated away? 
Problem 11.13. A positive charge q is fired head-on at a distant positive charge Q (which is
held stationary), with an initial velocity $v_0$. It comes in, decelerates to $v = 0$, and returns out
to infinity. What fraction of its initial energy ($\frac{1}{2}mv_0^2$) is radiated away? Assume $v_0 \ll c$, and
that you can safely ignore the effect of radiative losses on the motion of the particle. [Answer:
$(16/45)(q/Q)(v_0/c)^3$]


Problem 11.14. In Bohr’s theory of hydrogen, the electron in its ground state was supposed 
to travel in a circle of radius $5\times 10^{-11}$ m, held in orbit by the Coulomb attraction of the
proton. According to classical electrodynamics, this electron should radiate, and hence spiral in
to the nucleus. Show that $v \ll c$ for most of the trip (so you can use the Larmor formula), and
calculate the lifespan of Bohr’s atom. (Assume each revolution is essentially circular.) 


Fig. 11.13 


Fig. 11.14 Fig. 11.15 


Problem 11.15. Find the angle $\theta_{\max}$ at which the maximum radiation is emitted, in Ex. 11.3
(Fig. 11.13). Show that for ultrarelativistic speeds (v close to c),

$$\theta_{\max} \cong \sqrt{(1 - \beta)/2}.$$

What is the intensity of the radiation in this maximal direction (in the ultrarelativistic case),
in proportion to the same quantity for a particle instantaneously at rest? Give your answer in
terms of $\gamma$.


Problem 11.16. In Ex. 11.3 we assumed the velocity and acceleration were (instantaneously, at 
least) collinear. Carry out the same analysis for the case where they are perpendicular. Choose 
your axes so that $\mathbf{v}$ lies along the z-axis and $\mathbf{a}$ along the x-axis (Fig. 11.14), so that $\mathbf{v} = v\,\hat{\mathbf{z}}$,
$\mathbf{a} = a\,\hat{\mathbf{x}}$, and $\hat{\boldsymbol{\mathscr{r}}} = \sin\theta\cos\phi\,\hat{\mathbf{x}} + \sin\theta\sin\phi\,\hat{\mathbf{y}} + \cos\theta\,\hat{\mathbf{z}}$. Check that P is consistent
with the Liénard formula. [Answer:

$$\frac{dP}{d\Omega} = \frac{\mu_0 q^2 a^2}{16\pi^2 c}\frac{[(1 - \beta\cos\theta)^2 - (1 - \beta^2)\sin^2\theta\cos^2\phi]}{(1 - \beta\cos\theta)^5}, \qquad P = \frac{\mu_0 q^2 a^2\gamma^4}{6\pi c}.$$

For relativistic velocities ($\beta \approx 1$) the radiation is again sharply peaked in the forward direction
(Fig. 11.15). The most important application of these formulas is to circular motion; in this case
the radiation is called synchrotron radiation. For a relativistic electron, the radiation sweeps
around like a locomotive’s headlight as the particle moves.]


11.3 The Radiation Reaction 


When a charged particle accelerates, it radiates. This radiation carries off 
energy, which must come at the expense of some other form — kinetic, 
potential, or energy stored in the nonradiative fields of the particle. Under 
the influence of a given force, therefore, the acceleration of a charged 
particle is typically less than — or at any rate different from — that of a 
neutral particle of the same mass. Evidently, the fields exert a force ($\mathbf{F}_{\text{rad}}$)
back on the charge, a recoil force rather like that of a bullet on a gun. In
the next section I’ll derive this radiation reaction force from conservation of
energy. Then, in the following section, I’ll show you the actual mechanism
responsible, and derive the reaction force again in the context of a simple 
model. 


11.3.1 The Abraham—Lorentz Formula 


For a nonrelativistic particle ($v \ll c$), the total power radiated is given by
the Larmor formula (Eq. 11.70):

$$P = \frac{\mu_0 q^2 a^2}{6\pi c}. \tag{11.76}$$

Conservation of energy suggests that this is also the rate at which the
particle loses energy, under the influence of the radiation reaction force $\mathbf{F}_{\rm rad}$:

$$\mathbf{F}_{\rm rad}\cdot\mathbf{v} = -\frac{\mu_0 q^2 a^2}{6\pi c}. \tag{11.77}$$


I say “suggests” advisedly, because this equation is actually wrong. For 
we calculated the radiated power by integrating the Poynting vector over 
a sphere of “infinite” radius; in this calculation the velocity fields played 
no part, since they fall off too rapidly as a function of 𝓇 to make any
contribution. But these “local” fields do carry energy — they just don’t 
transport it out to infinity. As the particle accelerates and decelerates, energy 
is exchanged between it and the nearby fields, at the same time as energy 
is irretrievably radiated away by the acceleration fields. Equation 11.77
accounts only for the latter, but if we want to know the recoil force exerted
by the fields on the charge, we need to consider the total power lost at any
instant, not just the portion that eventually escapes in the form of radiation.
(The term “radiation reaction” is a misnomer. We should really call it the
field reaction. In fact, we’ll soon see that $\mathbf{F}_{\rm rad}$ is determined by the time
derivative of the acceleration and can be nonzero even when the acceleration
itself is instantaneously zero, and hence the particle is not radiating.)

The energy lost by the particle in any given time interval, then, must 
equal the energy carried away by the radiation plus whatever extra energy
has been pumped into the nearby fields.¹² However, if we agree to consider
only intervals over which the system returns to its initial state, then the 
energy in the velocity fields is the same at both ends, and the only net loss 
is in the form of radiation. Thus Eq. 11.77, while incorrect instantaneously, 
is valid on the average: 


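$$\int_{t_1}^{t_2} \mathbf{F}_{\rm rad}\cdot\mathbf{v}\,dt = -\frac{\mu_0 q^2}{6\pi c}\int_{t_1}^{t_2} a^2\,dt, \tag{11.78}$$

with the stipulation that the state of the system is identical at $t_1$ and $t_2$.
In the case of periodic motion, for instance, we must integrate over an
integral number of full cycles.¹³ Now, the right-hand side of Eq. 11.78 can
be integrated by parts:

$$\int_{t_1}^{t_2} a^2\,dt = \int_{t_1}^{t_2}\left(\frac{d\mathbf{v}}{dt}\right)\cdot\left(\frac{d\mathbf{v}}{dt}\right)dt = \left.\left(\mathbf{v}\cdot\frac{d\mathbf{v}}{dt}\right)\right|_{t_1}^{t_2} - \int_{t_1}^{t_2}\frac{d^2\mathbf{v}}{dt^2}\cdot\mathbf{v}\,dt.$$

The boundary term drops out, since the velocities and accelerations are
identical at $t_1$ and $t_2$, so Eq. 11.78 can be written equivalently as

$$\int_{t_1}^{t_2}\left(\mathbf{F}_{\rm rad} - \frac{\mu_0 q^2}{6\pi c}\,\dot{\mathbf{a}}\right)\cdot\mathbf{v}\,dt = 0. \tag{11.79}$$

Equation 11.79 will certainly be satisfied if

$$\mathbf{F}_{\rm rad} = \frac{\mu_0 q^2}{6\pi c}\,\dot{\mathbf{a}}. \tag{11.80}$$

This is the Abraham-Lorentz formula¹⁴ for the radiation reaction force.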
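
The difference between "valid on the average" and "valid instantaneously" is easy to see numerically. The following is only a sketch under stated assumptions: it takes one-dimensional simple harmonic motion $x = A\cos\omega t$ as the test case, works in arbitrary units with the constant $\mu_0 q^2/6\pi c$ set to 1 (the name `K` is mine), and uses a hand-rolled trapezoidal rule. It compares the two sides of Eq. 11.78 over one full cycle, and also at a single instant.

```python
import math

def integrate(f, t1, t2, n=20000):
    """Composite trapezoidal rule on [t1, t2] with n subintervals."""
    h = (t2 - t1) / n
    total = 0.5 * (f(t1) + f(t2))
    for i in range(1, n):
        total += f(t1 + i * h)
    return total * h

A, OMEGA = 1.0, 2.0   # amplitude and angular frequency (arbitrary units, assumed)
K = 1.0               # stands in for mu0 q^2 / (6 pi c)

def v(t):
    return -A * OMEGA * math.sin(OMEGA * t)

def a(t):
    return -A * OMEGA**2 * math.cos(OMEGA * t)

def adot(t):
    return A * OMEGA**3 * math.sin(OMEGA * t)

def reaction_power(t):
    """F_rad * v, with F_rad = K * adot (Eq. 11.80)."""
    return K * adot(t) * v(t)

def larmor_loss(t):
    """Minus the Larmor power, -K a^2 (right side of Eq. 11.77)."""
    return -K * a(t) ** 2

period = 2 * math.pi / OMEGA
lhs = integrate(reaction_power, 0.0, period)   # left side of Eq. 11.78
rhs = integrate(larmor_loss, 0.0, period)      # right side of Eq. 11.78

print("cycle integrals:", lhs, rhs)
print("at t = 0:", reaction_power(0.0), larmor_loss(0.0))
```

Over a full period the two integrals agree, while at t = 0 the reaction power vanishes even though the Larmor power is at its maximum, which is the sense in which Eq. 11.77 fails instant by instant.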


12 Actually, while the total field is the sum of velocity and acceleration fields, $\mathbf{E} = \mathbf{E}_v + \mathbf{E}_a$, the
energy is proportional to $E^2 = E_v^2 + 2\mathbf{E}_v\cdot\mathbf{E}_a + E_a^2$ and contains three terms: energy stored in
the velocity fields alone ($E_v^2$), energy radiated away ($E_a^2$), and a cross term $\mathbf{E}_v\cdot\mathbf{E}_a$. For the
sake of simplicity, I’m referring to the combination ($E_v^2 + 2\mathbf{E}_v\cdot\mathbf{E}_a$) as “energy stored in the
nearby fields.” These terms go like $1/𝓇^4$ and $1/𝓇^3$, respectively, so neither one contributes to
the radiation.
13 For nonperiodic motion the condition that the energy in the velocity fields be the same at
$t_1$ and $t_2$ is more difficult to achieve. It is not enough that the instantaneous velocities and
accelerations be equal, since the fields farther out depend on v and a at earlier times. In
principle, then, v and a and all higher derivatives must be identical at $t_1$ and $t_2$. In practice,
since the velocity fields fall off rapidly with 𝓇, it is sufficient that v and a be the same over
a brief interval prior to $t_1$ and $t_2$.
14 This formula was first obtained by Lorentz in 1892 using the self-force method
(Section 11.3.2); Planck pioneered the conservation-of-energy approach; Abraham
Of course, Eq. 11.79 doesn’t prove Eq. 11.80. It says nothing whatever
about the component of $\mathbf{F}_{\rm rad}$ perpendicular to v, and it only tells you the
time average of the parallel component (the average, moreover, over very
special time intervals). As we’ll see in the next section, there are other reasons
for believing in the Abraham-Lorentz formula, but for now, the best that
can be said is that it represents the simplest form the radiation reaction force
could take, consistent with conservation of energy.

The Abraham—Lorentz formula has disturbing implications, which are 
not entirely understood more than a century after the law was first pro- 
posed. For suppose a particle is subject to no external forces; then Newton’s 
second law says 


$$F_{\rm rad} = \frac{\mu_0 q^2}{6\pi c}\,\dot{a} = ma,$$

from which it follows that

$$a(t) = a_0\,e^{t/\tau}, \tag{11.81}$$

where

$$\tau \equiv \frac{\mu_0 q^2}{6\pi m c}. \tag{11.82}$$

(In the case of the electron, $\tau = 6\times10^{-24}$ s.) The acceleration spontaneously
increases exponentially with time! This absurd conclusion can be avoided
if we insist that $a_0 = 0$, but it turns out that the systematic exclusion of
such runaway solutions has an even more unpleasant consequence: if you
do apply an external force, the particle starts to respond before the force
acts! (See Prob. 11.19.) This acausal preacceleration jumps the gun by only
a short time $\tau$; nevertheless, it is (to my mind) unacceptable that the theory
should countenance it at all.¹⁵
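
To get a feeling for the numbers, here is a minimal sketch that evaluates Eq. 11.82 for the electron and the runaway growth of Eq. 11.81. The constants are rounded SI values typed in by hand, and the function names `tau` and `runaway` are my own, not anything defined in the text.

```python
import math

# Rounded SI constants (assumed values, entered by hand)
MU0 = 4e-7 * math.pi   # vacuum permeability, N/A^2
C = 2.998e8            # speed of light, m/s
Q_E = 1.602e-19        # elementary charge, C
M_E = 9.109e-31        # electron mass, kg

def tau(q, m):
    """Characteristic time of Eq. 11.82: tau = mu0 q^2 / (6 pi m c)."""
    return MU0 * q**2 / (6 * math.pi * m * C)

def runaway(a0, t, tau_):
    """Force-free runaway solution of Eq. 11.81: a(t) = a0 exp(t / tau)."""
    return a0 * math.exp(t / tau_)

tau_e = tau(Q_E, M_E)
growth = runaway(1.0, 10 * tau_e, tau_e)   # growth factor after ten tau

print(f"tau (electron) = {tau_e:.2e} s")
print(f"a(10 tau) / a0 = {growth:.3e}")
```

With these inputs the time constant comes out a little over $6\times10^{-24}$ s, consistent with the figure quoted above, and after only ten such intervals the acceleration has grown by a factor of $e^{10}$.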


Example 11.4. Calculate the radiation damping of a charged particle
attached to a spring of natural frequency $\omega_0$, driven at frequency $\omega$.


Solution. The equation of motion is

$$m\ddot{x} = F_{\rm spring} + F_{\rm rad} + F_{\rm driving} = -m\omega_0^2 x + m\tau\dddot{x} + F_{\rm driving}.$$

With the system oscillating at frequency $\omega$,

$$x(t) = x_0\cos(\omega t + \delta),$$

so

$$\dddot{x} = -\omega^2\dot{x}.$$


polished both arguments, and generalized them to higher velocity; Dirac (1938) published a
fully relativistic account. For the full history see K. T. McDonald,
http://kirkmcd.princeton.edu/examples/selfforce.pdf.
15 These difficulties persist in the relativistic version of the Abraham-Lorentz equation,
which can be derived by starting with Liénard’s formula instead of Larmor’s (Prob. 12.75).
Perhaps they are telling us that there can be no such thing as a point charge in classical
electrodynamics, or maybe they presage the onset of quantum mechanics. For guides to
the literature, see P. Pearle’s chapter in D. Teplitz, ed., Electromagnetism: Paths to Research,
Plenum, New York (1982) and F. Rohrlich, Am. J. Phys. 65, 1051 (1997).
Therefore

$$m\ddot{x} + m\gamma\dot{x} + m\omega_0^2 x = F_{\rm driving}, \tag{11.83}$$

and the damping factor $\gamma$ is given by

$$\gamma = \omega^2\tau. \tag{11.84}$$

[When I wrote $F_{\rm damping} = -\gamma m v$, back in Chapter 9 (Eq. 9.154), I
assumed for simplicity that the damping was proportional to the
velocity. We now know that radiation damping, at least, is proportional
to $\ddot{v}$. But it hardly matters: for sinusoidal oscillations any even number
of derivatives of v would do, since they’re all proportional to v.]
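
As a rough illustration of Eq. 11.84, the sketch below evaluates $\gamma = \omega^2\tau$ for an electron and checks the identity $\dddot{x} = -\omega^2\dot{x}$ for sinusoidal motion. The driving frequency ($3\times10^{15}$ rad/s, roughly optical) and the sample time, amplitude, and phase are hypothetical choices of mine; the constants are rounded SI values.

```python
import math

# Rounded SI constants (assumed values)
MU0 = 4e-7 * math.pi   # vacuum permeability, N/A^2
C = 2.998e8            # speed of light, m/s
Q_E = 1.602e-19        # elementary charge, C
M_E = 9.109e-31        # electron mass, kg

TAU = MU0 * Q_E**2 / (6 * math.pi * M_E * C)   # Eq. 11.82 for the electron

def damping_factor(omega, tau=TAU):
    """Radiation damping factor of Eq. 11.84: gamma = omega^2 * tau."""
    return omega**2 * tau

def xdot(t, x0, omega, delta):
    """First derivative of x = x0 cos(omega t + delta)."""
    return -x0 * omega * math.sin(omega * t + delta)

def xdddot(t, x0, omega, delta):
    """Third derivative of x = x0 cos(omega t + delta)."""
    return x0 * omega**3 * math.sin(omega * t + delta)

omega = 3.0e15                     # rad/s, hypothetical driving frequency
gamma = damping_factor(omega)
t, x0, delta = 0.3e-15, 1.0, 0.2   # arbitrary sample point

print(f"gamma = {gamma:.2e} 1/s, gamma/omega = {gamma / omega:.1e}")
print(xdddot(t, x0, omega, delta), -omega**2 * xdot(t, x0, omega, delta))
```

The ratio $\gamma/\omega$ is tiny for these inputs, so the oscillator is very lightly damped, which is the regime assumed in Prob. 11.18.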


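Example 11.5. An electric dipole consists of a negative charge (−q)
nailed down at the origin, and a positive charge (+q) held at rest on
the z-axis (z = d). At time t = 0 the positive charge (mass m) is released,
and accelerates toward the negative charge. Leaving aside the radiation
reaction (which is a small perturbation) the force on q is

$$\mathbf{F} = -\frac{1}{4\pi\epsilon_0}\frac{q^2}{z^2}\,\hat{\mathbf{z}},$$

so its acceleration is

$$\mathbf{a} = \frac{\mathbf{F}}{m} = -\frac{q^2}{4\pi\epsilon_0 m}\frac{1}{z^2}\,\hat{\mathbf{z}}.$$

Differentiating with respect to t,

$$\dot{\mathbf{a}} = \frac{q^2}{4\pi\epsilon_0 m}\left(\frac{2}{z^3}\frac{dz}{dt}\right)\hat{\mathbf{z}} = -\frac{q^2}{4\pi\epsilon_0 m}\frac{2v}{z^3}\,\hat{\mathbf{z}}$$

(note that dz/dt = −v, since the derivative is negative). The radiation
reaction (Eq. 11.80) is therefore

$$\mathbf{F}_{\rm rad} = \frac{\mu_0 q^2}{6\pi c}\,\dot{\mathbf{a}} = -\frac{\mu_0 q^4}{12\pi^2\epsilon_0 m}\left(\frac{v}{c}\right)\frac{1}{z^3}\,\hat{\mathbf{z}}.$$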


The details are of little interest — what’s remarkable is the sign: it’s 
in the same direction as F itself. This is not a brake, but an accelerator! 
At the same time that the system as a whole is losing energy in the 
form of radiation, q is actually gaining energy. (Of course, as it falls 
the particle gains kinetic energy and loses potential energy; absent the 
radiation reaction the total would be constant. We’re talking about 
the extra energy associated with the radiation reaction.) Where is it 
coming from? It’s coming from the fields, which are shedding energy in 
both directions.¹⁶ What does this tell us? Not much, I’m afraid, except
to emphasize that the name “radiation reaction” can be seriously 
misleading. 
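
To see how small this perturbation really is, one can compare the magnitudes of the two forces in Example 11.5. Dividing the radiation reaction by the Coulomb force gives $F_{\rm rad}/F = 2\tau v/z$, with $\tau$ as in Eq. 11.82. The sketch below checks that ratio numerically; the particle (an electron-mass charge e), the separation $z = 10^{-10}$ m, and the speed $v = 10^6$ m/s are hypothetical values chosen only for illustration.

```python
import math

# Rounded SI constants (assumed values)
MU0 = 4e-7 * math.pi          # vacuum permeability, N/A^2
C = 2.998e8                   # speed of light, m/s
EPS0 = 1.0 / (MU0 * C**2)     # vacuum permittivity, from mu0 eps0 c^2 = 1
Q_E = 1.602e-19               # elementary charge, C
M_E = 9.109e-31               # electron mass, kg

TAU = MU0 * Q_E**2 / (6 * math.pi * M_E * C)   # Eq. 11.82

def coulomb_force(q, z):
    """Magnitude of the attractive force on +q a distance z from -q."""
    return q**2 / (4 * math.pi * EPS0 * z**2)

def reaction_force(q, m, v, z):
    """Magnitude of F_rad from Example 11.5 (same direction as the Coulomb force)."""
    return MU0 * q**4 * v / (12 * math.pi**2 * EPS0 * m * C * z**3)

z = 1.0e-10   # m, hypothetical separation
v = 1.0e6     # m/s, hypothetical speed toward the origin

ratio = reaction_force(Q_E, M_E, v, z) / coulomb_force(Q_E, z)
print(f"F_rad / F = {ratio:.3e}, 2 tau v / z = {2 * TAU * v / z:.3e}")
```

For these inputs the ratio is of order $10^{-7}$, so treating the radiation reaction as a small correction to the Coulomb force is well justified.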


16 Several authors have noted that the Poynting vector close to a decaying electric dipole
points inward, whereas farther out it points outward (as it must, of course, to describe
radiation). See H. G. Schantz, Am. J. Phys. 63, 513 (1995), or K. T. McDonald,
http://kirkmcd.princeton.edu/examples/collapse.pdf.




Problem 11.17. 


(a) A particle of charge q moves in a circle of radius R at a constant speed v. To sustain the
motion, you must, of course, provide a centripetal force $mv^2/R$; what additional force ($\mathbf{F}_e$)
must you exert, in order to counteract the radiation reaction? [It’s easiest to express the
answer in terms of the instantaneous velocity $\mathbf{v}$.] What power ($P_e$) does this extra force
deliver? Compare $P_e$ with the power radiated (use the Larmor formula).

(b) Repeat part (a) for a particle in simple harmonic motion with amplitude A and angular
frequency $\omega$: $\mathbf{w}(t) = A\cos(\omega t)\,\hat{\mathbf{z}}$. Explain the discrepancy.

(c) Consider the case of a particle in free fall (constant acceleration g). What is the radiation 
reaction force? What is the power radiated? Comment on these results. 


Problem 11.18. A point charge q, of mass m, is attached to a spring of constant k. At time
t = 0 it is given a kick, so its initial energy is $U_0 = \tfrac{1}{2}mv_0^2$. Now it oscillates, gradually radiating
away this energy.


(a) Confirm that the total energy radiated is equal to $U_0$. Assume the radiation damping is
small, so you can write the equation of motion as

$$\ddot{x} + \gamma\dot{x} + \omega_0^2 x = 0,$$

and the solution as

$$x(t) = \frac{v_0}{\omega_0}\,e^{-\gamma t/2}\sin(\omega_0 t),$$

with $\omega_0 \equiv \sqrt{k/m}$, $\gamma = \omega_0^2\tau$, and $\gamma \ll \omega_0$ (drop $\gamma^2$ in comparison to $\omega_0^2$, and when
you average over a complete cycle, ignore the change in $e^{-\gamma t}$).

(b) Suppose now we have two such oscillators, and we start them off with identical kicks.
Regardless of their relative positions and orientations, the total energy radiated is $2U_0$.
But what if they are right on top of each other, so it’s equivalent to a single oscillator with
twice the charge; the Larmor formula says that the power radiated is four times as great,
suggesting that the total will be $4U_0$. Find the error in this reasoning, and show that the
total is actually $2U_0$, as it should be.¹⁷


! Problem 11.19. With the inclusion of the radiation reaction force (Eq. 11.80), Newton's second
law for a charged particle becomes

$$a = \tau\dot{a} + \frac{F}{m},$$

where F is the external force acting on the particle.


(a) In contrast to the case of an uncharged particle (a = F/m), acceleration (like position and
velocity) must now be a continuous function of time, even if the force changes abruptly.
(Physically, the radiation reaction damps out any rapid change in a.) Prove that a is


17 For a more sophisticated version of this paradox, see P. R. Berman, Am. J. Phys. 78, 1323
(2010).
continuous at any time t, by integrating the equation of motion from $(t-\epsilon)$ to $(t+\epsilon)$
and taking the limit $\epsilon \to 0$.

(b) A particle is subjected to a constant force F, beginning at time t = 0 and lasting until time
T. Find the most general solution a(t) to the equation of motion in each of the following
three periods: (i) t < 0, (ii) 0 < t < T, (iii) t > T.

(c) Impose the continuity condition (a) at t = 0 and t = T. Show that you can either eliminate
the runaway in region (iii) or avoid preacceleration in region (i), but not both.

(d) If you choose to eliminate the runaway, what is the acceleration as a function of time, in
each interval? How about the velocity? (The latter must, of course, be continuous at t = 0
and t = T.) Assume the particle was originally at rest: $v(-\infty) = 0$.

(e) Plot a(t) and v(t), both for an uncharged particle and for a (nonrunaway) charged particle,
subject to this force.


11.3.2 The Self-Force on a Charged Particle 


In the last section, I derived the Abraham—Lorentz formula for the radia- 
tion reaction, using conservation of energy. I made no attempt to identify 
the actual mechanism responsible for this force, except to point out that 
it must be a recoil effect of the particle’s own fields acting back on the 
charge. Unfortunately, the fields of a point charge blow up right at the 
particle, so it’s hard to see how one might calculate the force they exert.¹⁸
Let’s avoid this problem by considering an extended charge distribution, 
for which the field is finite everywhere; at the end, we’ll take the limit as the 
size of the charge goes to zero. In general, as I noted in Section 8.2.1, the 
electromagnetic force of one part (A) on another part (B) is not equal and 
opposite to the force of B on A (Fig. 11.16). If the distribution is divided 
up into infinitesimal chunks, and the imbalances are added up for all such 
pairs, the result is a net force of the charge on itself. It is this self-force, 
resulting from the breakdown of Newton’s third law within the structure 
of the particle, that accounts for the radiation reaction. 

Lorentz originally calculated the electromagnetic self-force using a 
spherical charge distribution, which seems reasonable but makes the 
mathematics rather cumbersome.¹⁹ Because I am only trying to elucidate
the mechanism involved, I shall use a less realistic model: a “dumbbell” 
in which the total charge q is divided into two halves separated by a 
fixed distance d (Fig. 11.17). This is the simplest possible arrangement 
of the charge that permits the essential mechanism (imbalance of internal 
electromagnetic forces) to function. Never mind that it’s an unlikely model 
for an elementary particle: in the point limit ($d \to 0$) any model must yield


18 It can be done by a suitable averaging of the field, but it’s not easy. See T. H. Boyer, Am. J.
Phys. 40, 1843 (1972), and references cited there. 

19 See J. D. Jackson, Classical Electrodynamics, 3rd ed., John Wiley, New York (1999), 
Sect. 16.3.

[Fig. 11.17: the dumbbell at its retarded position $x(t_r)$ and its present position $x(t)$]


the Abraham—Lorentz formula, to the extent that conservation of energy 
alone dictates that answer. 

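Let’s assume the dumbbell moves in the x direction, and is (instantaneously)
at rest at the retarded time. The electric field at (1) due to (2) is

$$\mathbf{E}_1 = \frac{(q/2)}{4\pi\epsilon_0}\,\frac{𝓇}{(\boldsymbol{𝓇}\cdot\mathbf{u})^3}\left[(c^2 + \boldsymbol{𝓇}\cdot\mathbf{a})\,\mathbf{u} - (\boldsymbol{𝓇}\cdot\mathbf{u})\,\mathbf{a}\right] \tag{11.85}$$

(Eq. 10.72), where

$$\mathbf{u} = c\,\hat{\boldsymbol{𝓇}} \quad\text{and}\quad \boldsymbol{𝓇} = l\,\hat{\mathbf{x}} + d\,\hat{\mathbf{y}}, \tag{11.86}$$

so

$$\boldsymbol{𝓇}\cdot\mathbf{u} = c\,𝓇, \quad \boldsymbol{𝓇}\cdot\mathbf{a} = la, \quad\text{and}\quad 𝓇 = \sqrt{l^2 + d^2}. \tag{11.87}$$

Actually, we’re only interested in the x component of $\mathbf{E}_1$, since the
y components will cancel when we add the forces on the two ends (for the
same reason, we don’t need to worry about the magnetic forces). Now

$$u_x = \frac{cl}{𝓇}, \tag{11.88}$$

and hence

$$E_{1x} = \frac{q}{8\pi\epsilon_0 c^2}\,\frac{(lc^2 - ad^2)}{(l^2 + d^2)^{3/2}}. \tag{11.89}$$

By symmetry, $E_{2x} = E_{1x}$, so the net force on the dumbbell is

$$\mathbf{F}_{\rm self} = \frac{q}{2}(\mathbf{E}_1 + \mathbf{E}_2) = \frac{q^2}{8\pi\epsilon_0 c^2}\,\frac{(lc^2 - ad^2)}{(l^2 + d^2)^{3/2}}\,\hat{\mathbf{x}}. \tag{11.90}$$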

So far everything is exact. The idea now is to expand in powers of d; 
when the size of the particle goes to zero, all positive powers will disappear. 
Using Taylor’s theorem, 


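$$x(t) = x(t_r) + \dot{x}(t_r)(t - t_r) + \frac{1}{2}\ddot{x}(t_r)(t - t_r)^2 + \frac{1}{3!}\dddot{x}(t_r)(t - t_r)^3 + \cdots,$$

we have

$$l = x(t) - x(t_r) = \frac{1}{2}aT^2 + \frac{1}{6}\dot{a}T^3 + \cdots, \tag{11.91}$$

where $T \equiv t - t_r$, for short. Now T is determined by the retarded time
condition

$$(cT)^2 = l^2 + d^2, \tag{11.92}$$

so

$$d = \sqrt{(cT)^2 - l^2} = cT\sqrt{1 - \left(\frac{aT}{2c} + \frac{\dot{a}T^2}{6c} + \cdots\right)^2} = cT - \frac{a^2}{8c}T^3 + (\;)T^4 + \cdots.$$

This equation tells us d, in terms of T; we need to “solve” it for T as
a function of d. There’s a systematic procedure for doing this, known as
reversion of series,²⁰ but we can get the first couple of terms more informally
as follows: ignoring all higher powers of T,

$$d \cong cT \;\Rightarrow\; T \cong \frac{d}{c};$$

using this as an approximation for the cubic term,

$$d \cong cT - \frac{a^2}{8c}\frac{d^3}{c^3} \;\Rightarrow\; T \cong \frac{d}{c} + \frac{a^2 d^3}{8c^5},$$

and so on. Thus

$$T = \frac{1}{c}\,d + \frac{a^2}{8c^5}\,d^3 + (\;)d^4 + \cdots. \tag{11.93}$$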
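
The informal reversion can be checked numerically. The sketch below is an illustration under simplifying assumptions of my own: it sets $\dot{a} = 0$, so that $l = \tfrac{1}{2}aT^2$ exactly and the retarded-time condition (Eq. 11.92) becomes a quadratic in $T^2$ with a closed-form root, and it works in units with c = 1. It then compares that root with the first two terms of Eq. 11.93.

```python
import math

def exact_T(d, a, c=1.0):
    """Smaller positive root of (cT)^2 = l^2 + d^2 with l = a T^2 / 2 (adot = 0 assumed)."""
    # Quadratic in T^2: (a^2 / 4) T^4 - c^2 T^2 + d^2 = 0
    disc = c**4 - a**2 * d**2
    t_squared = 2.0 * (c**2 - math.sqrt(disc)) / a**2
    return math.sqrt(t_squared)

def series_T(d, a, c=1.0):
    """First two terms of Eq. 11.93: T = d/c + a^2 d^3 / (8 c^5)."""
    return d / c + a**2 * d**3 / (8 * c**5)

a = 1.0   # acceleration in units with c = 1 (arbitrary choice)
err_big = abs(exact_T(0.10, a) - series_T(0.10, a))
err_small = abs(exact_T(0.05, a) - series_T(0.05, a))

print("error at d = 0.10:", err_big)
print("error at d = 0.05:", err_small)
print("ratio:", err_big / err_small)
```

Halving d shrinks the discrepancy by a factor of roughly $2^5$, which is what one expects if the first neglected term in this special case is of order $d^5$ (with $\dot{a} = 0$ only odd powers of d appear).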


Returning to Eq. 11.91, we construct the power series for l in terms of d:

$$l = \frac{a}{2c^2}\,d^2 + \frac{\dot{a}}{6c^3}\,d^3 + (\;)d^4 + \cdots. \tag{11.94}$$

Putting this into Eq. 11.90, I conclude that

$$\mathbf{F}_{\rm self} = \frac{q^2}{4\pi\epsilon_0}\left[-\frac{a}{4c^2 d} + \frac{\dot{a}}{12c^3} + (\;)d + \cdots\right]\hat{\mathbf{x}}. \tag{11.95}$$

Here a and $\dot{a}$ are evaluated at the retarded time ($t_r$), but it’s easy to rewrite
the result in terms of the present time t:

$$a(t_r) = a(t) + \dot{a}(t)(t_r - t) + \cdots = a(t) - \dot{a}(t)\,T + \cdots = a(t) - \dot{a}(t)\frac{d}{c} + \cdots,$$

and it follows that

$$\mathbf{F}_{\rm self} = \frac{q^2}{4\pi\epsilon_0}\left[-\frac{a(t)}{4c^2 d} + \frac{\dot{a}(t)}{3c^3} + (\;)d + \cdots\right]\hat{\mathbf{x}}. \tag{11.96}$$

The first term on the right is proportional to the acceleration of the
charge; if we pull it over to the other side of Newton’s second law, it simply
adds to the dumbbell’s mass. In effect, the total inertia of the charged
dumbbell is

$$m = 2m_0 + \frac{1}{4\pi\epsilon_0}\frac{q^2}{4dc^2}, \tag{11.97}$$


20 See, for example, the CRC Standard Mathematical Tables and Formulas, 32nd ed., CRC 
Press, Boca Raton (2011).
where $m_0$ is the mass of either end alone. In the context of special relativity,
it is not surprising that the electrical repulsion of the charges should
enhance the mass of the dumbbell. The potential energy of this configuration
(in the static case) is

$$\frac{1}{4\pi\epsilon_0}\frac{(q/2)^2}{d}, \tag{11.98}$$

and according to Einstein’s formula $E = mc^2$, this energy contributes to the
inertia of the object.²¹

The second term in Eq. 11.96 is the radiation reaction:

$$F_{\rm rad}^{\rm int} = \frac{\mu_0 q^2\dot{a}}{12\pi c}. \tag{11.99}$$

It alone (apart from the mass correction²²) survives in the “point dumbbell”
limit $d \to 0$. Unfortunately, it differs from the Abraham-Lorentz formula
by a factor of 2. But then, this is only the self-force associated with the
interaction between 1 and 2 (hence the superscript “int”). There remains
the force of each end on itself. When the latter is included (see Prob. 11.20),
the result is

$$F_{\rm rad} = \frac{\mu_0 q^2\dot{a}}{6\pi c}, \tag{11.100}$$


reproducing the Abraham—Lorentz formula exactly. Conclusion: The radia- 
tion reaction is due to the force of the charge on itself — or, more elaborately, 
the net force exerted by the fields generated by different parts of the charge 
distribution acting on one another. 


Problem 11.20. Deduce Eq. 11.100 from Eq. 11.99. Here are three methods: 


(a) Use the Abraham—Lorentz formula to determine the radiation reaction on each end of the 
dumbbell; add this to the interaction term (Eq. 11.99). 

(b) Method (a) has the defect that it uses the Abraham-Lorentz formula, the very thing that
we were trying to derive. To avoid this, let F(q) be the total d-independent part of the
self-force on a charge q. Then

$$F(q) = F^{\rm int}(q) + 2F(q/2),$$


21 The fact that the numbers work out perfectly is a lucky feature of this configuration. If you
do the same calculation for the dumbbell in longitudinal motion, the mass correction is only
half of what it “should” be (there’s a 2, instead of a 4, in Eq. 11.97), and for a sphere it’s
off by a factor of 3/4. This notorious paradox has been the subject of much debate over
the years. See D. J. Griffiths and R. E. Owen, Am. J. Phys. 51, 1120 (1983).

22 Of course, the limit $d \to 0$ has an embarrassing effect on the mass term. In a sense,
it doesn’t matter, since only the total mass m is observable; maybe $m_0$ somehow has a
compensating (negative!) infinity, so that m comes out finite. This awkward problem persists
in quantum electrodynamics, where it is “swept under the rug” in a procedure known as
mass renormalization.
where $F^{\rm int}$ is the interaction part (Eq. 11.99), and F(q/2) is the self-force on each end.
Now, F(q) must be proportional to $q^2$, since the field is proportional to q and the force is
qE. So F(q/2) = (1/4) F(q). Take it from there.

(c) Smear out the charge along a strip of length L oriented perpendicular to the motion
(the charge density, then, is $\lambda = q/L$); find the cumulative interaction force for all pairs
of segments, using Eq. 11.99 (with the correspondence $q/2 \to \lambda\,dy_1$ at one end and
$q/2 \to \lambda\,dy_2$ at the other). Make sure you don’t count the same pair twice.


! Problem 11.21.²³ An electric dipole rotates at constant angular velocity $\omega$ in the xy plane.
(The charges, $\pm q$, are at $\mathbf{r}_\pm = \pm R[\cos(\omega t)\,\hat{\mathbf{x}} + \sin(\omega t)\,\hat{\mathbf{y}}]$; the magnitude of the dipole
moment is p = 2qR.)


(a) Find the interaction term in the self-torque (analogous to Eq. 11.99). Assume the motion is
nonrelativistic ($\omega R \ll c$).

(b) Use the method of Prob. 11.20a to obtain the total radiation reaction torque on this system.
[Answer: $-(\mu_0 p^2\omega^3/6\pi c)\,\hat{\mathbf{z}}$.]

(c) Check that this result is consistent with the power radiated (Eq. 11.60).


More Problems on Chapter 11 


Problem 11.22. A particle of mass m and charge q is attached to a spring with force constant
k, hanging from the ceiling (Fig. 11.18). Its equilibrium position is a distance h above the floor.
It is pulled down a distance d below equilibrium and released, at time t = 0.


(a) Under the usual assumptions ($d \ll \lambda \ll h$), calculate the intensity of the radiation
hitting the floor, as a function of the distance R from the point directly below q.
[Note: The intensity here is the average power per unit area of the floor.] At what R
is the radiation most intense? Neglect the radiative damping of the oscillator. [Answer:
$\mu_0 q^2 d^2\omega^4 R^2 h/32\pi^2 c\,(R^2 + h^2)^{5/2}$.]


Fig. 11.18


23 For related problems, see D. R. Stump and G. L. Pollack, Am. J. Phys. 65, 81 (1997); D.
Griffiths and E. Szeto, Am. J. Phys. 46, 244 (1978).
(b) As a check on your formula, assume the floor is of infinite extent, and calculate the average
energy per unit time striking the entire floor. Is it what you'd expect?

(c) Because it is losing energy in the form of radiation, the amplitude of the oscillation will
gradually decrease. After what time $\tau$ has the amplitude been reduced to d/e? (Assume
the fraction of the total energy lost in one cycle is very small.)


Problem 11.23. A radio tower rises to height h above flat horizontal ground. At the top is a
magnetic dipole antenna, of radius b, with its axis vertical. FM station KRUD broadcasts from this
antenna at (angular) frequency $\omega$, with a total radiated power P (that’s averaged, of course,
over a full cycle). Neighbors have complained about problems they attribute to excessive radia- 
tion from the tower — interference with their stereo systems, mechanical garage doors opening 
and closing mysteriously, and a variety of suspicious medical problems. But the city engineer 
who measured the radiation level at the base of the tower found it to be well below the accepted 
standard. You have been hired by the Neighborhood Association to assess the engineer's report. 


(a) In terms of the variables given (not all of which may be relevant), find the formula for the 
intensity of the radiation at ground level, a distance R from the base of the tower. You 
may assume that $b \ll c/\omega \ll h$. [Note: We are interested only in the magnitude of the
radiation, not in its direction — when measurements are taken, the detector will be aimed 
directly at the antenna.] 

(b) How far from the base of the tower should the engineer have made the measurement? 
What is the formula for the intensity at this location? 

(c) KRUD’s actual power output is 35 kW, its frequency is 90 MHz, the antenna’s
radius is 6 cm, and the height of the tower is 200 m. The city’s radio-emission limit is
200 μW/cm². Is KRUD in compliance?


[Fig. 11.19: two oppositely oriented dipoles, $+p_0\cos\omega t$ and $-p_0\cos\omega t$, separated by a distance d]


Problem 11.24. As a model for electric quadrupole radiation, consider two oppositely oriented 
oscillating electric dipoles, separated by a distance d, as shown in Fig. 11.19. Use the results of 
Section 11.1.2 for the potentials of each dipole, but note that they are not located at the origin. 
Keeping only the terms of first order in d: 


(a) find the scalar and vector potentials; 
(b) find the electric and magnetic fields;
(c) find the Poynting vector and the power radiated, then sketch the intensity profile as a
function of $\theta$.


Problem 11.25. The geographic poles of the Earth are defined by its axis of rotation; the 
magnetic poles are off (from this axis) by about 11° (and they wander a bit). The north pole of 
a magnetic compass points (roughly) northward, so the Earth's magnetic south pole is actually 
pretty close to its geographic north pole. (Why is this so complicated?) Relative to the fixed axis 
of rotation, therefore, the magnetic dipole moment of the Earth is changing with time, and the 
Earth must be giving off magnetic dipole radiation. 


(a) Find the formula for the total power radiated, in terms of the following parameters: $\Psi$
(the angle between the geographic and magnetic poles), M (the magnitude of the Earth's
magnetic dipole moment), and $\omega$ (the angular velocity of rotation of the Earth). [Hint:
Refer to Prob. 11.4 or Prob. 11.11.]

(b) Using the fact that the Earth's magnetic field is about half a gauss at the equator, estimate
the magnetic dipole moment M of the Earth.

(c) Find the power radiated. [Answer: $4\times10^{-5}$ W.]

(d) Pulsars are thought to be rotating neutron stars, with a typical radius of 10 km, a
rotational period of $10^{-3}$ s, and a surface magnetic field of $10^{8}$ T. What sort of radiated
power would you expect from such a star?²⁴ [Answer: $2\times10^{36}$ W.]


Problem 11.26. An ideal electric dipole is situated at the origin; its dipole moment points in
the z direction, and is quadratic in time:

$$\mathbf{p}(t) = \tfrac{1}{2}\ddot{p}_0\,t^2\,\hat{\mathbf{z}} \qquad (-\infty < t < \infty),$$

where $\ddot{p}_0$ is a constant.


(a) Use the method of Section 11.1.2 to determine the (exact) electric and magnetic fields, for all
r > 0 (there's also a delta-function term at the origin, but we're not concerned with that).
[Partial answer: $V = (\mu_0\ddot{p}_0/8\pi)\cos\theta\,[(ct/r)^2 - 1]$, $\mathbf{A} = (\mu_0\ddot{p}_0/4\pi c)[(ct/r) - 1]\,\hat{\mathbf{z}}$.]

(b) Calculate the power, P(r, t), passing through a sphere of radius r.
[Answer: $(\ddot{p}_0^{\,2}/12\pi\epsilon_0 r^3)\,t\,[t^2 + (r/c)^2]$.]

(c) Find the total power radiated (Eq. 11.2), and check that your answer is consistent with
Eq. 11.60.²⁵


Problem 11.27. In Section 11.2.1 we calculated the energy per unit time radiated by a (nonrel- 
ativistic) point charge — the Larmor formula. 


24 J. P. Ostriker and J. E. Gunn, Astrophys. J. 157, 1395 (1969).
25 Notice that B(r, t) goes like $1/r^2$, and one might therefore assume that this configuration
does not radiate. However, it is not B(r, t) we require (for Eq. 11.2), but rather $B(r, t_0 + r/c)$
(we track the fields as they propagate out to infinity), and $B(r, t_0 + r/c)$ has a term that
goes like 1/r.
(a) In the same spirit, calculate the momentum per unit time radiated. [Answer:
$(\mu_0 q^2/6\pi c^3)\,a^2\,\mathbf{v}$.]

(b) Similarly, calculate the angular momentum per unit time radiated. [Answer:
$(\mu_0 q^2/6\pi c)\,(\mathbf{v}\times\mathbf{a})$.]


Problem 11.28. Suppose the (electrically neutral) yz plane carries a time-dependent but
uniform surface current $K(t)\,\hat{\mathbf{z}}$.


(a) Find the electric and magnetic fields at a height x above the plane if
(i) a constant current is turned on at t = 0:

$$K(t) = \begin{cases} 0, & t \le 0, \\ K_0, & t > 0. \end{cases}$$

(ii) a linearly increasing current is turned on at t = 0:

$$K(t) = \begin{cases} 0, & t \le 0, \\ \alpha t, & t > 0. \end{cases}$$

(b) Show that the retarded vector potential can be written in the form

$$\mathbf{A}(x, t) = \frac{\mu_0 c}{2}\,\hat{\mathbf{z}}\int_0^\infty K\!\left(t - \frac{x}{c} - u\right)du,$$

and from this determine E and B.

(c) Show that the total power radiated per unit area of surface is

$$\frac{\mu_0 c}{2}\,[K(t)]^2.$$

Explain what you mean by “radiation,” in this case, given that the source is not
localized.²⁶


Problem 11.29. Use the duality transformation (Prob. 7.68) to construct the electric and
magnetic fields of a magnetic monopole $q_m$ in arbitrary motion, and find the “Larmor formula” for
the power radiated.²⁷


Problem 11.30. Assuming you exclude the runaway solution in Prob. 11.19, calculate:


(a) the work done by the external force;
(b) the final kinetic energy (assume the initial kinetic energy was zero);
(c) the total energy radiated.
(d) Check that energy is conserved in this process.²⁸


26 For discussion and related problems, see B. R. Holstein, Am. J. Phys. 63, 217 (1995); T. A. 
Abbott and D. J. Griffiths, Am. J. Phys. 53, 1203 (1985). 

27 For related applications, see J. A. Heras, Am. J. Phys. 63, 242 (1995). 

28 Problems 11.30 and 11.31 were suggested by G. L. Pollack.

Problem 11.31.


(a) Repeat Prob. 11.19, but this time let the external force be a Dirac delta function: $F(t) = k\,\delta(t)$
(for some constant k).²⁹ [Note that the acceleration is now discontinuous at t = 0
(though the velocity must still be continuous); use the method of Prob. 11.19a to show that
$\Delta a = -k/m\tau$. In this problem there are only two intervals to consider: (i) t < 0 and (ii)
t > 0.]

(b) As in Prob. 11.30, check that energy is conserved in this process.


Problem 11.32. A charged particle, traveling in from $-\infty$ along the x-axis, encounters a
rectangular potential energy barrier

$$U(x) = \begin{cases} U_0, & \text{if } 0 < x < L, \\ 0, & \text{otherwise.} \end{cases}$$

Show that, because of the radiation reaction, it is possible for the particle to tunnel through
the barrier; that is, even if the incident kinetic energy is less than $U_0$, the particle can pass
through.³⁰ [Hint: Your task is to solve the equation

$$a = \tau\dot{a} + \frac{F}{m},$$

subject to the force

$$F(x) = U_0\left[-\delta(x) + \delta(x - L)\right].$$

Refer to Probs. 11.19 and 11.31, but notice that this time the force is a specified function of x, not
t. There are three regions to consider: (i) x < 0, (ii) 0 < x < L, (iii) x > L. Find the
general solution for a(t), v(t), and x(t) in each region, exclude the runaway in region (iii),
and impose the appropriate boundary conditions at x = 0 and x = L. Show that the final
velocity ($v_f$) is related to the time T spent traversing the barrier by the equation

$$L = v_f T - \frac{U_0}{m v_f}\left(\tau e^{-T/\tau} + T - \tau\right),$$

and the initial velocity (at $x = -\infty$) is

$$v_i = v_f - \frac{U_0}{m v_f}\left[1 - \frac{1}{1 + \dfrac{U_0}{m v_f^2}\left(e^{-T/\tau} - 1\right)}\right].$$

To simplify these results (since all we're looking for is a specific example), suppose the final
kinetic energy is half the barrier height. Show that in this case

$$v_i = \frac{v_f}{1 - (L/v_f\tau)}.$$

In particular, if you choose $L = v_f\tau/4$, then $v_i = (4/3)v_f$, the initial kinetic energy is
$(8/9)U_0$, and the particle makes it through, even though it didn't have sufficient energy to
get over the barrier!]


29 This example was first analyzed by P. A. M. Dirac, Proc. R. Soc. A167, 148 (1938).
30 F. Denef et al., Phys. Rev. E 56, 3624 (1997).

Problem 11.33.


(a) Find the radiation reaction force on a particle moving with arbitrary velocity in a straight
line, by reconstructing the argument in Section 11.3.2 without assuming $v(t_r) = 0$.
[Answer: $(\mu_0 q^2\gamma^4/6\pi c)(\dot{a} + 3\gamma^2 a^2 v/c^2)$.]

(b) Show that this result is consistent (in the sense of Eq. 11.78) with the power radiated by such
a particle (Eq. 11.75).


Problem 11.34. 


(a) Does a particle in hyperbolic motion (Eq. 10.52) radiate? (Use the exact formula (Eq. 11.75) 
to calculate the power radiated.) 

(b) Does a particle in hyperbolic motion experience a radiation reaction? (Use the exact formula 
(Prob. 11.33) to determine the reaction force.) 


[Comment: These famous questions carry important implications for the principle of
equivalence.³¹]


Problem 11.35. Use the result of Prob. 10.35 to determine the power radiated by an ideal 
electric dipole, p(t), at the origin. Check that your answer is consistent with Eq. 11.22, in the case
of sinusoidal time dependence, and with Prob. 11.26, in the case of quadratic time dependence. 


Problem 11.36. An infinite plane (might as well make it the yz plane) is electrically neutral 
but carries a sinusoidal surface current 


$$\mathbf{K}(t) = K_0 \cos(\omega t)\,\hat{\mathbf{z}}.$$


(a) By whatever means you can think of (short of plagiarism) deduce the electric and magnetic 
fields it produces. (If you use guesswork or divine inspiration, be sure to check that your 
results satisfy Maxwell's equations and the appropriate boundary conditions.) 

(b) Construct the Poynting vector and its time average. By integrating the latter over a “pillbox” 
straddling the current, determine the (average) power radiated, per unit area. 

(c) Check that your answer to (b) is consistent with the power delivered to the surface by 
whoever is driving the current. 


31 T. Fulton and F. Rohrlich, Ann. Phys. 9, 499 (1960); J. Cohn, Am. J. Phys. 46, 225 (1978);
R. Peierls, Surprises in Theoretical Physics, Princeton University Press, Princeton, NJ
(1979), Chap. 8; the article by P. Pearle in D. Teplitz, ed., Electromagnetism: Paths to
Research, Plenum Press, New York (1982); C. de Almeida and A. Saa, Am. J. Phys. 74,
154 (2006).


Electrodynamics and Relativity 


12.1 The Special Theory of Relativity 


12.1.1 Einstein's Postulates 


Classical mechanics obeys the principle of relativity: the same laws apply in 
any inertial reference frame. By “inertial” I mean that the system is at rest or 
moving with constant velocity.¹ Imagine, for example, that you have loaded
a billiard table onto a railroad car, and the train is going at constant speed 
down a smooth straight track. The game will proceed exactly the same as 
it would if the train were parked in the station; you don’t have to “correct” 
your shots for the fact that the train is moving — indeed, if you pulled all the 
curtains, you would have no way of knowing whether the train was moving 
or not. Notice by contrast that you know immediately if the train speeds 
up, or slows down, or rounds a corner, or goes over a bump — the billiard 
balls roll in weird curved trajectories, and you yourself feel a lurch and spill 
coffee on your shirt. The laws of mechanics, then, are certainly not the same 
in accelerating reference frames. 

In its application to classical mechanics, the principle of relativity is 
hardly new; it was stated clearly by Galileo. Question: Does it also apply 
to the laws of electrodynamics? At first glance, the answer would seem to 
be no. After all, a charge in motion produces a magnetic field, whereas 
a charge at rest does not. A charge carried along by the train would 
generate a magnetic field, but someone on the train, applying the laws of 
electrodynamics in that system, would predict no magnetic field. In fact, 
many of the equations of electrodynamics, starting with the Lorentz force 
law, make explicit reference to “the” velocity of the charge. It certainly 
appears, therefore, that electromagnetic theory presupposes the existence 
of a unique stationary reference frame, with respect to which all velocities 
are to be measured. 


1 This raises an awkward problem: if the laws of physics hold just as well in a uniformly
moving frame, then we have no way of identifying the “rest” frame in the first place, and 
hence no way of checking that some other frame is moving at constant velocity. To avoid 
this trap, we define an inertial frame formally as one in which Newton's first law holds. If you 
want to know whether you're in an inertial frame, throw some rocks around — if they travel in
straight lines at constant speed, you’ve got yourself an inertial frame, and any frame moving 
at constant velocity with respect to you will be another inertial frame (see Prob. 12.1). 
[Fig. 12.1. Label: wire loop.]


And yet there is an extraordinary coincidence that caught Einstein’s eye. 
Suppose we mount a wire loop on a freight car, and have the train pass 
between the poles of a giant magnet (Fig. 12.1). As the loop rides through 
the magnetic field, a motional emf is established; according to the flux rule 
(Eq. 7.13),

$$\mathcal{E} = -\frac{d\Phi}{dt}.$$


This emf, remember, is due to the magnetic force on charges in the wire loop, 
which are moving along with the train. On the other hand, if someone on 
the train naively applied the laws of electrodynamics in that system, what 
would the prediction be? No magnetic force, because the loop is at rest. But 
as the magnet flies by, the magnetic field in the freight car changes, and 
a changing magnetic field induces an electric field, by Faraday’s law. The 
resulting electric force would generate an emf in the loop given by Eq. 7.23:

$$\mathcal{E} = \oint \mathbf{E}\cdot d\mathbf{l} = -\frac{d\Phi}{dt}.$$


Because Faraday’s law and the flux rule predict exactly the same emf, people 
on the train will get the right answer, even though their physical interpretation 
of the process is completely wrong! 

Or is it? Einstein could not believe this was a mere coincidence; he took 
it, rather, as a clue that electromagnetic phenomena, like mechanical ones, 
obey the principle of relativity. In his view, the analysis by the observer 
on the train is just as valid as that of the observer on the ground. If their 
interpretations differ (one calling the process electric, the other magnetic), 
so be it; their actual predictions are in agreement. Here’s what he wrote on 
the first page of his 1905 paper introducing the special theory of relativity: 


It is known that Maxwell’s electrodynamics — as usually understood at 
the present time — when applied to moving bodies, leads to asymmetries 
which do not appear to be inherent in the phenomena. Take, for example, 
the reciprocal electrodynamic action of a magnet and a conductor. The 
observable phenomenon here depends only on the relative motion of 
the conductor and the magnet, whereas the customary view draws a 
sharp distinction between the two cases in which either one or the other 
of these bodies is in motion. For if the magnet is in motion and the 
conductor at rest, there arises in the neighborhood of the magnet an
electric field... producing a current at the places where parts of the
conductor are situated. But if the magnet is stationary and the conductor 
in motion, no electric field arises in the neighborhood of the magnet. 
In the conductor, however, we find an electromotive force ... which gives 
rise — assuming equality of relative motion in the two cases discussed — 
to electric currents of the same path and intensity as those produced by 
the electric forces in the former case. 

Examples of this sort, together with unsuccessful attempts to discover 
any motion of the earth relative to the “light medium,” suggest that 
the phenomena of electrodynamics as well as of mechanics possess no 
properties corresponding to the idea of absolute rest.²


But I’m getting ahead of the story. To Einstein’s predecessors, the 
equality of the two emfs was just a lucky accident; they had no doubt that 
one observer was right and the other was wrong. They thought of electric 
and magnetic fields as strains in an invisible jellylike medium called ether, 
which permeates all of space. The speed of the charge was to be measured 
with respect to the ether — only then would the laws of electrodynamics be 
valid. The train observer is wrong, because that frame is moving relative to 
the ether. 

But wait a minute! How do we know the ground observer isn’t moving 
relative to the ether, too? After all, the Earth rotates on its axis once a day 
and revolves around the Sun once a year; the solar system circulates around 
the galaxy, and for all I know the Galaxy itself is moving at a high speed 
through the cosmos. All told, we should be traveling at well over 50 km/s 
with respect to the ether. Like a motorcycle rider on the open road, we face 
an “ether wind” of high velocity — unless by some miraculous coincidence 
we just happen to find ourselves in a tailwind of precisely the right strength, 
or the Earth has some sort of “windshield” and drags its local supply of 
ether along with it. Suddenly it becomes a matter of crucial importance 
to find the ether frame, experimentally, or else all our calculations will be 
invalid. 

The problem, then, is to determine our motion through the ether — to 
measure the speed and direction of the “ether wind.” How shall we do 
it? At first glance you might suppose that practically any electromagnetic 
experiment would suffice: if Maxwell’s equations are valid only with respect 
to the ether frame, any discrepancy between the experimental result and the 
theoretical prediction would be ascribable to the ether wind. Unfortunately, 
as nineteenth-century physicists soon realized, the anticipated error in a 
typical experiment is extremely small; as in the example above, “coinci- 
dences” always seem to conspire to hide the fact that we are using the 
“wrong” reference frame. So it takes an uncommonly delicate experiment 
to do the job. 


2 A translation of Einstein’s first relativity paper, “On the Electrodynamics of Moving
Bodies,” is reprinted in H. A. Lorentz et al., The Principle of Relativity, Dover, New York 
(1923). 
[Fig. 12.2. Labels: speed of light; ether wind.]


Now, among the results of classical electrodynamics is the prediction
that electromagnetic waves travel through the vacuum at a speed

$$\frac{1}{\sqrt{\epsilon_0 \mu_0}} = 3.00 \times 10^8\ \text{m/s},$$

relative (presumably) to the ether. In principle, then, one should be able to
detect the ether wind by simply measuring the speed of light in various
directions. Like a motorboat on a river, the net speed “downstream” should
be a maximum, for here the light is swept along by the ether; in the opposite
direction, where it is bucking the current, the speed should be a minimum
(Fig. 12.2).

While the idea of this experiment could not be simpler, its execution is
another matter, because light travels so inconveniently fast. If it weren’t for
that “technical detail,” you could do it with a flashlight and a stopwatch.
As it happened, an elaborate and lovely experiment was conducted by
Michelson and Morley in the 1880s, using an optical interferometer of
fantastic precision. I shall not go into the details here, because I do not want
to distract your attention from the two essential points: (1) all Michelson
and Morley were trying to do was compare the speed of light in different
directions, and (2) what they in fact discovered was that this speed is exactly
the same in all directions.

Nowadays, when students are taught in high school to snicker at the
naïveté of the ether model, it takes some imagination to comprehend how
utterly perplexing this result must have been at the time. All other waves
(water waves, sound waves, waves on a string) travel at a prescribed speed
relative to the propagating medium (the stuff that does the waving), and
if this medium is in motion with respect to the observer, the net speed is
always greater “downstream” than “upstream.” Over the next 20 years, a
series of improbable schemes were concocted in an effort to explain why this
does not occur with light. Michelson and Morley themselves interpreted
their experiment as confirmation of the “ether drag” hypothesis, which
held that the Earth somehow pulls the ether along with it. But this was
found to be inconsistent with other observations, notably the aberration of
starlight.³ Various so-called “emission” theories were proposed, according
to which the speed of electromagnetic waves is governed by the motion of


3 A discussion of the Michelson—Morley experiment and related matters is to be found in 
R. Resnick’s Introduction to Special Relativity, John Wiley, New York (1968), Chap. 1. 
the source — as it would be in a corpuscular theory (conceiving of light as a
stream of particles). Such theories called for implausible modifications in 
Maxwell’s equations, but in any event they were discredited by experiments 
using extraterrestrial light sources. Meanwhile, Fitzgerald and Lorentz 
suggested that the ether wind physically compresses all matter (including 
the Michelson—Morley apparatus itself) in just the right way to compensate 
for, and thereby conceal, the variation in speed with direction. As it turns 
out, there is a grain of truth in this, although their idea of the reason for 
the contraction was quite wrong. 

At any rate, it was not until Einstein that anyone took the Michelson— 
Morley result at face value, and suggested that the speed of light is a 
universal constant, the same in all directions, regardless of the motion 
of the observer or the source. There is no ether wind because there is no 
ether. Any inertial system is a suitable reference frame for the application 
of Maxwell’s equations, and the velocity of a charge is to be measured not 
with respect to a (nonexistent) absolute rest frame, nor with respect to a 
(nonexistent) ether, but simply with respect to the particular inertial system 
you happen to have chosen. 

Inspired, then, both by internal theoretical hints (the fact that the laws 
of electrodynamics are such as to give the right answer even when applied 
in the “wrong” system) and by external empirical evidence (the Michelson–Morley
experiment⁴), Einstein proposed his two famous postulates:


(1) The Principle of Relativity. The laws of physics apply in all inertial 
reference systems. 


(2) The Universal Speed of Light. The speed of light in vacuum is the 
same for all inertial observers, regardless of the motion of the source. 


The special theory of relativity is based on these two postulates. The first 
elevates Galileo’s observation about classical mechanics to the status of a 
general law, applying to all of physics. It states that there is no absolute 
rest system. The second might be considered Einstein’s response to the 
Michelson—Morley experiment. It means that there is no ether. (Some 
authors consider Einstein’s second postulate redundant — no more than 
a special case of the first. They maintain that the very existence of ether 
would violate the principle of relativity, in the sense that it would define a 
unique stationary reference frame. I think this is nonsense. The existence 
of air as a medium for sound does not invalidate the theory of relativity. 
Ether is no more an absolute rest system than the water in a goldfish bowl — 
which is a special system, if you happen to be the goldfish, but scarcely 
“absolute.”⁵)


4 Actually, Einstein appears to have been only dimly aware of the Michelson—Morley experi- 
ment at the time. For him, the theoretical argument was decisive. 

5 I put it this way in an effort to dispel some misunderstanding as to what constitutes an 
absolute rest frame. In 1977 it became possible to measure the speed of the Earth through 
the 3 K background radiation left over from the “big bang.” Does this mean we have found 
an absolute rest system, and relativity is out the window? Of course not. 
Unlike the principle of relativity, which had roots going back several
centuries, the universal speed of light was radically new — and, on the face 
of it, preposterous. For if I walk 5 mi/h down the corridor of a train going 
60 mi/h, my net speed relative to the ground is “obviously” 65 mi/h — the 
speed of A (me) with respect to C (ground) is equal to the speed of A relative 
to B (train) plus the speed of B relative to C: 


$$v_{AC} = v_{AB} + v_{BC}. \qquad (12.1)$$


And yet, if A is a light signal (whether it comes from a flashlight on the train 
or a lamp on the ground or a star in the sky) Einstein would have us believe 
that its speed is c relative to the train and c relative to the ground: 


$$v_{AC} = v_{AB} = c. \qquad (12.2)$$


Clearly, Eq. 12.1, which we now call Galileo’s velocity addition rule (no one 
before Einstein would have bothered to give it a name at all) is incompatible 
with the second postulate. In special relativity, as we shall see, it is replaced 
by Einstein’s velocity addition rule: 


$$v_{AC} = \frac{v_{AB} + v_{BC}}{1 + (v_{AB}v_{BC}/c^2)}. \qquad (12.3)$$

For “ordinary” speeds ($v_{AB} \ll c$ and $v_{BC} \ll c$), the denominator is so close
to 1 that the discrepancy between Galileo’s formula and Einstein’s formula
is negligible. On the other hand, Einstein’s formula has the desired property
that if $v_{AB} = c$, then automatically $v_{AC} = c$:

$$v_{AC} = \frac{c + v_{BC}}{1 + (c\,v_{BC}/c^2)} = c.$$
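
As a rough numerical illustration of Eqs. 12.1 and 12.3 (a sketch, not part of the text's argument), the two rules can be compared directly. The function names and the sample speeds below are hypothetical choices, and speeds are expressed as fractions of c so that c = 1 by default.

```python
def galileo_add(v_ab, v_bc):
    """Galileo's rule (Eq. 12.1): the two velocities simply add."""
    return v_ab + v_bc


def einstein_add(v_ab, v_bc, c=1.0):
    """Einstein's rule (Eq. 12.3); by default speeds are fractions of c."""
    return (v_ab + v_bc) / (1.0 + v_ab * v_bc / c**2)


# "Ordinary" speeds (tiny fractions of c): the two rules agree closely.
print(galileo_add(1.0e-7, 2.0e-7), einstein_add(1.0e-7, 2.0e-7))

# Two speeds of 0.9c: Galileo gives 1.8c, Einstein gives a result below c.
print(galileo_add(0.9, 0.9), einstein_add(0.9, 0.9))

# A light signal (v_AB = c) still travels at c in the other frame.
print(einstein_add(1.0, 0.3))
```

For two speeds of 0.9c, Einstein's rule gives 1.8/1.81, roughly 0.9945c.
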
But how can Galileo’s rule, which seems to rely on nothing but common 
sense, possibly be wrong? And if it is wrong, what does this do to all 
of classical physics? The answer is that special relativity compels us to 
alter our notions of space and time themselves, and therefore also of 
such derived quantities as velocity, momentum, and energy. Although it 
developed historically out of Einstein’s contemplation of electrodynamics, 
the special theory is not limited to any particular class of phenomena — 
rather, it is a description of the space-time “arena” in which all physical 
phenomena take place. And in spite of the reference to the speed of light 
in the second postulate, relativity has nothing to do with light: c is a 
fundamental velocity, and it happens that light travels at that speed, but 
it is perfectly possible to conceive of a universe in which there are no 
electric charges, and hence no electromagnetic fields or waves (light), and 
yet relativity would still prevail. Because relativity defines the structure of 
space and time, it claims authority not merely over all presently known 
phenomena, but over those not yet discovered. It is, as Kant would say, 
a “prolegomenon to any future physics.” 
Problem 12.1. Let S be an inertial reference system. Use Galileo's velocity addition rule.


(a) Suppose that $\bar{S}$ moves with constant velocity relative to S. Show that $\bar{S}$ is also an inertial
reference system. [Hint: Use the definition in footnote 1.]

(b) Conversely, show that if $\bar{S}$ is an inertial system, then it moves with respect to S at constant
velocity.


Problem 12.2. As an illustration of the principle of relativity in classical mechanics, consider 
the following generic collision: In inertial frame S, particle A (mass $m_A$, velocity $u_A$) hits
particle B (mass $m_B$, velocity $u_B$). In the course of the collision some mass rubs off A and onto
B, and we are left with particles C (mass $m_C$, velocity $u_C$) and D (mass $m_D$, velocity $u_D$).
Assume that momentum (p = mu) is conserved in S. 


(a) Prove that momentum is also conserved in inertial frame $\bar{S}$, which moves with velocity v
relative to S. [Use Galileo's velocity addition rule — this is an entirely classical calculation. 
What must you assume about mass?] 

(b) Suppose the collision is elastic in S; show that it is also elastic in $\bar{S}$.


Problem 12.3. 


(a) What's the percentage error introduced when you use Galileo’s rule, instead of Einstein’s, 
with $v_{AB} = 5$ mi/h and $v_{BC} = 60$ mi/h?

(b) Suppose you could run at half the speed of light down the corridor of a train going three- 
quarters the speed of light. What would your speed be relative to the ground? 

(c) Prove, using Eq. 12.3, that if $v_{AB} < c$ and $v_{BC} < c$ then $v_{AC} < c$. Interpret this
result. 


Fig. 12.3 


Problem 12.4. As the outlaws escape in their getaway car, which goes $\frac{3}{4}c$, the police officer
fires a bullet from the pursuit car, which only goes $\frac{1}{2}c$ (Fig. 12.3). The muzzle velocity of the bullet
(relative to the gun) is $\frac{1}{3}c$. Does the bullet reach its target (a) according to Galileo, (b) according
to Einstein? 


12.1.2 The Geometry of Relativity 


In this section I present a series of gedanken (thought) experiments that 
serve to introduce the three most striking geometrical consequences of 
Einstein’s postulates: time dilation, Lorentz contraction, and the relativity 
of simultaneity. In Section 12.1.3 the same results will be derived more 
systematically, using Lorentz transformations. 
[Figs. 12.4 and 12.5. Labels: (a), (b).]


(i) The relativity of simultaneity. Imagine a freight car, traveling at 
constant speed along a smooth, straight track (Fig. 12.4). In the very center 
of the car there hangs a light bulb. When someone switches it on, the light 
spreads out in all directions at speed c. Because the lamp is equidistant from 
the two ends, an observer on the train will find that the light reaches the 
front end at the same instant as it reaches the back end: the two events 
in question — (a) light reaches the front end (and maybe a buzzer goes 
off) and (b) light reaches the back end (another buzzer sounds) — occur 
simultaneously. 

However, to an observer on the ground these same two events are not 
simultaneous. For as the light travels out from the bulb (going at speed c in 
both directions — that’s the second postulate), the train itself moves forward, 
so the beam going to the back end has a shorter distance to travel than the 
one going forward (Fig. 12.5). According to this observer, therefore, event 
(b) happens before event (a). An observer passing by on an express train, 
meanwhile, would report that (a) preceded (b). Conclusion: 


Two events that are simultaneous in one inertial system are not, in 
general, simultaneous in another. 


Naturally, the train has to be going awfully fast before the discrepancy 
becomes detectable — that’s why you don’t notice it all the time. 
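
To get a feel for how small the discrepancy is, here is a minimal sketch of the ground observer's bookkeeping. It assumes `car_length` is the length of the car as measured on the ground; the function name and the sample numbers (a 20 m car moving at 30 m/s) are hypothetical.

```python
C = 299_792_458.0  # speed of light in m/s


def arrival_times(car_length, v, c=C):
    """Ground-frame travel times of the light from the center of the car
    to the back end and to the front end, for a car moving at speed v."""
    half = car_length / 2.0
    t_back = half / (c + v)   # the back end moves toward the flash point
    t_front = half / (c - v)  # the front end moves away from it
    return t_back, t_front


t_back, t_front = arrival_times(car_length=20.0, v=30.0)
print(t_back < t_front)   # event (b) precedes event (a) on the ground
print(t_front - t_back)   # the time gap, in seconds
```

For these everyday numbers the gap is car_length × v/(c² − v²), a few femtoseconds, which is why the effect goes unnoticed.
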

Of course, it’s always possible for naive witnesses to be mistaken about 
simultaneity: people sitting in the back corner of the car would hear buzzer 
b before buzzer a, simply because they’re closer to the source of the sound, 
and a child might infer that b actually rang before a. But this is a trivial 
error, having nothing to do with special relativity — obviously, you must 
correct for the time the signal (sound, light, carrier pigeon, or whatever) 
takes to reach you. When I speak of an observer, I mean someone with 
the sense to make this correction, and an observation is what is obtained 
after doing so. What you hear or see, therefore, is not the same as what you 
observe. An observation is an artificial reconstruction after the fact, when 
all the data are in, and it doesn’t depend on where the observer is located. 
In fact, a wise observer will avoid the whole problem by stationing assistants 
at strategic locations, each equipped with a watch synchronized to a master 
clock, so that time measurements can be made right at the scene. I belabor 
this point in order to emphasize that the relativity of simultaneity is a 
genuine discrepancy between measurements made by competent observers 
in relative motion, not a silly mistake arising from a failure to account for 
the travel time of light signals. 
Problem 12.5. Synchronized clocks are stationed at regular intervals, a million kilometers
apart, along a straight line. The clock next to you reads 12 noon: 


(a) What time do you see on the 90th clock down the line? 
(b) What time do you observe on that clock? 


[Fig. 12.6. Label: to Earth.]


Problem 12.6. Every 2 years, more or less, the New York Times publishes an article in which 
some astronomer claims to have found an object traveling faster than the speed of light. Many 
of these reports result from a failure to distinguish what is seen from what is observed — that is, 
from a failure to account for light travel time. Here's an example: A star is traveling with speed v 
at an angle $\theta$ to the line of sight (Fig. 12.6). What is its apparent speed across the sky? (Suppose
the light signal from b reaches the Earth at a time $\Delta t$ after the signal from a, and the star
has meanwhile advanced a distance $\Delta s$ across the celestial sphere; by “apparent speed,” I mean
$\Delta s/\Delta t$.) What angle $\theta$ gives the maximum apparent speed? Show that the apparent speed can
be much greater than c, even if v itself is less than c. 


(ii) Time dilation. Now let’s consider a light ray that leaves the bulb and 
strikes the floor of the car directly below. Question: How long does it take 
the light to make this trip? From the point of view of an observer on the 
train, the answer is easy: if the height of the car is h, the time is 


$$\Delta\bar{t} = \frac{h}{c}. \qquad (12.4)$$

(I'll use an overbar to denote measurements made on the train.) On the
other hand, as observed from the ground, this same ray must travel farther,
because the train itself is moving. From Fig. 12.7, I see that this distance is
$\sqrt{h^2 + (v\Delta t)^2}$, so

$$\Delta t = \frac{\sqrt{h^2 + (v\Delta t)^2}}{c}.$$

Solving for $\Delta t$, we have

$$\Delta t = \frac{h}{c}\,\frac{1}{\sqrt{1 - v^2/c^2}},$$

and therefore

$$\Delta\bar{t} = \sqrt{1 - v^2/c^2}\,\Delta t. \qquad (12.5)$$
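
The algebra leading to Eq. 12.5 can be spelled out as follows. This is only a sketch of the intermediate steps, with $\Delta t$ the interval on the ground and $\Delta\bar{t} = h/c$ the interval on the train (Eq. 12.4):

```latex
\begin{align*}
c\,\Delta t &= \sqrt{h^2 + (v\,\Delta t)^2} \\
c^2\,\Delta t^2 &= h^2 + v^2\,\Delta t^2 \\
\Delta t^2\,(c^2 - v^2) &= h^2 \\
\Delta t &= \frac{h}{\sqrt{c^2 - v^2}} = \frac{h}{c}\,\frac{1}{\sqrt{1 - v^2/c^2}} \\
\Delta\bar{t} = \frac{h}{c} &= \sqrt{1 - v^2/c^2}\;\Delta t
\end{align*}
```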


Evidently, the time elapsed between the same two events — (a) light leaves 
bulb, and (b) light strikes center of floor — is different for the two observers. 
In fact, the interval recorded on the train clock, $\Delta\bar{t}$, is shorter by the factor

$$\gamma \equiv \frac{1}{\sqrt{1 - v^2/c^2}}. \qquad (12.6)$$


Conclusion: 
Moving clocks run slow. 


This is called time dilation. It doesn’t have anything to do with the 
mechanics of clocks; it’s a statement about the nature of time, and applies 
to all properly functioning timepieces. 

Of all Einstein’s predictions, none has received more spectacular and 
persuasive confirmation than time dilation. Most elementary particles are 
unstable: they disintegrate after a characteristic lifetime⁶ that varies from
one species to the next. The lifetime of a neutron is 15 min; of a muon,
$2 \times 10^{-6}$ s; and of a neutral pion, $9 \times 10^{-17}$ s. But these are lifetimes of particles
at rest. When particles are moving at speeds close to c they last much longer, 
for their internal clocks (whatever it is that tells them when their time is up) 
are running slow, in accordance with Einstein’s time dilation formula. 


Example 12.1. A muon is traveling through the laboratory at three- 
fifths the speed of light. How long does it last? 


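Solution. In this case,

$$\gamma = \frac{1}{\sqrt{1 - (3/5)^2}} = \frac{5}{4},$$

so it lives longer (than at rest) by a factor of $\frac{5}{4}$:

$$\frac{5}{4} \times (2 \times 10^{-6})\ \text{s} = 2.5 \times 10^{-6}\ \text{s}.$$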
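
The arithmetic of this example is easy to check numerically. The helper names below are hypothetical, and `beta` stands for v/c; this is only a sketch of the time dilation formula, not a general-purpose routine.

```python
import math


def gamma(beta):
    """Time dilation factor 1/sqrt(1 - v^2/c^2), with beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta**2)


def dilated_lifetime(rest_lifetime, beta):
    """Lifetime in the laboratory of a particle moving at speed beta * c."""
    return gamma(beta) * rest_lifetime


muon_rest_lifetime = 2.0e-6  # seconds, the value quoted in the text
print(gamma(0.6))                                 # close to 5/4
print(dilated_lifetime(muon_rest_lifetime, 0.6))  # close to 2.5e-6 s
```
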


6 Actually, an individual particle may last longer or shorter than this. Particle disintegration 
is a random process, and I should really speak of the average lifetime for the species. But to 
avoid irrelevant complication, I shall pretend that every particle disintegrates after precisely 
the average lifetime. 
It may strike you that time dilation is inconsistent with the principle of
relativity. For if the ground observer says the train clock runs slow, the train 
observer can with equal justice claim that the ground clock runs slow — after 
all, from the train’s point of view it is the ground that is in motion. Who’s 
right? Answer: They’re both right! On closer inspection, the “contradiction,” 
which seems so stark, evaporates. Let me explain: In order to check the 
rate of the train clock, the ground observer uses two of his own clocks 
(Fig. 12.8): one to compare times at the beginning of the interval, when 
the train clock passes point A, the other to compare times at the end of the 
interval, when the train clock passes point B. Of course, he must be careful 
to synchronize his clocks before the experiment. What he finds is that while 
the train clock ticked off, say, 3 minutes, the interval between his own two 
clock readings was 5 minutes. He concludes that the train clock runs slow. 

Meanwhile, the observer on the train is checking the rate of the ground 
clock by the same procedure: she uses two carefully synchronized train 
clocks, and compares times with a single ground clock as it passes by 
each of them in turn (Fig. 12.9). She finds that while the ground clock 
ticks off 3 minutes, the interval between her train clocks is 5 minutes, 
and concludes that the ground clock runs slow. Is there a contradiction? 
No, for the two observers have measured different things. The ground 
observer compared one train clock with two different ground clocks; the 
train observer compared one ground clock with two different train clocks. 
Each followed a sensible and correct procedure, comparing a single moving 
clock with two stationary ones. “So what,” you say, “the stationary clocks 
were synchronized in each instance, so it cannot matter that they used two 
different ones.” But there’s the rub: Clocks that are properly synchronized 
in one system will not be synchronized when observed from another system. 
They can’t be, for to say that two clocks are synchronized is to say that 
they read 12 noon simultaneously, and we have already learned that what’s 
simultaneous to one observer is not simultaneous to another. So whereas 
each observer conducted a perfectly sound measurement, from his/her 
own point of view, the other observer (watching the process) considers 
that she/he made the most elementary blunder in the book, by using two 
unsynchronized clocks. That’s how, in spite of the fact that his clocks 
“actually” run slow, he manages to conclude that hers are running slow (and 
vice versa). 


[Figs. 12.8 and 12.9. Labels: train clock; train clock A; ground clock A; ground clock B; ground clock.]
Because moving clocks are not synchronized, it is essential when checking
time dilation to focus attention on a single moving clock. All moving
clocks run slow by the same factor, but you can’t start timing on one clock 
and then switch to another because they weren’t in step to begin with. 
But you can use as many stationary clocks (stationary with respect to you, 
the observer) as you please, for they are properly synchronized (moving 
observers would dispute this, but that’s their problem). 


Example 12.2. (The twin paradox) On her 21st birthday, an astronaut
takes off in a rocket ship at a speed of $\frac{12}{13}c$. After 5 years have elapsed
on her watch, she turns around and heads back at the same speed to
rejoin her twin brother, who stayed at home. Question: How old is each
twin at their reunion?


Solution. The traveling twin has aged 10 years (5 years out, 5 years 
back); she arrives at home just in time to celebrate her 31st birthday. 
However, as viewed from the Earth, the moving clock has been running 
slow by a factor 


$$\gamma = \frac{1}{\sqrt{1 - (12/13)^2}} = \frac{13}{5}.$$

The time elapsed on earthbound clocks is $\frac{13}{5} \times 10 = 26$ years, and her brother
will be therefore celebrating his 47th birthday — he is now 16 years older 
than his twin sister! But don’t be deceived: This is no fountain of youth 
for the traveling twin; though she may die later than her brother, she 
will not have lived any more — she’s just done it slower. During the flight, 
all her biological processes — metabolism, pulse, thought, and speech —
are subject to the same time dilation that affects her watch.

The so-called twin paradox arises when you try to tell this story from 
the point of view of the traveling twin. She sees the Earth fly off at
$\frac{12}{13}c$, turn around after 5 years, and return. From her point of view, it
would seem, she’s at rest, whereas her brother is in motion, and hence 
it is he who should be younger at the reunion. An enormous amount 
has been written about the twin paradox, but the truth is there’s really 
no paradox here at all: this second analysis is simply wrong. The two 
twins are not equivalent. The traveling twin experiences acceleration 
when she turns around to head home, but her brother does not. To 
put it in fancier language, the traveling twin is not in an inertial 
system — more precisely, she’s in one inertial system on the way out 
and a completely different inertial system on the way back. You'll 
see in Prob. 12.16 how to analyze this problem correctly from her 
perspective, but as far as the resolution of the “paradox” is concerned, 
it is enough to note that the traveling twin cannot claim to be a 
stationary observer because you can’t undergo acceleration and remain 
stationary. 
Problem 12.7. In a laboratory experiment, a muon is observed to travel 800 m before disintegrating.
A graduate student looks up the lifetime of a muon ($2 \times 10^{-6}$ s) and concludes that
its speed was

$$v = \frac{800\ \text{m}}{2 \times 10^{-6}\ \text{s}} = 4 \times 10^8\ \text{m/s}.$$

Faster than light! Identify the student's error, and find the actual speed of this muon.


Problem 12.8. A rocket ship leaves Earth at a speed of $\frac{3}{5}c$. When a clock on the rocket says
1 hour has elapsed, the rocket ship sends a light signal back to Earth.


(a) According to Earth clocks, when was the signal sent?

(b) According to Earth clocks, how long after the rocket left did the signal arrive back on Earth? 

(c) According to the rocket observer, how long after the rocket left did the signal arrive back on 
Earth? 


(iii) Lorentz contraction. For the third gedanken experiment you must
imagine that we have set up a lamp at one end of a boxcar and a mirror
at the other, so that a light signal can be sent down and back (Fig. 12.10).
Question: How long does the signal take to complete the round trip? To an
observer on the train, the answer is

$$ \Delta\bar{t} = 2\,\frac{\Delta\bar{x}}{c}, \tag{12.7} $$

where $\Delta\bar{x}$ is the length of the car (the overbar, as before, denotes
measurements made on the train). To an observer on the ground, the process is
more complicated because of the motion of the train. If $\Delta t_1$ is the time for
the light signal to reach the front end, and $\Delta t_2$ is the return time, then (see
Fig. 12.11)

$$ \Delta t_1 = \frac{\Delta x + v\,\Delta t_1}{c}, \qquad \Delta t_2 = \frac{\Delta x - v\,\Delta t_2}{c}, $$

or, solving for $\Delta t_1$ and $\Delta t_2$,

$$ \Delta t_1 = \frac{\Delta x}{c - v}, \qquad \Delta t_2 = \frac{\Delta x}{c + v}. $$

So the round-trip time is

$$ \Delta t = \Delta t_1 + \Delta t_2 = 2\,\frac{\Delta x}{c}\,\frac{1}{(1 - v^2/c^2)}. \tag{12.8} $$


[Figs. 12.10 and 12.11: boxcar with lamp; train displacements $v\,\Delta t_1$ and $v\,\Delta t_2$.]

But these intervals are related by the time dilation formula, Eq. 12.5:

$$ \Delta\bar{t} = \sqrt{1 - v^2/c^2}\;\Delta t. $$


Applying this to Eqs. 12.7 and 12.8, I conclude that

$$ \Delta\bar{x} = \frac{1}{\sqrt{1 - v^2/c^2}}\,\Delta x. \tag{12.9} $$

The length of the boxcar is not the same when measured by an observer on
the ground, as it is when measured by an observer on the train; from the
ground point of view, it is somewhat shorter.⁷ Conclusion:


Moving objects are shortened. 


We call this Lorentz contraction. Notice that the same factor,

$$ \gamma \equiv \frac{1}{\sqrt{1 - v^2/c^2}}, $$

appears in both the time dilation formula and the Lorentz contraction
formula. This makes it all very easy to remember: Moving clocks run slow,
moving sticks are shortened, and the factor is always $\gamma$.
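
As a consistency check on the boxcar argument, the sketch below evaluates Eqs. 12.7 and 12.8 for one set of numbers and confirms that they agree with the time dilation formula once the ground-frame length is taken from Eq. 12.9. The speed (0.6c), the 20 m car length, and all function names are assumptions chosen only for illustration.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def gamma(v):
    """The factor 1/sqrt(1 - v^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)

def round_trip_train(dx_bar):
    """Eq. 12.7: round-trip time measured on the train."""
    return 2.0 * dx_bar / C

def round_trip_ground(dx, v):
    """Eq. 12.8, built from the two legs: dx/(c - v) out, dx/(c + v) back."""
    return dx / (C - v) + dx / (C + v)

v = 0.6 * C
dx_bar = 20.0                 # car length measured on the train (m)
dx = dx_bar / gamma(v)        # Eq. 12.9 solved for the ground-frame length

dt_bar = round_trip_train(dx_bar)
dt = round_trip_ground(dx, v)

# Time dilation (Eq. 12.5) requires dt_bar = dt / gamma.
print(dx, dt_bar, dt / gamma(v))
```

If you replace dx by the uncontracted length dx_bar, the last two printed numbers no longer agree, which is the whole point of the argument.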

Of course, the observer on the train doesn’t think her car is shortened — 
her meter sticks are contracted by that same factor, so all her measurements 
come out the same as when the train was standing in the station. In fact, 
from her point of view it is objects on the ground that are shortened. This 
raises again a seeming paradox: if A says B’s sticks are short, and B says 
A’s sticks are short, who is right? Answer: They both are! But to reconcile 
the rival claims we must study carefully the actual process by which length 
is measured. 

Suppose you want to measure the length of a snake. If it’s at rest (with 
respect to you) you simply lay your ruler down next to the snake, record 
the readings at each end, and subtract (Fig. 12.12). (If you’re really clever, 
you'll line up the left end of the ruler with the tail of the snake — then you 
only have to read one number, and you can skip the subtraction.) 

But what if the snake is moving? Same story, only this time, of course, you 
must be careful to read the two ends at the same instant of time. If you don’t, 
the snake will move in the course of measurement, and obviously you'll get 
the wrong answer. But therein lies the problem: because of the relativity
of simultaneity the two observers disagree on what constitutes “the same
instant of time.” When the person on the ground measures the length of
the boxcar, he reads the position of the two ends at the same instant in his
system. But the person on the train, watching him do it, complains that he
read the front end first, then waited a moment before reading the back end.
Naturally, he came out short, in spite of the fact that (to her) he was using
an undersized meter stick, which would otherwise have yielded a number
too large. Both observers measure lengths correctly (from the point of view
of their respective inertial frames), and each finds the other’s sticks to be
shortened. Yet there is no inconsistency, for they are measuring different
things, and each considers the other’s method improper.

[Fig. 12.12: measuring the length of a snake.]

7 Notice that I did not say it “appears” shorter, or it “seems” shorter, or it “looks” shorter.
No, no, no! It is shorter, by any sensible definition of the term.


Example 12.3. (The barn and ladder paradox) Unlike time dilation, 
there is no direct experimental confirmation of Lorentz contraction, 
for the simple reason that it’s tough to get an object of measurable 
size going anywhere near the speed of light. The following parable 
illustrates how bizarre the world would be if the speed of light were 
more accessible. 

There once was a farmer who had a ladder too long to fit in his barn 
(Fig. 12.13a). He chanced one day to read some relativity, and a solu- 
tion to his problem suggested itself. He instructed his daughter to run 
with the ladder as fast as she could — the moving ladder having Lorentz- 
contracted to a size the barn could easily accommodate, she was to 
rush through the door, whereupon the farmer would slam it behind 
her, capturing the ladder inside (Fig. 12.13b). The daughter, however, 
has read somewhat further in the relativity book; she points out that 
in her reference frame the barn, not the ladder, will contract, and the 
fit will be even worse than it was with the two at rest (Fig. 12.13c). 
Question: Who’s right? Will the ladder fit inside the barn, or won’t it? 


Solution. They’re both right! When you say “the ladder is in the barn,” 
you mean that all parts of it are inside at one instant of time, but in view 
of the relativity of simultaneity, that’s a condition that depends on the 
observer. There are really two relevant events here: 


a: Back end of ladder makes it in the door.

b: Front end of ladder hits far wall of barn. 

The farmer says a occurs before b, so there is a time when the ladder is 
entirely inside the barn; his daughter says b precedes a, so there is not. 
Contradiction? Nope — just a difference in perspective. 
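
The disagreement about the order of events a and b can be made concrete using nothing but Lorentz contraction. The sketch below is an illustration under stated assumptions: the rest lengths (a 10 m ladder, an 8 m barn), the speed 0.8c, units with c = 1, and the function name event_times are all hypothetical choices, and times are measured from the moment the front of the ladder passes the door.

```python
import math

def gamma(beta):
    """The factor 1/sqrt(1 - beta^2), with beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta**2)

def event_times(ladder_rest, barn_rest, beta, frame):
    """Return (t_a, t_b) in units where c = 1.

    a: back end of the ladder reaches the door
    b: front end of the ladder reaches the far wall
    """
    g = gamma(beta)
    if frame == "farmer":        # barn at rest, ladder contracted
        ladder, barn = ladder_rest / g, barn_rest
    elif frame == "daughter":    # ladder at rest, barn contracted
        ladder, barn = ladder_rest, barn_rest / g
    else:
        raise ValueError("frame must be 'farmer' or 'daughter'")
    # In either frame the relative speed is beta, so each event happens
    # after the relevant length has slid past.
    return ladder / beta, barn / beta

for frame in ("farmer", "daughter"):
    t_a, t_b = event_times(10.0, 8.0, 0.8, frame)
    first = "a" if t_a < t_b else "b"
    print(f"{frame}: t_a = {t_a:.2f}, t_b = {t_b:.2f}, first event: {first}")
```

With these numbers the farmer finds a before b, while his daughter finds b before a, just as the text says.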

“But come now” (I hear you protest), “when it’s all over and the 
dust clears, either the ladder is inside the barn, or it isn’t. There can 
be no dispute about that.” Quite so, but now you’re introducing a new 
element into the story: What happens as the ladder is brought to a stop? 
Suppose the farmer grabs the last rung of the ladder firmly with one 
hand, while he slams the door with the other. Assuming it remains 
intact, the ladder must now stretch out to its rest length. Evidently, the 
front end keeps going, even after the rear end has stopped! Expanding 
like an accordion, the front end of the ladder smashes into the far 
side of the barn. In truth, the whole notion of a “rigid” object loses 
its meaning in relativity, for when it changes its speed, different parts 
do not in general accelerate simultaneously — in this way, the material 
stretches or shrinks to reach the length appropriate to its new velocity.⁸

But to return to the question at hand, when the ladder finally comes 
to a stop, is it inside the barn or not? The answer is indeterminate. 
When the front end of the ladder hits the far side of the barn, 
something has to give, and the farmer is left either with a broken ladder 
inside the barn or with the ladder intact poking through a hole in the 
wall. In any event, he is unlikely to be pleased with the outcome. 


One final comment on Lorentz contraction. A moving object is short- 
ened only along the direction of its motion: 


Dimensions perpendicular to the velocity are not contracted. 


Indeed, in deriving the time dilation formula I took it for granted that 
the height of the train is the same for both observers. I'll now justify this, 
using a lovely gedanken experiment suggested by Taylor and Wheeler.⁹
Imagine that we build a wall beside the railroad tracks, and 1 m above
the rails (as measured on the ground), we paint a horizontal blue line.
When the train goes by, a passenger leans out the window holding a wet
paintbrush 1 m above the rails, as measured on the train, leaving a horizontal
red line on the wall. Question: Does the passenger’s red line lie above or
below our blue one? If the rule were that perpendicular directions contract,
then the person on the ground would predict that the red line is lower,
while the person on the train would say it’s the blue one (to the latter, of
course, the ground is moving). The principle of relativity says that both
observers are equally justified, but they cannot both be right. No subtleties
of simultaneity or synchronization can rationalize this contradiction; either
the blue line is higher or the red one is, unless they exactly coincide, which
is the inescapable conclusion. There cannot be a law of contraction (or
expansion) of perpendicular dimensions, for it would lead to irreconcilably
inconsistent predictions.

8 For a related paradox see E. Pierce, Am. J. Phys. 75, 610 (2007).

9 E. F. Taylor and J. A. Wheeler, Spacetime Physics, 2nd ed., W. H. Freeman, San Francisco
(1992). A somewhat different version of the same argument is given in J. H. Smith,
Introduction to Special Relativity, Stipes, Champaign, IL (1965).


Problem 12.9. A Lincoln Continental is twice as long as a VW Beetle, when they are at rest. 
As the Continental overtakes the VW, going through a speed trap, a (stationary) policeman 
observes that they both have the same length. The VW is going at half the speed of light. How 
fast is the Lincoln going? (Leave your answer as a multiple of c.) 


Problem 12.10. A sailboat is manufactured so that the mast leans at an angle $\bar{\theta}$ with respect
to the deck. An observer standing on a dock sees the boat go by at speed v (Fig. 12.14). What 
angle does this observer say the mast makes? 


Fig. 12.14 Fig. 12.15 


Problem 12.11. A record turntable of radius R rotates at angular velocity $\omega$ (Fig. 12.15). The
circumference is presumably Lorentz-contracted, but the radius (being perpendicular to the
velocity) is not. What’s the ratio of the circumference to the diameter, in terms of $\omega$ and R?
According to the rules of ordinary geometry, it has to be $\pi$. What’s going on here?¹⁰


12.1.3 The Lorentz Transformations 


Any physical process consists of one or more events. An “event” is something
that takes place at a specific location (x, y, z), at a precise time (t).
The explosion of a firecracker, for example, is an event; a tour of Europe
is not. Suppose we know the coordinates (x, y, z, t) of a particular event
E in one inertial system S, and we would like to calculate the coordinates
$(\bar{x}, \bar{y}, \bar{z}, \bar{t})$ of that same event in some other inertial system $\bar{S}$. What we need
is a “dictionary” for translating from the language of S to the language of $\bar{S}$.

10 This is known as Ehrenfest’s paradox; for discussion and references, see H. Arzelies,
Relativistic Kinematics, Pergamon Press, Elmsford, NY (1966), Chap. IX, or T. A. Weber,
Am. J. Phys. 65, 486 (1997).

We may as well orient our axes as shown in Fig. 12.16, so that $\bar{S}$ slides
along the x-axis at speed v. If we “start the clock” (t = 0) at the moment
the origins (O and $\bar{O}$) coincide, then at time t, $\bar{O}$ will be a distance vt from
O, and hence

$$ x = d + vt, \tag{12.10} $$

where d is the distance from $\bar{O}$ to $\bar{A}$ at time t ($\bar{A}$ is the point on the $\bar{x}$-axis
that is even with E when the event occurs). Before Einstein, anyone would
have said immediately that

$$ \bar{x} = d, \tag{12.11} $$

and thus constructed the “dictionary”

$$
\begin{aligned}
\text{(i)}\quad & \bar{x} = x - vt, \\
\text{(ii)}\quad & \bar{y} = y, \\
\text{(iii)}\quad & \bar{z} = z, \\
\text{(iv)}\quad & \bar{t} = t.
\end{aligned}
\tag{12.12}
$$


These are now called the Galilean transformations, though they scarcely 
deserve so fine a title — the last one, in particular, went without saying, 
since everyone assumed the flow of time is the same for all observers. In the 
context of special relativity, however, we must expect (iv) to be replaced 
by a rule that incorporates time dilation, the relativity of simultaneity, 
and the nonsynchronization of moving clocks. Likewise, there will be a 
modification in (i) to account for Lorentz contraction. As for (ii) and (iii), 
they, at least, remain unchanged, for we have already seen that there can be 
no modification of lengths perpendicular to the motion. 

But where does the classical derivation of (i) break down? Answer:
In Eq. 12.11. For d is the distance from $\bar{O}$ to $\bar{A}$ as measured in S, whereas
$\bar{x}$ is the distance from $\bar{O}$ to $\bar{A}$ as measured in $\bar{S}$. Because $\bar{O}$ and $\bar{A}$ are at rest
in $\bar{S}$, $\bar{x}$ is the “moving stick,” which is contracted as observed in S:

$$ d = \frac{1}{\gamma}\,\bar{x}. \tag{12.13} $$

When this is inserted in Eq. 12.10 we obtain the relativistic version of (i):

$$ \bar{x} = \gamma(x - vt). \tag{12.14} $$


Of course, we could have run the same argument from the point of view
of $\bar{S}$. The diagram (Fig. 12.17) looks similar, but in this case it depicts the
scene at time $\bar{t}$, whereas Fig. 12.16 showed the scene at time t. (Note that t
and $\bar{t}$ represent the same physical instant at E, but not elsewhere, because
of the relativity of simultaneity.) If we assume that $\bar{S}$ also starts its clock
when the origins coincide, then at time $\bar{t}$, O will be a distance $v\bar{t}$ from $\bar{O}$, and
therefore

$$ \bar{x} = \bar{d} - v\bar{t}, \tag{12.15} $$

where $\bar{d}$ is the distance from O to A at time $\bar{t}$, and A is that point on the
x-axis that is even with E when the event occurs. The classical physicist would
have said that $x = \bar{d}$, and, using (iv), recovered (i). But, as before, relativity
demands that we observe a subtle distinction: x is the distance from O to
A in S, whereas $\bar{d}$ is the distance from O to A in $\bar{S}$. Because O and A are at
rest in S, x is the “moving stick,” and

$$ \bar{d} = \frac{1}{\gamma}\,x. \tag{12.16} $$

It follows that

$$ x = \gamma(\bar{x} + v\bar{t}). \tag{12.17} $$


This last equation comes as no surprise, for the symmetry of the situation
dictates that the formula for x, in terms of $\bar{x}$ and $\bar{t}$, should be identical to
the formula for $\bar{x}$ in terms of x and t (Eq. 12.14), except for a switch in the
sign of v. (If $\bar{S}$ is going to the right at speed v, with respect to S, then S is
going to the left at speed v, with respect to $\bar{S}$.) Nevertheless, this is a useful
result, for if we substitute $\bar{x}$ from Eq. 12.14, and solve for $\bar{t}$, we complete
the relativistic “dictionary”:




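$$
\begin{aligned}
\text{(i)}\quad & \bar{x} = \gamma(x - vt), \\
\text{(ii)}\quad & \bar{y} = y, \\
\text{(iii)}\quad & \bar{z} = z, \\
\text{(iv)}\quad & \bar{t} = \gamma\left(t - \frac{v}{c^2}\,x\right).
\end{aligned}
\tag{12.18}
$$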


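These are the famous Lorentz transformations, with which Einstein
replaced the Galilean ones. They contain all the geometrical information in
the special theory, as the following examples illustrate. The reverse dictionary,
which carries you from $\bar{S}$ back to S, can be obtained algebraically by
solving (i) and (iv) for x and t, or, more simply, by switching the sign of v:

$$
\begin{aligned}
\text{(i}'\text{)}\quad & x = \gamma(\bar{x} + v\bar{t}), \\
\text{(ii}'\text{)}\quad & y = \bar{y}, \\
\text{(iii}'\text{)}\quad & z = \bar{z}, \\
\text{(iv}'\text{)}\quad & t = \gamma\left(\bar{t} + \frac{v}{c^2}\,\bar{x}\right).
\end{aligned}
\tag{12.19}
$$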
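
The two dictionaries translate directly into code, and a numerical round trip is a quick sanity check that Eqs. 12.19 really undo Eqs. 12.18. What follows is a minimal sketch, not a definitive implementation: the function names to_bar and from_bar, the default of units with c = 1, and the sample event are assumptions made for illustration. Only the x and t parts are coded, since y and z are unchanged.

```python
import math

def gamma(v, c=1.0):
    """The factor 1/sqrt(1 - v^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)

def to_bar(x, t, v, c=1.0):
    """Eqs. 12.18 (i) and (iv): coordinates in S-bar from those in S."""
    g = gamma(v, c)
    return g * (x - v * t), g * (t - v * x / c**2)

def from_bar(x_bar, t_bar, v, c=1.0):
    """Eqs. 12.19 (i') and (iv'): the same formulas with v -> -v."""
    g = gamma(v, c)
    return g * (x_bar + v * t_bar), g * (t_bar + v * x_bar / c**2)

x, t, v = 3.0, 5.0, 0.6
x_bar, t_bar = to_bar(x, t, v)
print(x_bar, t_bar)
print(from_bar(x_bar, t_bar, v))   # should reproduce (x, t) up to rounding
```

The sample event was chosen to lie on the world line x = vt of the S-bar origin, so its x-bar coordinate comes out (essentially) zero.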


Example 12.4. (Simultaneity, synchronization, and time dilation) Suppose
event A occurs at $x_A = 0$, $t_A = 0$, and event B occurs at
$x_B = b$, $t_B = 0$. The two events are simultaneous in S (they both take
place at t = 0). But they are not simultaneous in $\bar{S}$, for the Lorentz
transformations give $\bar{x}_A = 0$, $\bar{t}_A = 0$ and $\bar{x}_B = \gamma b$, $\bar{t}_B = -\gamma(v/c^2)b$.
According to the $\bar{S}$ clocks, then, B occurred before A. This is nothing
new, of course, just the relativity of simultaneity. But I wanted you to
see how it follows from the Lorentz transformations.

Suppose that at time t = 0 observer S decides to examine all the
clocks in $\bar{S}$. He finds that they read different times, depending on their
location; from (iv),

$$ \bar{t} = -\gamma\,\frac{v}{c^2}\,x. $$

Those to the left of the origin (negative x) are ahead, and those to the
right are behind, by an amount that increases in proportion to their
distance (Fig. 12.18). Only the master clock at the origin reads $\bar{t} = 0$.
Thus, the nonsynchronization of moving clocks, too, follows directly
from the Lorentz transformations. Of course, from the $\bar{S}$ viewpoint it
is the S clocks that are out of synchronization, as you can check by
putting $\bar{t} = 0$ into (iv′).

Finally, suppose S focuses his attention on a single clock at rest in
the $\bar{S}$ frame (say, the one at $\bar{x} = a$), and watches it over some interval
$\Delta t$. How much time elapses on the moving clock? Because $\bar{x}$ is fixed,
(iv′) gives $\Delta t = \gamma\,\Delta\bar{t}$, or

$$ \Delta\bar{t} = \frac{1}{\gamma}\,\Delta t. $$

[Fig. 12.18: S clocks and $\bar{S}$ clocks.]

That’s the old time dilation formula, derived now from the Lorentz
transformations. Please note that it’s $\bar{x}$ we hold fixed, here, because
we’re watching one moving clock. If you hold x fixed, then you’re
watching a whole series of different $\bar{S}$ clocks as they pass by, and that
won’t tell you whether any one of them is running slow.
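
Both halves of this example are easy to reproduce numerically. The sketch below first reads a row of S-bar clocks at the single S instant t = 0, and then follows one S-bar clock (whose position in S is x = vt) to see how much time it accumulates. The speed 0.6c, units with c = 1, and the names t_bar and elapsed_on_moving_clock are assumptions made for the illustration.

```python
import math

def gamma(v, c=1.0):
    """The factor 1/sqrt(1 - v^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)

def t_bar(x, t, v, c=1.0):
    """Eq. 12.18 (iv): reading of the S-bar clock located at (x, t) in S."""
    return gamma(v, c) * (t - v * x / c**2)

def elapsed_on_moving_clock(dt, v, c=1.0):
    """Follow the S-bar clock at its own origin, whose S position is x = v t,
    from t = 0 to t = dt, and return the change in its reading."""
    return t_bar(v * dt, dt, v, c) - t_bar(0.0, 0.0, v, c)

v = 0.6

# Nonsynchronization: all S-bar clocks examined at t = 0 in S.
for x in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(f"x = {x:+.1f}   t_bar = {t_bar(x, 0.0, v):+.3f}")

# Time dilation: one moving clock watched for 10 units of S time.
print(elapsed_on_moving_clock(10.0, v), 10.0 / gamma(v))
```

Clocks at negative x come out ahead and those at positive x behind, and the single moving clock advances by only 1/gamma of the S interval.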


Example 12.5. (Lorentz contraction) Imagine a stick at rest in $\bar{S}$ (hence
moving to the right at speed v in S). Its rest length (that is, its length as
measured in $\bar{S}$) is $\Delta\bar{x} = \bar{x}_r - \bar{x}_l$, where the subscripts denote the right
and left ends of the stick. If an observer in S were to measure the stick,
he would subtract the positions of the two ends at one instant of his
time t: $\Delta x = x_r - x_l$ (for $t_l = t_r$). According to (i), then,

$$ \Delta x = \frac{1}{\gamma}\,\Delta\bar{x}. $$

This is the old Lorentz contraction formula. Note that it’s t we hold
fixed, here, because we’re talking about a measurement made by S, and
he marks off the two ends at the same instant of his time ($\bar{S}$ doesn’t
have to be so fussy, since the stick is at rest in her frame).


Example 12.6. (Einstein’s velocity addition rule) Suppose a particle
moves a distance dx (in S) in a time dt. Its velocity u is then

$$ u = \frac{dx}{dt}. $$

In $\bar{S}$, meanwhile, it has moved a distance

$$ d\bar{x} = \gamma(dx - v\,dt) $$

(as we see from (i)), in a time given by (iv):

$$ d\bar{t} = \gamma\left(dt - \frac{v}{c^2}\,dx\right). $$

The velocity in $\bar{S}$ is therefore

$$ \bar{u} = \frac{d\bar{x}}{d\bar{t}} = \frac{\gamma(dx - v\,dt)}{\gamma\left(dt - (v/c^2)\,dx\right)} = \frac{(dx/dt) - v}{1 - (v/c^2)(dx/dt)} = \frac{u - v}{1 - uv/c^2}. \tag{12.20} $$

This is Einstein’s velocity addition rule. To recover the more transparent
notation of Eq. 12.3, let A be the particle, B be S, and C be $\bar{S}$; then
$u = v_{AB}$, $\bar{u} = v_{AC}$, and $v = v_{CB} = -v_{BC}$, so Eq. 12.20 becomes

$$ v_{AC} = \frac{v_{AB} + v_{BC}}{1 + (v_{AB}\,v_{BC}/c^2)}. $$
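
The addition rule is a one-liner in code, and trying a few inputs gives a feel for how it differs from the Galilean sum. In this sketch the function name add_velocities and the sample speeds are assumptions for illustration; velocities are in units of c and lie along a common line.

```python
def add_velocities(v_ab, v_bc, c=1.0):
    """Einstein's rule: velocity of A relative to C, given the velocity of
    A relative to B and of B relative to C (all along the same line)."""
    return (v_ab + v_bc) / (1.0 + v_ab * v_bc / c**2)

print(add_velocities(0.5, 0.5))    # less than the Galilean answer of 1.0
print(add_velocities(1.0, 0.5))    # light still moves at c
print(add_velocities(1e-4, 2e-4))  # nearly Galilean at low speeds
```

No combination of speeds below c ever produces a result of c or more, and anything moving at c in one frame moves at c in the other.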


Problem 12.12. Solve Eqs. 12.18 for x, y, z, t in terms of $\bar{x}$, $\bar{y}$, $\bar{z}$, $\bar{t}$, and check that you recover
Eqs. 12.19.


Problem 12.13. Sophie Zabar, clairvoyant, cried out in pain at precisely the instant her
twin brother, 500 km away, hit his thumb with a hammer. A skeptical scientist observed
both events (brother’s accident, Sophie’s cry) from an airplane traveling at $\frac{12}{13}c$ to the right
(Fig. 12.19). Which event occurred first, according to the scientist? How much earlier was it, in
seconds?


Problem 12.14. 


(a) In Ex. 12.6 we found how velocities in the x direction transform when you go from S to $\bar{S}$.
Derive the analogous formulas for velocities in the y and z directions.

(b) A spotlight is mounted on a boat so that its beam makes an angle $\bar{\theta}$ with the deck
(Fig. 12.20). If this boat is then set in motion at speed v, what angle $\theta$ does an individual
photon trajectory make with the deck, according to an observer on the dock? What angle
does the beam (illuminated, say, by a light fog) make? Compare Prob. 12.10.


[Fig. 12.19: Brother and Sophie, 500 km apart. Fig. 12.20: spotlight beam on a boat moving at speed v.]


Problem 12.15. You probably did Prob. 12.4 from the point of view of an observer on the 
ground. Now do it from the point of view of the police car, the outlaws, and the bullet. That 
is, fill in the gaps in the following table: 


                  speed of →
relative to ↓   | Ground | Police | Outlaws | Bullet | Do they escape?
Ground          |   0    | (1/2)c | (3/4)c  |        |
Police          |        |        |         | (1/3)c |
Outlaws         |        |        |         |        |
Bullet          |        |        |         |        |

Problem 12.16. [The twin paradox revisited] On their 21st birthday, one twin gets on a
moving sidewalk, which carries her out to star X at speed $\frac{4}{5}c$; her twin brother stays at home.
When the traveling twin gets to star X, she immediately jumps onto the returning moving
sidewalk and comes back to Earth, again at speed $\frac{4}{5}c$. She arrives on her 39th birthday (as
determined by her watch).


(a) How old is her twin brother? 
(b) How far away is star X? (Give your answer in light years.) 


Call the outbound sidewalk system $\bar{S}$ and the inbound one $\tilde{S}$ (the Earth system is S). All three
systems choose their coordinates and set their master clocks such that $x = \bar{x} = \tilde{x} = 0$,
$t = \bar{t} = \tilde{t} = 0$ at the moment of departure.

(c) What are the coordinates (x, t) of the jump (from outbound to inbound sidewalk) in S?

(d) What are the coordinates $(\bar{x}, \bar{t})$ of the jump in $\bar{S}$?

(e) What are the coordinates $(\tilde{x}, \tilde{t})$ of the jump in $\tilde{S}$?

(f) If the traveling twin wants her watch to agree with the clock in $\tilde{S}$, how must she reset it
immediately after the jump? What does her watch then read when she gets home? (This
wouldn’t change her age, of course; she’s still 39. It would just make her watch agree
with the standard synchronization in $\tilde{S}$.)

(g) If the traveling twin is asked the question, “How old is your brother right now?”, what is
the correct reply (i) just before she makes the jump, and (ii) just after she makes the jump?
(Nothing dramatic happens to her brother during the split second between (i) and (ii), of
course; what does change abruptly is his sister’s notion of what “right now, back home”
means.)

(h) How many Earth years does the return trip take? Add this to (ii) from (g) to determine how
old she expects him to be at their reunion. Compare your answer to (a).


12.1.4 The Structure of Space-Time 


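(i) Four-vectors. The Lorentz transformations take on a simpler appearance
when expressed in terms of the quantities

$$ x^0 \equiv ct, \qquad \beta \equiv \frac{v}{c}. \tag{12.21} $$

Using $x^0$ (instead of t) and $\beta$ (instead of v) amounts to changing the unit
of time from the second to the meter: 1 meter of $x^0$ corresponds to the time
it takes light to travel 1 meter (in vacuum). If, at the same time, we number
the x, y, z coordinates, so that

$$ x^1 = x, \qquad x^2 = y, \qquad x^3 = z, \tag{12.22} $$

then the Lorentz transformations read

$$
\begin{aligned}
\bar{x}^0 &= \gamma(x^0 - \beta x^1), \\
\bar{x}^1 &= \gamma(x^1 - \beta x^0), \\
\bar{x}^2 &= x^2, \\
\bar{x}^3 &= x^3,
\end{aligned}
\tag{12.23}
$$

or, in matrix form,

$$
\begin{pmatrix} \bar{x}^0 \\ \bar{x}^1 \\ \bar{x}^2 \\ \bar{x}^3 \end{pmatrix}
=
\begin{pmatrix}
\gamma & -\gamma\beta & 0 & 0 \\
-\gamma\beta & \gamma & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}
\begin{pmatrix} x^0 \\ x^1 \\ x^2 \\ x^3 \end{pmatrix}.
\tag{12.24}
$$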


Letting Greek indices run from 0 to 3, this can be distilled into a single
equation:

$$ \bar{x}^\mu = \sum_{\nu=0}^{3} \left(\Lambda^\mu_{\ \nu}\right) x^\nu, \tag{12.25} $$

where $\Lambda$ is the Lorentz transformation matrix in Eq. 12.24 (the superscript
$\mu$ labels the row, the subscript $\nu$ labels the column). One virtue of writing
things in this abstract manner is that we can handle in the same format
a more general transformation, in which the relative motion is not along
a common $x\,\bar{x}$-axis; the matrix $\Lambda$ would be more complicated, but the
structure of Eq. 12.25 is unchanged.
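
Equation 12.25 is just a matrix acting on a column of four numbers, which the following sketch spells out with plain Python lists. The names boost_x and transform, and the sample event, are assumptions for illustration; the matrix is the one in Eq. 12.24, for motion along the common x-axis.

```python
import math

def boost_x(beta):
    """The matrix Lambda of Eq. 12.24, as a list of rows."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return [
        [g, -g * beta, 0.0, 0.0],
        [-g * beta, g, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],
    ]

def transform(lam, x):
    """Eq. 12.25: x_bar^mu = sum over nu of Lambda^mu_nu x^nu."""
    return [sum(lam[mu][nu] * x[nu] for nu in range(4)) for mu in range(4)]

event = [5.0, 3.0, 1.0, 2.0]   # (x^0, x^1, x^2, x^3), with x^0 = ct
print(transform(boost_x(0.6), event))
```

Because transform never looks inside the matrix, the same function would serve for a boost in any other direction; only the entries of the matrix would change.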

If this reminds you of the rotations we studied in Section 1.1.5, it’s no 
accident. There we were concerned with the change in components when 
you switch to a rotated coordinate system; here we are interested in the 
change of components when you go to a moving system. In Chapter 1 
we defined a (3-)vector as any set of three components that transform 
under rotations the same way (x, y, z) do; by extension, we now define a
4-vector as any set of four components that transform in the same manner
as $(x^0, x^1, x^2, x^3)$ under Lorentz transformations:

$$ \bar{a}^\mu = \sum_{\nu=0}^{3} \Lambda^\mu_{\ \nu}\, a^\nu. \tag{12.26} $$

For the particular case of a transformation along the x-axis,

$$
\begin{aligned}
\bar{a}^0 &= \gamma(a^0 - \beta a^1), \\
\bar{a}^1 &= \gamma(a^1 - \beta a^0), \\
\bar{a}^2 &= a^2, \\
\bar{a}^3 &= a^3.
\end{aligned}
\tag{12.27}
$$

There is a 4-vector analog to the dot product ($\mathbf{A} \cdot \mathbf{B} = A_x B_x + A_y B_y + A_z B_z$),
but it’s not just the sum of the products of like components; rather,
the zeroth components have a minus sign:

$$ -a^0 b^0 + a^1 b^1 + a^2 b^2 + a^3 b^3. \tag{12.28} $$

This is the four-dimensional scalar product; you should check for yourself
(Prob. 12.17) that it has the same value in all inertial systems:

$$ -\bar{a}^0 \bar{b}^0 + \bar{a}^1 \bar{b}^1 + \bar{a}^2 \bar{b}^2 + \bar{a}^3 \bar{b}^3 = -a^0 b^0 + a^1 b^1 + a^2 b^2 + a^3 b^3. \tag{12.29} $$

Just as the ordinary dot product is invariant (unchanged) under rotations,
this combination is invariant under Lorentz transformations.¹¹

To keep track of the minus sign, it is convenient to introduce the
covariant vector $a_\mu$, which differs from the contravariant $a^\mu$ only in the sign
of the zeroth component:

$$ a_\mu = (a_0, a_1, a_2, a_3) \equiv (-a^0, a^1, a^2, a^3). \tag{12.30} $$

You must be scrupulously careful about the placement of indices in this
business: upper indices designate contravariant vectors; lower indices are
for covariant vectors. Raising or lowering the temporal index costs a minus
sign ($a_0 = -a^0$); raising or lowering a spatial index changes nothing ($a_1 = a^1$,
$a_2 = a^2$, $a_3 = a^3$). Formally,

$$
a_\mu = \sum_{\nu=0}^{3} g_{\mu\nu}\, a^\nu,
\quad \text{where} \quad
g_{\mu\nu} \equiv
\begin{pmatrix}
-1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}
\tag{12.31}
$$

is the (Minkowski) metric.¹²

The scalar product can now be written with the summation symbol,

$$ \sum_{\mu=0}^{3} a_\mu b^\mu, $$

or, more compactly still,

$$ a_\mu b^\mu. \tag{12.32} $$

(Summation is implied whenever a Greek index is repeated in a product,
once as a covariant index and once as contravariant. This is called the
Einstein summation convention, after its inventor, who regarded it as one
of his most important contributions.) Of course, we could just as well take
care of the minus sign by switching to covariant b:

$$ a_\mu b^\mu = a^\mu b_\mu = -a^0 b^0 + a^1 b^1 + a^2 b^2 + a^3 b^3. \tag{12.33} $$
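
The index gymnastics can be mimicked in a few lines, which also gives a numerical spot check of the invariance claimed in Eq. 12.29 for a boost along x. This is a sketch under stated assumptions: the names METRIC, lower, dot4, and boost_x, the two sample 4-vectors, and the choice beta = 0.6 are all made up for the illustration. One numerical example is of course not a proof.

```python
import math

METRIC = [-1.0, 1.0, 1.0, 1.0]   # diagonal entries of g_{mu nu}, Eq. 12.31

def lower(a):
    """Covariant components a_mu from contravariant components a^mu."""
    return [g * comp for g, comp in zip(METRIC, a)]

def dot4(a, b):
    """Scalar product a_mu b^mu (Eq. 12.33), inputs given with upper indices."""
    return sum(a_lo * b_up for a_lo, b_up in zip(lower(a), b))

def boost_x(a, beta):
    """Eq. 12.27: components of a 4-vector in a frame moving along x."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return [g * (a[0] - beta * a[1]), g * (a[1] - beta * a[0]), a[2], a[3]]

a = [2.0, 1.0, 0.5, -1.0]
b = [3.0, -2.0, 4.0, 1.0]
print(dot4(a, b))
print(dot4(boost_x(a, 0.6), boost_x(b, 0.6)))   # same value, up to rounding
```

Lowering the index on b instead of a gives the same number, since the metric is diagonal and symmetric.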


Problem 12.17. Check Eq. 12.29, using Eq. 12.27. [This only proves the invariance of the scalar 
product for transformations along the x direction. But the scalar product is also invariant under 
rotations, since the first term is not affected at all, and the last three constitute the
three-dimensional dot product $\mathbf{a} \cdot \mathbf{b}$. By a suitable rotation, the x direction can be aimed any way
you please, so the four-dimensional scalar product is actually invariant under arbitrary Lorentz 
transformations.] 


11 In relativity, a scalar is a quantity that is invariant under Lorentz transformations as well 
as rotations. 

12 It doesn’t matter whether you define the scalar product as in Eq. 12.28 ($-a^0 b^0 + \mathbf{a} \cdot \mathbf{b}$) or
with an overall minus sign ($a^0 b^0 - \mathbf{a} \cdot \mathbf{b}$); if one is invariant, so is the other. In the literature,
both conventions are common, and you just have to be aware of which one is in use. If they
write the diagonal components of the Minkowski metric as (−, +, +, +), they are using the
convention in Eq. 12.28; otherwise they will write (+, −, −, −).


Problem 12.18.


(a) Write out the matrix that describes a Galilean transformation (Eq. 12.12). 

(b) Write out the matrix describing a Lorentz transformation along the y-axis. 

(c) Find the matrix describing a Lorentz transformation with velocity v along the x-axis
followed by a Lorentz transformation with velocity $\bar{v}$ along the y-axis. Does it matter in
what order the transformations are carried out? 


Problem 12.19. The parallel between rotations and Lorentz transformations is even more 
striking if we introduce the rapidity: 


$$ \theta \equiv \tanh^{-1}(v/c). \tag{12.34} $$


(a) Express the Lorentz transformation matrix $\Lambda$ (Eq. 12.24) in terms of $\theta$, and compare it to
the rotation matrix (Eq. 1.29).


In some respects, rapidity is a more natural way to describe motion than velocity.¹³ For one thing,
it ranges from $-\infty$ to $+\infty$, instead of −c to +c. More significantly, rapidities add, whereas
velocities do not.


(b) Express the Einstein velocity addition law in terms of rapidity. 


(ii) The invariant interval. The scalar product of a 4-vector with itself,
$a^\mu a_\mu = -(a^0)^2 + (a^1)^2 + (a^2)^2 + (a^3)^2$, can be positive (if the “spatial” terms
dominate) or negative (if the “temporal” term dominates) or zero:

If $a^\mu a_\mu > 0$, $a^\mu$ is called spacelike.
If $a^\mu a_\mu < 0$, $a^\mu$ is called timelike.
If $a^\mu a_\mu = 0$, $a^\mu$ is called lightlike.

Suppose event A occurs at $(x_A^0, x_A^1, x_A^2, x_A^3)$, and event B at $(x_B^0, x_B^1, x_B^2, x_B^3)$.
The difference,

$$ \Delta x^\mu \equiv x_A^\mu - x_B^\mu, \tag{12.35} $$

is the displacement 4-vector. The scalar product of $\Delta x^\mu$ with itself is called
the invariant interval between two events:

$$ I \equiv (\Delta x)^\mu (\Delta x)_\mu = -(\Delta x^0)^2 + (\Delta x^1)^2 + (\Delta x^2)^2 + (\Delta x^3)^2 = -c^2 t^2 + d^2, \tag{12.36} $$

where t is the time difference between the two events and d is their spatial
separation. When you transform to a moving system, the time between A
and B is altered ($\bar{t} \neq t$), and so is the spatial separation ($\bar{d} \neq d$), but the
interval I remains the same.

If the displacement between two events is timelike (I < 0), there exists an
inertial system (accessible by Lorentz transformation) in which they occur
at the same point. For if I hop on a train going from A to B at the speed
v = d/t, leaving event A when it occurs, I shall be just in time to pass B
when it occurs; in the train system, A and B take place at the same point.
You cannot do this for a spacelike interval, of course, because v would have
to be greater than c, and no observer can exceed the speed of light ($\gamma$ would
be imaginary and the Lorentz transformations would be nonsense). On the
other hand, if the displacement is spacelike (I > 0), then there exists a
system in which the two events occur at the same time (see Prob. 12.21).
And if the displacement is lightlike (I = 0), then the two events could be
connected by a light signal.

13 E. F. Taylor and J. A. Wheeler, Spacetime Physics, 1st ed., W. H. Freeman, San Francisco
(1966).
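
The classification by sign is easy to automate. In the sketch below, events are given as tuples (ct, x, y, z); the function names interval and classify and the three sample events are assumptions made for the illustration. The samples use whole numbers so that the comparison with zero is exact; with measured or floating-point data you would want a tolerance instead.

```python
def interval(event_a, event_b):
    """Invariant interval (Eq. 12.36) between events given as (ct, x, y, z)."""
    dx = [p - q for p, q in zip(event_a, event_b)]
    return -dx[0] ** 2 + dx[1] ** 2 + dx[2] ** 2 + dx[3] ** 2

def classify(i):
    """Name the displacement according to the sign of the interval."""
    if i < 0:
        return "timelike"
    if i > 0:
        return "spacelike"
    return "lightlike"

origin = (0, 0, 0, 0)
for event in [(5, 3, 0, 0), (3, 5, 0, 0), (4, 4, 0, 0)]:
    i = interval(event, origin)
    print(event, i, classify(i))
```

For the first (timelike) sample, the train of the argument above would need speed v = d/t = (3/5)c, comfortably below c; for the second, the same recipe would demand (5/3)c, which is why it fails for spacelike separations.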


Problem 12.20. 


(a) Event A happens at point ($x_A = 5$, $y_A = 3$, $z_A = 0$) and at time $t_A$ given by $ct_A = 15$;
event B occurs at (10, 8, 0) and $ct_B = 5$, both in system S.


(i) What is the invariant interval between A and B? 
(ii) Is there an inertial system in which they occur simultaneously? If so, find its velocity 
(magnitude and direction) relative to S. 
(iii) Is there an inertial system in which they occur at the same point? If so, find its velocity 
relative to S. 


(b) Repeat part (a) for A = (2,0, 0), ct = 1; and for B = (5,0, 0), ct = 3. 


Problem 12.21. The coordinates of event A are ($x_A$, 0, 0), $t_A$, and the coordinates of event
B are ($x_B$, 0, 0), $t_B$. Assuming the displacement between them is spacelike, find the velocity
of the system in which they are simultaneous. 


(iii) Space-time diagrams. If you want to represent the motion of a
particle graphically, the normal practice is to plot the position versus time
(that is, x runs vertically and t horizontally). On such a graph, the velocity
can be read off as the slope of the curve. For some reason, the convention
is reversed in relativity: everyone plots position horizontally and time (or,
better, $x^0 = ct$) vertically. Velocity is then given by the reciprocal of the slope.
A particle at rest is represented by a vertical line; a photon, traveling at
the speed of light, is described by a 45° line; and a rocket going at some
intermediate speed follows a line of slope $c/v = 1/\beta$ (Fig. 12.21). We call such
plots Minkowski diagrams.

The trajectory of a particle on a Minkowski diagram is called its world
line. Suppose you set out from the origin at time t = 0. Because no material
object can travel faster than light, your world line can never have a slope less
than 1. Accordingly, your motion is restricted to the wedge-shaped region
bounded by the two 45° lines (Fig. 12.22). We call this your “future,” in
the sense that it is the locus of all points accessible to you. Of course, as
time goes on, and you move along your chosen world line, your options
progressively narrow: your “future” at any moment is the forward “wedge”
constructed at whatever point you find yourself. Meanwhile, the backward
wedge represents your “past”; it is the locus of all points from which
you might have come. As for the rest (the region outside the forward and
backward wedges), this is the generalized “present.” You can’t get there,
and you didn’t come from there. In fact, there’s no way you can influence
any event in the present (the message would have to travel faster than light);
it’s a vast expanse of space-time that is absolutely inaccessible to you.

[Figs. 12.21 and 12.22: Minkowski diagrams, with labels “Particle at rest” and “Your future at t”.]

I’ve been ignoring the y and z directions. If we include a y-axis coming 
out of the page, the “wedges” become cones — and, with an undrawable 
z-axis, hyper-cones. Because their boundaries are the trajectories of light 
rays, we call them the forward light cone and the backward light cone. Your 
future, in other words, lies within your forward light cone, your past within 
your backward light cone. 

Notice that the slope of the line connecting two events on a space-time 
diagram tells you at a glance whether the displacement between them is 
timelike (slope greater than 1), spacelike (slope less than 1), or lightlike 
(slope 1). For example, all points in the past and future are timelike with 
respect to your present location, whereas points in the present are spacelike, 
and points on the light cone are lightlike. 
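
The classification just described can be sketched in a few lines of Python. This is only an illustration, not part of the text's development: the function name `classify_interval` and the tolerance `tol` are assumptions introduced here, and the sign convention $I = (\Delta x)^2 - (c\,\Delta t)^2$ is the one used for the invariant interval in this chapter.

```python
def classify_interval(dx, dct, tol=1e-12):
    """Classify the displacement between two events from dx and c*dt (same units).

    Uses I = dx**2 - dct**2: negative is timelike, positive is spacelike,
    zero (within tol) is lightlike.
    """
    interval = dx**2 - dct**2
    if abs(interval) <= tol:
        return "lightlike"
    return "timelike" if interval < 0 else "spacelike"


# On a Minkowski diagram the slope of the connecting line is dct/dx.
print(classify_interval(dx=4.0, dct=1.0))  # slope 1/4, less than 1
print(classify_interval(dx=1.0, dct=3.0))  # slope 3, greater than 1
print(classify_interval(dx=2.0, dct=2.0))  # slope exactly 1
```

Comparing $|c\,\Delta t|$ with $|\Delta x|$ directly avoids dividing by zero for a vertical line (two events at the same place).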

Hermann Minkowski, who was the first to recognize the full geometrical 
significance of special relativity, began a famous lecture in 1908 with the 
words “Henceforth space by itself, and time by itself, are doomed to fade 
away into mere shadows, and only a kind of union of the two will preserve 
an independent reality.”¹⁴ It’s a lovely thought, but you must be careful not
to read too much into it. For it is not at all the case that time is “just 
another coordinate, on the same footing with x, y, and z” (except that 


14 A. Einstein et al., The Principle of Relativity, Dover, New York (1923), Chap. V. Minkowski 
had been Einstein’s mathematics professor, and he did not initially have a high regard for 
his former student: “Oh, that Einstein, always cutting lectures ..., I really would not have 
believed him capable of it.” Quoted by D. Overbye, Einstein in Love, Viking, New York 
(2000), p. 158. 

for obscure reasons we measure it on clocks instead of rulers). No: Time 
is utterly different from the others, and the mark of its distinction is the 
minus sign in the invariant interval. That minus sign imparts to space-time 
a hyperbolic geometry that is much richer than the circular geometry of 
3-space. 

Under rotations about the z-axis, a point P in the xy plane describes a 
circle: the locus of all points a fixed distance $r = \sqrt{x^2 + y^2}$ from the origin
(Fig. 12.23). Under Lorentz transformations, however, it is the interval
$I = (x^2 - c^2t^2)$ that is preserved, and the locus of all points with a given
value of I is a hyperbola — or, if we include the y-axis, a hyperboloid of 
revolution. When the displacement is timelike, it’s a “hyperboloid of two 
sheets” (Fig. 12.24a); when the displacement is spacelike, it’s a “hyperboloid 
of one sheet” (Fig. 12.24b). When you perform a Lorentz transformation 
(that is, when you go into a moving inertial system), the coordinates $(x, t)$
of a given event will change to $(\bar{x}, \bar{t})$, but these new coordinates will lie
on the same hyperbola as $(x, t)$. By appropriate combinations of Lorentz
transformations and rotations, a spot can be moved around at will over the 
surface of a given hyperboloid, but no amount of transformation will carry 
it, say, from the upper sheet of the timelike hyperboloid to the lower sheet, 
or to a spacelike hyperboloid. 
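
As a numerical illustration of this invariance, here is a minimal Python sketch. The helper names `boost` and `interval`, and the sample points, are assumptions chosen for the example. It boosts one timelike point through several velocities and prints the interval each time; the interval stays fixed and the time coordinate keeps its sign.

```python
import math


def boost(x, ct, beta):
    """Lorentz-transform (x, ct) to a frame moving at speed beta*c along x."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return gamma * (x - beta * ct), gamma * (ct - beta * x)


def interval(x, ct):
    """Invariant interval I = x^2 - (ct)^2."""
    return x**2 - ct**2


x, ct = 1.0, 2.0  # a timelike point on the upper sheet (ct > 0), I = -3
for beta in (-0.9, -0.5, 0.0, 0.5, 0.9):
    xb, ctb = boost(x, ct, beta)
    print(f"beta = {beta:+.1f}: x = {xb:+.4f}, ct = {ctb:+.4f}, I = {interval(xb, ctb):+.4f}")

# For a spacelike point the sign of ct is frame dependent.
print(boost(2.0, 1.0, 0.9))
```
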
When we were discussing simultaneity, I showed that the time ordering
of two events can, at least in certain cases, be reversed, simply by going 
into a moving system. But we now see that this is not always possible: if 
the displacement 4-vector between two events is timelike, their ordering is 
absolute; if the interval is spacelike, their ordering depends on the inertial 
system from which they are observed. In terms of the space-time diagram, an 
event on the upper sheet of a timelike hyperboloid definitely occurred after 
(0, 0), and one on the lower sheet certainly occurred before; but an event on 
a spacelike hyperboloid occurred at positive t, or negative t, depending on 
your reference frame. This is not an idle curiosity; it rescues the notion of 
causality, on which all physics is based. If it were always possible to reverse 
the order of two events, then we could never say “A caused B,” since a rival 
observer would retort that B preceded A. This embarrassment is avoided, 
provided the two events are timelike or lightlike separated. And causally 
related events are — otherwise no influence could travel from one to the 
other. Conclusion: The displacement between causally related events is always 
timelike, and their temporal ordering is the same for all inertial observers. 


Problem 12.22. 


(a) Draw a space-time diagram representing a game of catch (or a conversation) between two 
people at rest, 10 ft apart. How is it possible for them to communicate, given that their 
separation is spacelike? 

(b) There's an old limerick that runs as follows: 


There once was a girl named Ms Bright, 
Who could travel much faster than light. 
She departed one day, 

The Einsteinian way, 

And returned on the previous night. 


What do you think? Even if she could travel faster than light, could she return before she set 
out? Could she arrive at some intermediate destination before she set out? Draw a space-time 
diagram representing this trip. 


Problem 12.23. Inertial system $\bar{S}$ moves in the x direction at speed $\frac{3}{5}c$ relative to system S.
(The $\bar{x}$-axis slides along the x-axis, and the origins coincide at $t = \bar{t} = 0$, as usual.)


(a) On graph paper set up a Cartesian coordinate system with axes ct and x. Carefully draw in
lines representing $\bar{x}$ = −3, −2, −1, 0, 1, 2, and 3. Also draw in the lines corresponding
to $c\bar{t}$ = −3, −2, −1, 0, 1, 2, and 3. Label your lines clearly.

(b) In $\bar{S}$, a free particle is observed to travel from the point $\bar{x} = -2$ at time $c\bar{t} = -2$ to the
point $\bar{x} = 2$ at $c\bar{t} = +3$. Indicate this displacement on your graph. From the slope of this
line, determine the particle's speed in S.

(c) Use the velocity addition rule to determine the velocity in S algebraically, and check that 
your answer is consistent with the graphical solution in (b). 
12.2 Relativistic Mechanics


12.2.1 Proper Time and Proper Velocity 


As you progress along your world line, your watch runs slow; while the clock
on the wall ticks off an interval dt, your watch only advances $d\tau$:


$$d\tau = \sqrt{1 - u^2/c^2}\,dt. \tag{12.37}$$


(I’ll use u for the velocity of a particular object, you in this instance,
and reserve v for the relative velocity of two inertial systems.) The time
$\tau$ your watch registers (or, more generally, the time associated with the
moving object) is called proper time. (The word suggests a mistranslation
of the French “propre,” meaning “own.”) In some cases, $\tau$ may be a more
relevant or useful quantity than t. For one thing, proper time is invariant,
whereas “ordinary” time depends on the particular reference frame you
have in mind.

Now, imagine you're on a flight to Los Angeles, and the pilot announces 
that the plane’s velocity is $\frac{4}{5}c$. What precisely does he mean by “velocity”?
Well, of course, he means the displacement divided by the time:


$$\mathbf{u} = \frac{d\mathbf{l}}{dt}, \tag{12.38}$$


and, since he is presumably talking about the velocity relative to ground, 
both dl and dt are to be measured by the ground observer. That’s the 
important number to know, if you’re concerned about getting there on time 
for your appointment, but if you’re wondering whether you'll be hungry on 
arrival, you might be more interested in the distance covered per unit proper 
time: 


$$\boldsymbol{\eta} \equiv \frac{d\mathbf{l}}{d\tau}. \tag{12.39}$$


This hybrid quantity (distance measured on the ground, over time measured 
in the airplane) is called proper velocity; for contrast, I’ll call u the ordinary
velocity. The two are related by Eq. 12.37:


$$\boldsymbol{\eta} = \frac{1}{\sqrt{1 - u^2/c^2}}\,\mathbf{u}. \tag{12.40}$$

For speeds much less than c, of course, the difference between ordinary and 
proper velocity is negligible. 
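
To get a feel for the size of the difference, here is a short Python sketch that evaluates the ratio of proper to ordinary speed from Eq. 12.40. The function name `proper_speed` and the sample speeds are illustrative assumptions, not values from the text.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def proper_speed(u):
    """Magnitude of the proper velocity, eta = u / sqrt(1 - u^2/c^2) (Eq. 12.40)."""
    return u / math.sqrt(1.0 - (u / C) ** 2)


samples = [("airliner", 250.0), ("0.1 c", 0.1 * C), ("0.8 c", 0.8 * C), ("0.99 c", 0.99 * C)]
for label, u in samples:
    eta = proper_speed(u)
    print(f"{label:>9}: u = {u:.4e} m/s, eta/u = {eta / u:.12f}")
```

At everyday speeds the ratio differs from 1 only far out in the decimal places, while at 0.8c it is already 5/3.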

From a theoretical standpoint, however, proper velocity has an enormous
advantage over ordinary velocity: it transforms simply, when you go
from one inertial system to another. In fact, $\boldsymbol{\eta}$ is the spatial part of a 4-vector,


$$\eta^\mu \equiv \frac{dx^\mu}{d\tau}, \tag{12.41}$$
whose zeroth component is


$$\eta^0 = \frac{dx^0}{d\tau} = c\,\frac{dt}{d\tau} = \frac{c}{\sqrt{1 - u^2/c^2}}. \tag{12.42}$$

The numerator, $dx^\mu$, is a 4-vector, while the denominator, $d\tau$, is invariant.
Thus, for instance, when you go from system S to system $\bar{S}$, moving at speed
v along the common $x\,\bar{x}$-axis,


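$$\begin{aligned}
\bar{\eta}^0 &= \gamma(\eta^0 - \beta\eta^1),\\
\bar{\eta}^1 &= \gamma(\eta^1 - \beta\eta^0),\\
\bar{\eta}^2 &= \eta^2,\\
\bar{\eta}^3 &= \eta^3,
\end{aligned}
\qquad \beta = \frac{v}{c}, \quad \gamma = \frac{1}{\sqrt{1 - v^2/c^2}}. \tag{12.43}$$

More generally,

$$\bar{\eta}^\mu = \Lambda^\mu_{\ \nu}\,\eta^\nu; \tag{12.44}$$


$\eta^\mu$ is called the proper velocity 4-vector, or simply the 4-velocity.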
By contrast, the transformation rule for ordinary velocities is quite 
cumbersome, as we found in Ex. 12.6 and Prob. 12.14: 


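$$\begin{aligned}
\bar{u}_x &= \frac{d\bar{x}}{d\bar{t}} = \frac{u_x - v}{1 - vu_x/c^2},\\
\bar{u}_y &= \frac{d\bar{y}}{d\bar{t}} = \frac{u_y}{\gamma(1 - vu_x/c^2)},\\
\bar{u}_z &= \frac{d\bar{z}}{d\bar{t}} = \frac{u_z}{\gamma(1 - vu_x/c^2)},
\end{aligned} \tag{12.45}$$

or, more compactly,

$$\bar{u}_\parallel = \frac{u_\parallel - v}{1 - \mathbf{u}\cdot\mathbf{v}/c^2}, \qquad \bar{\mathbf{u}}_\perp = \frac{\mathbf{u}_\perp}{\gamma(1 - \mathbf{u}\cdot\mathbf{v}/c^2)} \tag{12.46}$$

(where $\parallel$ denotes the component parallel to the relative motion (v), and
$\perp$ denotes components perpendicular to v). The reason for the added
complexity is plain: we’re obliged to transform both the numerator dl and
the denominator dt, whereas for proper velocity, the denominator $d\tau$ is
invariant, so the ratio inherits the transformation rule of the numerator
alone.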
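
The two routes can be compared numerically. The sketch below works in units with c = 1, and every function name and sample velocity in it is an assumption made for illustration. It boosts the 4-velocity with Eq. 12.43, recovers the ordinary velocity as the ratio of spatial to zeroth components, and checks the result against Eq. 12.45.

```python
import math


def four_velocity(ux, uy, uz):
    """(eta0, eta1, eta2, eta3) for ordinary velocity u, in units with c = 1."""
    g = 1.0 / math.sqrt(1.0 - (ux**2 + uy**2 + uz**2))
    return (g, g * ux, g * uy, g * uz)


def boost_four_vector(a, beta):
    """Eq. 12.43: transform a 4-vector to a frame moving at speed beta along x."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    a0, a1, a2, a3 = a
    return (gamma * (a0 - beta * a1), gamma * (a1 - beta * a0), a2, a3)


def transform_ordinary_velocity(ux, uy, uz, beta):
    """Eq. 12.45 with c = 1 and v = beta."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    denom = 1.0 - beta * ux
    return ((ux - beta) / denom, uy / (gamma * denom), uz / (gamma * denom))


u = (0.5, 0.3, 0.0)  # sample ordinary velocity in S
beta = 0.6           # speed of the barred system relative to S

eta_bar = boost_four_vector(four_velocity(*u), beta)
u_from_eta = tuple(component / eta_bar[0] for component in eta_bar[1:])  # u = eta / eta0 when c = 1
u_direct = transform_ordinary_velocity(*u, beta)

print("via 4-velocity:", u_from_eta)
print("via Eq. 12.45: ", u_direct)
```

The 4-vector route needs only one linear transformation; the awkward denominators of Eq. 12.45 appear only at the end, when the ratio is taken.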


Problem 12.24. 


(a) Equation 12.40 defines proper velocity in terms of ordinary velocity. Invert that equation to
get the formula for u in terms of $\boldsymbol{\eta}$.

(b) What is the relation between proper velocity and rapidity (Eq. 12.34)? Assume the velocity
is along the x direction, and find $\eta$ as a function of $\theta$.


Problem 12.25. A car is traveling along the 45° line in S (Fig. 12.25), at (ordinary) speed
$(2/\sqrt{5})c$.


Fig. 12.25


(a) Find the components $u_x$ and $u_y$ of the (ordinary) velocity.
(b) Find the components $\eta_x$ and $\eta_y$ of the proper velocity.
(c) Find the zeroth component of the 4-velocity, $\eta^0$.


System $\bar{S}$ is moving in the x direction with (ordinary) speed $\sqrt{2/5}\,c$, relative to S. By using the
appropriate transformation laws:


(d) find the (ordinary) velocity components $\bar{u}_x$ and $\bar{u}_y$ in $\bar{S}$;
(e) find the proper velocity components $\bar{\eta}_x$ and $\bar{\eta}_y$ in $\bar{S}$; and
(f) as a consistency check, verify that

$$\bar{\boldsymbol{\eta}} = \frac{\bar{\mathbf{u}}}{\sqrt{1 - \bar{u}^2/c^2}}.$$

• Problem 12.26. Find the invariant product of the 4-velocity with itself, $\eta^\mu\eta_\mu$. Is $\eta^\mu$ timelike,
spacelike, or lightlike?


Problem 12.27. A police officer pulls you over and asks what speed you were going. “Well, 
officer, I cannot tell a lie: the speedometer read $4 \times 10^8$ m/s.” You get a ticket, because the
speed limit on this highway is $2.5 \times 10^8$ m/s. In court, your lawyer (who, luckily, has studied
physics) points out that a car's speedometer measures proper velocity, whereas the speed limit 
is ordinary velocity. Guilty, or innocent? 


Problem 12.28. Consider a particle in hyperbolic motion, 


$$x(t) = \sqrt{b^2 + (ct)^2}, \qquad y = z = 0.$$


(a) Find the proper time $\tau$ as a function of t, assuming the clocks are set so that $\tau = 0$ when
t = 0. [Hint: Integrate Eq. 12.37.]

(b) Find x and u = dx/dt (ordinary velocity) as functions of $\tau$.

(c) Find $\eta^\mu$ (proper velocity) as a function of $\tau$.


12.2.2 Relativistic Energy and Momentum 


In classical mechanics, momentum is mass times velocity. I would like to 
extend this definition to the relativistic domain, but immediately a question 
arises: Should I use ordinary velocity or proper velocity? In classical physics, 
$\boldsymbol{\eta}$ and u are identical, so there is no a priori reason to favor one over the
other. However, in the context of relativity it is essential that we use proper
velocity, for the law of conservation of momentum would be inconsistent
with the principle of relativity if we were to define momentum as mu (see 
Prob. 12.29). Thus 


$$\mathbf{p} \equiv m\boldsymbol{\eta} = \frac{m\mathbf{u}}{\sqrt{1 - u^2/c^2}}; \tag{12.47}$$


this is the relativistic momentum of an object of mass m traveling at
(ordinary) velocity u.¹⁵
Relativistic momentum is the spatial part of a 4-vector,


$$p^\mu \equiv m\eta^\mu, \tag{12.48}$$

and it is natural to ask what the temporal component,

$$p^0 = m\eta^0 = \frac{mc}{\sqrt{1 - u^2/c^2}}, \tag{12.49}$$

represents. Einstein identified $p^0c$ as relativistic energy:

$$E \equiv \frac{mc^2}{\sqrt{1 - u^2/c^2}}; \tag{12.50}$$


$p^\mu$ is called the energy-momentum 4-vector (or the momentum 4-vector, for
short).

Notice that the relativistic energy is nonzero even when the object is 
stationary; we call this rest energy: 


$$E_{\text{rest}} \equiv mc^2. \tag{12.51}$$


The remainder, which is attributable to the motion, is kinetic energy, 


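$$E_{\text{kin}} \equiv E - mc^2 = mc^2\left[\frac{1}{\sqrt{1 - u^2/c^2}} - 1\right]. \tag{12.52}$$

In the nonrelativistic régime ($u \ll c$) the square root can be expanded in
powers of $u^2/c^2$, giving


$$E_{\text{kin}} = \frac{1}{2}mu^2 + \frac{3}{8}\frac{mu^4}{c^2} + \cdots; \tag{12.53}$$

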


the leading term reproduces the classical formula. 
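
A quick numerical comparison shows how fast the expansion converges. In this Python sketch the energies are expressed in units of $mc^2$ and $\beta = u/c$; the function names and sample speeds are assumptions chosen for illustration.

```python
import math


def kinetic_exact(beta):
    """E_kin / (m c^2) from Eq. 12.52, with beta = u/c."""
    return 1.0 / math.sqrt(1.0 - beta**2) - 1.0


def kinetic_series(beta):
    """First two terms of Eq. 12.53 in units of m c^2: beta^2/2 + 3 beta^4/8."""
    return 0.5 * beta**2 + 0.375 * beta**4


for beta in (0.01, 0.1, 0.3, 0.6):
    exact = kinetic_exact(beta)
    series = kinetic_series(beta)
    classical = 0.5 * beta**2
    print(f"u/c = {beta:4.2f}: exact = {exact:.6e}, two terms = {series:.6e}, classical = {classical:.6e}")
```
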
So far, this is all just notation. The physics resides in the experimental 
fact that E and p, as defined by Eqs. 12.47 and 12.50, are conserved: 


In every isolated¹⁶ system, the total relativistic energy and momentum
are conserved. 


Mass is not conserved — a fact that has been painfully familiar to everyone 
since 1945 (though the so-called “conversion of mass into energy” is really 
a conversion of rest energy into kinetic energy). 


15 Older treatments introduce the so-called relativistic mass, $m_r = m/\sqrt{1 - u^2/c^2}$, so $\mathbf{p}$ can be
written as $m_r\mathbf{u}$, but this unhelpful extra terminology has gone the way of the two-dollar bill.
16 If there are external forces at work, then (just as in the classical case) the energy and
momentum of the system itself will not, in general, be conserved.
Note the distinction between an invariant quantity (same value in all
inertial systems) and a conserved quantity (same value before and after some 
process). Mass is invariant but not conserved; energy is conserved but not 
invariant; electric charge is both conserved and invariant; velocity is neither 
conserved nor invariant. 

The scalar product of $p^\mu$ with itself is


$$p^\mu p_\mu = -(p^0)^2 + (\mathbf{p}\cdot\mathbf{p}) = -m^2c^2, \tag{12.54}$$


as you can quickly check using the result of Prob. 12.26. In terms of the
relativistic energy and momentum,


$$E^2 - p^2c^2 = m^2c^4. \tag{12.55}$$


This result is extremely useful, for it enables you to calculate E (if you know
p = |p|), or p (knowing E), without ever having to determine the velocity.¹⁷
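
As a sketch of how Eq. 12.55 is used in practice, the Python snippet below computes E directly from p and m and cross-checks it against Eqs. 12.47 and 12.50 for one sample speed. The function name `energy_from_momentum`, the choice of units (c = 1, m = 1), and the sample speed are assumptions for illustration.

```python
import math


def energy_from_momentum(p, m, c=1.0):
    """Eq. 12.55 solved for E: E = sqrt(p^2 c^2 + m^2 c^4)."""
    return math.sqrt((p * c) ** 2 + (m * c**2) ** 2)


# Cross-check for a particle with u = 0.8c, in units where c = 1 and m = 1.
m, u = 1.0, 0.8
gamma = 1.0 / math.sqrt(1.0 - u**2)
p = gamma * m * u  # Eq. 12.47
E = gamma * m      # Eq. 12.50 with c = 1

print("E from velocity:", E)
print("E from momentum:", energy_from_momentum(p, m))
```
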


Problem 12.29. 


(a) Repeat Prob. 12.2a using the (incorrect) definition p = mu, but with the (correct) Einstein
velocity addition rule. Notice that if momentum (so defined) is conserved in S, it is not
conserved in $\bar{S}$. Assume all motion is along the x-axis.

(b) Now do the same using the correct definition, $\mathbf{p} = m\boldsymbol{\eta}$. Notice that if momentum (so
defined) is conserved in S, it is automatically also conserved in $\bar{S}$. [Hint: Use Eq. 12.43 to
transform the proper velocity.] What must you assume about relativistic energy?


Problem 12.30. If a particle's kinetic energy is n times its rest energy, what is its speed?

Problem 12.31. Suppose you have a collection of particles, all moving in the x direction, with
energies $E_1, E_2, E_3, \ldots$ and momenta $p_1, p_2, p_3, \ldots$. Find the velocity of the center of
momentum frame, in which the total momentum is zero.


12.2.3 Relativistic Kinematics 


In this section we'll explore some applications of the conservation laws. 


Example 12.7. Two lumps of clay, each of (rest) mass m, collide head-on
at $\frac{3}{5}c$ (Fig. 12.26). They stick together. Question: What is the mass
(M) of the composite lump? 


17 Equations 12.54 and 12.55 apply to a single particle of mass m. If you're talking about the
total energy and momentum of a collection of particles, $p^\mu p_\mu$ is still an invariant, and you
can use it to define the so-called invariant mass ($\sqrt{-p^\mu p_\mu/c^2}$) of the system, but this will not
(in general) be the sum of the individual masses.
[Fig. 12.26: (Before) two lumps of mass m approaching each other at (3/5)c; (After) one lump of mass M.]


Solution. In this case conservation of momentum is trivial: zero 
before, zero after. The energy of each lump prior to the collision is 


$$\frac{mc^2}{\sqrt{1 - (3/5)^2}} = \frac{5}{4}mc^2,$$

and the energy of the composite lump after the collision is $Mc^2$ (since
it’s at rest). So conservation of energy says


$$\frac{5}{4}mc^2 + \frac{5}{4}mc^2 = Mc^2,$$


and hence


$$M = \frac{5}{2}m.$$


Notice that this is greater than the sum of the initial masses! Mass was 
not conserved in this collision; kinetic energy was converted into rest 
energy, so the mass increased. 

In the classical analysis of such a collision, we say that kinetic 
energy was converted into thermal energy — the composite lump is 
hotter than the two colliding pieces. This is, of course, true in the 
relativistic picture too. But what is thermal energy? It’s the sum total 
of the random kinetic and potential energies of all the atoms and 
molecules in the substance. Relativity tells us that these internal energies
are reflected in the mass of the composite object: a hot potato is
heavier than a cold potato, and a compressed spring is heavier than a
relaxed spring. Not by much, it’s true: internal energy (U) contributes
an amount $U/c^2$ to the mass, and $c^2$ is a very large number by everyday
standards. You could never get two lumps of clay going anywhere near
fast enough to detect the nonconservation of mass in their collision.
But in the realm of elementary particles, the effect can be very striking.
For example, when the neutral pi meson (mass $2.4 \times 10^{-28}$ kg) decays
into an electron and a positron (each of mass $9.11 \times 10^{-31}$ kg), the
rest energy is converted almost entirely into kinetic energy — less than 
1 percent of the original mass remains. 
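
The arithmetic in this example is easy to check. The Python sketch below redoes the energy bookkeeping for the two lumps and evaluates the surviving mass fraction in the pion decay, using the masses quoted in the example; the variable names are assumptions introduced for illustration.

```python
import math


def gamma(beta):
    """Lorentz factor for speed beta*c."""
    return 1.0 / math.sqrt(1.0 - beta**2)


# Two lumps of mass m, each moving at (3/5)c, stick together.
m = 1.0                   # work in units of the lump mass
M = 2 * gamma(3 / 5) * m  # energy conservation: 2 * gamma * m c^2 = M c^2
print(f"M = {M:.3f} m")

# Neutral pion decaying into an electron and a positron.
m_pion = 2.4e-28       # kg, as quoted in the example
m_electron = 9.11e-31  # kg, as quoted in the example
fraction_remaining = 2 * m_electron / m_pion
print(f"fraction of the original mass remaining: {fraction_remaining:.4f}")
```
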


In classical mechanics, there’s no such thing as a massless (m = 0) particle:
its kinetic energy ($\frac{1}{2}mu^2$) and its momentum (mu) would be zero, you
couldn’t apply a force to it (F = ma), and hence (by Newton’s third law) 
it couldn’t exert a force on anything else — it’s a cipher, as far as physics 
is concerned. You might at first assume that the same is true in relativity; 
after all, p and E are still proportional to m. However, a closer inspection of 
Eqs. 12.47 and 12.50 reveals a loophole worthy of a congressman: if u = c, 
then the zero in the numerator is balanced by a zero in the denominator,
leaving p and E indeterminate (zero over zero). It is just conceivable, 
therefore, that a massless particle could carry energy and momentum, 
provided it always travels at the speed of light. Although Eqs. 12.47 and 12.50 
would no longer suffice to determine E and p, Eq. 12.55 suggests that the 
two should be related by 


$$E = pc. \tag{12.56}$$


Personally, I would regard this argument as a joke, were it not for the fact 
that at least one massless particle is known to exist in nature: the photon.¹⁸
Photons do travel at the speed of light, and they obey Eq. 12.56.¹⁹ They
force us to take the “loophole” seriously. (By the way, you might ask what 
distinguishes a photon with a lot of energy from one with very little — after 
all, they have the same mass (zero) and the same speed (c). Relativity offers 
no answer to this question; curiously, quantum mechanics does: According 
to the Planck formula, $E = h\nu$, where h is Planck’s constant and $\nu$ is the
frequency. A blue photon is more energetic than a red one!) 


Example 12.8. A pion at rest decays into a muon and a neutrino 
(Fig. 12.27). Find the energy of the outgoing muon, in terms of the 
two masses, $m_\pi$ and $m_\mu$ (assume $m_\nu = 0$).


Solution. In this case, 


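$$E_{\text{before}} = m_\pi c^2, \qquad \mathbf{p}_{\text{before}} = \mathbf{0},$$
$$E_{\text{after}} = E_\mu + E_\nu, \qquad \mathbf{p}_{\text{after}} = \mathbf{p}_\mu + \mathbf{p}_\nu.$$


Conservation of momentum requires that $\mathbf{p}_\nu = -\mathbf{p}_\mu$. Conservation of
energy says that


$$E_\mu + E_\nu = m_\pi c^2.$$


Now, $E_\nu = |\mathbf{p}_\nu|c$, by Eq. 12.56, whereas $|\mathbf{p}_\mu| = \sqrt{E_\mu^2 - m_\mu^2c^4}/c$, by
Eq. 12.55, so

$$E_\mu + \sqrt{E_\mu^2 - m_\mu^2c^4} = m_\pi c^2,$$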


[Fig. 12.27: (Before) pion at rest; (After) muon and neutrino moving apart in opposite directions.]


18 Until fairly recently, neutrinos were also assumed to be massless, but experiments in 1998
indicate that they in fact carry a (very small) mass.

19 The photon is the quantum of the electromagnetic field, and it is no accident that the
same ratio between energy and momentum holds for electromagnetic waves (see Eqs. 9.61
and 9.63).
from which it follows that

$$E_\mu = \frac{(m_\pi^2 + m_\mu^2)\,c^2}{2m_\pi}.$$
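
To see the result with numbers, here is a minimal Python sketch in units where c = 1 and energies are in MeV. The rest energies used (about 139.57 MeV for the charged pion and 105.66 MeV for the muon) are assumed standard values, not taken from the text, and the function name `muon_energy` is hypothetical. The sketch also confirms that the muon and neutrino energies add up to the pion rest energy.

```python
import math


def muon_energy(m_pi, m_mu):
    """Muon energy from pion decay at rest, c = 1: (m_pi^2 + m_mu^2) / (2 m_pi)."""
    return (m_pi**2 + m_mu**2) / (2.0 * m_pi)


# Assumed rest energies in MeV (not given in the text).
M_PI, M_MU = 139.57, 105.66

E_mu = muon_energy(M_PI, M_MU)
p = math.sqrt(E_mu**2 - M_MU**2)  # |p_mu| from Eq. 12.55 with c = 1
E_nu = p                          # massless neutrino, Eq. 12.56, with |p_nu| = |p_mu|

print(f"E_mu = {E_mu:.2f} MeV")
print(f"E_nu = {E_nu:.2f} MeV")
print(f"total = {E_mu + E_nu:.2f} MeV (pion rest energy {M_PI} MeV)")
```
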


In a classical collision, momentum and mass are always conserved, 
whereas kinetic energy, in general, is not. A “sticky” collision generates heat 
at the expense of kinetic energy; an “explosive” collision generates kinetic 
energy at the expense of chemical energy (or some other kind). If the kinetic 
energy is conserved, as in the ideal collision of the two billiard balls, we call 
the process “elastic.” In the relativistic case, momentum and total energy 
are always conserved, but mass and kinetic energy, in general, are not. Once 
again, we call the process elastic if kinetic energy is conserved. In such a case 
the rest energy (being the total minus the kinetic) is also conserved, and 
therefore so too is the mass. In practice, this means that the same particles 
come out as went in. Examples 12.7 and 12.8 were inelastic processes; the 
next one is elastic. 


Example 12.9. (Compton scattering) A photon of energy $E_0$
“bounces” off an electron, initially at rest. Find the energy E of
the outgoing photon, as a function of the scattering angle $\theta$ (see
Fig. 12.28).


Solution. Conservation of momentum in the “vertical” direction gives
$p_e\sin\phi = p_p\sin\theta$, or, since $p_p = E/c$,


$$\sin\phi = \frac{E}{p_e c}\sin\theta.$$


Conservation of momentum in the “horizontal” direction gives


$$\frac{E_0}{c} = p_p\cos\theta + p_e\cos\phi = \frac{E}{c}\cos\theta + p_e\sqrt{1 - \left(\frac{E}{p_e c}\sin\theta\right)^2},$$


or


$$p_e^2c^2 = (E_0 - E\cos\theta)^2 + E^2\sin^2\theta = E_0^2 - 2E_0E\cos\theta + E^2.$$


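[Fig. 12.28: (Before) photon of energy $E_0$ incident on an electron at rest; (After) photon of energy E at angle $\theta$, electron recoiling at angle $\phi$.]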
Finally, conservation of energy says that


$$E_0 + mc^2 = E + E_e = E + \sqrt{m^2c^4 + p_e^2c^2} = E + \sqrt{m^2c^4 + E_0^2 - 2E_0E\cos\theta + E^2}.$$


Solving for E, I find that


$$E = \frac{1}{(1 - \cos\theta)/mc^2 + (1/E_0)}. \tag{12.57}$$


The answer looks nicer when expressed in terms of photon wavelength:

$$E = h\nu = \frac{hc}{\lambda},$$

so

$$\lambda = \lambda_0 + \frac{h}{mc}(1 - \cos\theta). \tag{12.58}$$
mc 


The quantity (h/mc) is called the Compton wavelength of the electron.
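
For a sense of scale, the sketch below evaluates the Compton wavelength and checks numerically that Eqs. 12.57 and 12.58 agree. It is a Python illustration only: the constants are assumed SI values (not given in the text), and the function names, the incident energy, and the scattering angle are arbitrary choices.

```python
import math

H = 6.62607015e-34   # Planck's constant, J s (assumed SI value)
C = 299_792_458.0    # speed of light, m/s
M_E = 9.1093837e-31  # electron mass, kg (assumed value)


def scattered_energy(E0, theta, mc2):
    """Eq. 12.57: photon energy after scattering through angle theta (radians)."""
    return 1.0 / ((1.0 - math.cos(theta)) / mc2 + 1.0 / E0)


def wavelength_shift(theta):
    """Eq. 12.58: lambda - lambda_0 = (h / m c)(1 - cos theta)."""
    return H / (M_E * C) * (1.0 - math.cos(theta))


compton_wavelength = H / (M_E * C)
print(f"h/mc = {compton_wavelength:.4e} m")

# Convert Eq. 12.57 to wavelengths via E = h c / lambda and compare with Eq. 12.58.
mc2 = M_E * C**2
E0 = 1.0e-14  # arbitrary incident photon energy, in joules
theta = math.radians(90.0)
E = scattered_energy(E0, theta, mc2)
shift_from_energy = H * C / E - H * C / E0
print(f"shift from Eq. 12.57: {shift_from_energy:.6e} m")
print(f"shift from Eq. 12.58: {wavelength_shift(theta):.6e} m")
```
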


Problem 12.32. Find the velocity of the muon in Ex. 12.8. 


Problem 12.33. A particle of mass m whose total energy is twice its rest energy collides with
an identical particle at rest. If they stick together, what is the mass of the resulting composite 
particle? What is its velocity? 


Problem 12.34. A neutral pion of (rest) mass m and (relativistic) momentum $p = \frac{3}{4}mc$
decays into two photons. Suppose one of the photons is emitted in the same direction as the 
original pion, and the other in the opposite direction. Find the (relativistic) energy of each 
photon. 


[Fig. 12.29: (a) one particle fired at a stationary target; (b) two particles fired at each other.]


Problem 12.35. In the past, most experiments in particle physics involved stationary targets:
one particle (usually a proton or an electron) was accelerated to a high energy E, and collided
with a target particle at rest (Fig. 12.29a). Far higher relative energies are obtainable (with
the same accelerator) if you accelerate both particles to energy E, and fire them at each
other (Fig. 12.29b). Classically, the energy $\bar{E}$ of one particle, relative to the other, is just 4E
(why?) ... not much of a gain (only a factor of 4). But relativistically the gain can be enormous.
Assuming the two particles have the same mass, m, show that


$$\bar{E} = \frac{2E^2}{mc^2} - mc^2. \tag{12.59}$$

Suppose you use protons ($mc^2$ = 1 GeV) with E = 30 GeV. What $\bar{E}$ do you get? What
multiple of E does this amount to? (1 GeV = $10^9$ electron volts.) [Because of this relativistic
enhancement, most modern elementary particle experiments involve colliding beams,
instead of fixed targets.]


Problem 12.36. In a pair annihilation experiment, an electron (mass m) with momentum
$p_e$ hits a positron (same mass, but opposite charge) at rest. They annihilate, producing two
photons. (Why couldn't they produce just one photon?) If one of the photons emerges at 60° 
to the incident electron direction, what is its energy? 


12.2.4 Relativistic Dynamics 


Newton’s first law is built into the principle of relativity. His second law, in 
the form 


$$\mathbf{F} = \frac{d\mathbf{p}}{dt}, \tag{12.60}$$


retains its validity in relativistic mechanics, provided we use the relativistic 
momentum. 


Example 12.10. (Motion under a constant force) A particle of mass m
is subject to a constant force F. If it starts from rest at the origin, at
time t = 0, find its position (x), as a function of time.


Solution.

$$\frac{dp}{dt} = F \quad\Rightarrow\quad p = Ft + \text{constant},$$

but since p = 0 at t = 0, the constant must be zero, and hence

$$p = \frac{mu}{\sqrt{1 - u^2/c^2}} = Ft.$$

Solving for u, I find

$$u = \frac{(F/m)t}{\sqrt{1 + (Ft/mc)^2}}. \tag{12.61}$$


The numerator, of course, is the classical answer; it’s approximately
right, if (F/m)t is much less than c. But the relativistic denominator
ensures that u never exceeds c; in fact, as $t \to \infty$, $u \to c$.

To complete the problem we must integrate again:


$$x(t) = \frac{F}{m}\int_0^t \frac{t'}{\sqrt{1 + (Ft'/mc)^2}}\,dt' = \frac{mc^2}{F}\sqrt{1 + (Ft'/mc)^2}\,\bigg|_0^t = \frac{mc^2}{F}\left[\sqrt{1 + (Ft/mc)^2} - 1\right]. \tag{12.62}$$
[Fig. 12.30: ct plotted against x, showing the relativistic trajectory (hyperbola) and the classical one (parabola).]


In place of the classical parabola, $x(t) = (F/2m)t^2$, the graph is a
hyperbola (Fig. 12.30); for this reason, motion under a constant force 
is called hyperbolic motion. It occurs, for example, when a charged 
particle is placed in a uniform electric field. 
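
The behavior of Eqs. 12.61 and 12.62 can be explored numerically. The Python sketch below uses units with c = 1 and writes a = F/m; the function names, the value of a, and the sample times are assumptions for illustration. It tabulates u and x alongside the classical parabola, then integrates Eq. 12.61 with a simple midpoint rule as a check on Eq. 12.62.

```python
import math


def velocity(t, a, c=1.0):
    """Eq. 12.61 with a = F/m: u(t) = a t / sqrt(1 + (a t / c)^2)."""
    return a * t / math.sqrt(1.0 + (a * t / c) ** 2)


def position(t, a, c=1.0):
    """Eq. 12.62 with a = F/m: x(t) = (c^2 / a) [sqrt(1 + (a t / c)^2) - 1]."""
    return (c**2 / a) * (math.sqrt(1.0 + (a * t / c) ** 2) - 1.0)


a = 1.0  # F/m, in units where c = 1
for t in (0.1, 1.0, 10.0, 100.0):
    print(f"t = {t:6.1f}: u = {velocity(t, a):.6f} c, "
          f"x = {position(t, a):10.4f}, classical x = {0.5 * a * t**2:10.4f}")

# Midpoint-rule integration of u(t) from 0 to t_end, compared with Eq. 12.62.
n, t_end = 100_000, 10.0
dt = t_end / n
x_numeric = sum(velocity((i + 0.5) * dt, a) * dt for i in range(n))
print("numerical integral:", x_numeric)
print("Eq. 12.62:         ", position(t_end, a))
```

For small t the two positions agree; for large t the relativistic x grows only linearly, as it must if u is to stay below c.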


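Work, as always, is the line integral of the force:

$$W \equiv \int \mathbf{F}\cdot d\mathbf{l}. \tag{12.63}$$


The work-energy theorem (“the net work done on a particle equals the
increase in its kinetic energy”) holds relativistically:


$$W = \int \frac{d\mathbf{p}}{dt}\cdot d\mathbf{l} = \int \frac{d\mathbf{p}}{dt}\cdot\frac{d\mathbf{l}}{dt}\,dt = \int \frac{d\mathbf{p}}{dt}\cdot\mathbf{u}\,dt,$$


while

$$\frac{d\mathbf{p}}{dt}\cdot\mathbf{u} = \frac{d}{dt}\left(\frac{m\mathbf{u}}{\sqrt{1 - u^2/c^2}}\right)\cdot\mathbf{u} = \frac{mu}{(1 - u^2/c^2)^{3/2}}\,\frac{du}{dt} = \frac{d}{dt}\left(\frac{mc^2}{\sqrt{1 - u^2/c^2}}\right) = \frac{dE}{dt}, \tag{12.64}$$

so

$$W = \int \frac{dE}{dt}\,dt = E_{\text{final}} - E_{\text{initial}}. \tag{12.65}$$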


(Since the rest energy is constant, it doesn’t matter whether we use the total 
energy, here, or the kinetic energy.) 
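
For the constant-force motion of Ex. 12.10 the theorem can be checked directly, since the work is simply F times the distance covered. The Python sketch below does this in units with c = 1; the function names and the values of F, m, and t are arbitrary assumptions made for illustration.

```python
import math


def energy(p, m, c=1.0):
    """Total relativistic energy from momentum (Eq. 12.55)."""
    return math.sqrt((p * c) ** 2 + (m * c**2) ** 2)


def position(t, F, m, c=1.0):
    """Eq. 12.62: position of a particle starting from rest under constant force F."""
    return (m * c**2 / F) * (math.sqrt(1.0 + (F * t / (m * c)) ** 2) - 1.0)


F, m = 2.0, 1.0  # arbitrary values, in units where c = 1
for t in (0.5, 1.0, 5.0):
    work = F * position(t, F, m)              # W = F x for a constant force along x
    gain = energy(F * t, m) - energy(0.0, m)  # p = F t, as in Ex. 12.10
    print(f"t = {t}: W = {work:.6f}, E_final - E_initial = {gain:.6f}")
```
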

Unlike the first two, Newton’s third law does not, in general, extend to 
the relativistic domain. Indeed, if the two objects in question are separated 
in space, the third law is incompatible with the relativity of simultaneity. For 
suppose the force of A on B at some instant t is F(t), and the force of B on 
A at the same instant is −F(t); then the third law applies in this reference
frame. But a moving observer will report that these equal and opposite
forces occurred at different times; in this system, therefore, the third law is 
violated. Only in the case of contact interactions, where the two forces are 
applied at the same physical point (and in the trivial case where the forces 
are constant) does the third law hold. 

Because F is the derivative of momentum with respect to ordinary time, 
it shares the ugly behavior of (ordinary) velocity, when you go from one 
inertial system to another: both the numerator and the denominator must 
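be transformed. Thus,²⁰


$$\bar{F}_y = \frac{d\bar{p}_y}{d\bar{t}} = \frac{dp_y}{\gamma\,dt - \frac{\gamma\beta}{c}\,dx} = \frac{dp_y/dt}{\gamma\left(1 - \frac{\beta}{c}\frac{dx}{dt}\right)} = \frac{F_y}{\gamma(1 - \beta u_x/c)}, \tag{12.66}$$

and similarly for the z component:

$$\bar{F}_z = \frac{F_z}{\gamma(1 - \beta u_x/c)}.$$

The x component is even worse:

$$\bar{F}_x = \frac{d\bar{p}_x}{d\bar{t}} = \frac{\gamma\,dp_x - \gamma\beta\,dp^0}{\gamma\,dt - \frac{\gamma\beta}{c}\,dx} = \frac{\frac{dp_x}{dt} - \beta\frac{dp^0}{dt}}{1 - \frac{\beta}{c}\frac{dx}{dt}} = \frac{F_x - \frac{\beta}{c}\left(\frac{dE}{dt}\right)}{1 - \beta u_x/c}.$$

We calculated dE/dt in Eq. 12.64; putting that in,

$$\bar{F}_x = \frac{F_x - \beta(\mathbf{u}\cdot\mathbf{F})/c}{1 - \beta u_x/c}. \tag{12.67}$$
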

In one special case these equations are reasonably tractable: if the particle 
is (instantaneously) at rest in S, so that u = 0, then 


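$$\bar{\mathbf{F}}_\perp = \frac{1}{\gamma}\mathbf{F}_\perp, \qquad \bar{F}_\parallel = F_\parallel. \tag{12.68}$$


That is, the component of F parallel to the motion of $\bar{S}$ is unchanged,
whereas perpendicular components are divided by $\gamma$.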

It has perhaps occurred to you that we could avoid the bad transformation
behavior of F by introducing a “proper” force, analogous to
proper velocity, which would be the derivative of momentum with respect
to proper time:


$$K^\mu \equiv \frac{dp^\mu}{d\tau}. \tag{12.69}$$

This is called the Minkowski force; it is plainly a 4-vector, since $p^\mu$ is a
4-vector and proper time is invariant. The spatial components of $K^\mu$ are
related to the “ordinary” force by

$$\mathbf{K} = \left(\frac{dt}{d\tau}\right)\frac{d\mathbf{p}}{dt} = \frac{1}{\sqrt{1 - u^2/c^2}}\,\mathbf{F}, \tag{12.70}$$


20 Remember: $\gamma$ and $\beta$ pertain to the motion of $\bar{S}$ with respect to S (they are constants); u is the
velocity of the particle with respect to S.

while the zeroth component,


$$K^0 = \frac{dp^0}{d\tau} = \frac{1}{c}\frac{dE}{d\tau}, \tag{12.71}$$

is, apart from the 1/c, the (proper) rate at which the energy of the particle
increases; in other words, the (proper) power delivered to the particle.
Relativistic dynamics can be formulated in terms of the ordinary force
or in terms of the Minkowski force. The latter is generally much neater,
but since in the long run we are interested in the particle’s trajectory as a
function of ordinary time, the former is often more useful. When we wish
to generalize some classical force law, such as Lorentz’s, to the relativistic
domain, a question arises: Does the classical formula correspond to the
ordinary force or to the Minkowski force? In other words, should we write


$$\mathbf{F} = q(\mathbf{E} + \mathbf{u}\times\mathbf{B}),$$


or should it rather be

$$\mathbf{K} = q(\mathbf{E} + \mathbf{u}\times\mathbf{B})\,?$$


Since proper time and ordinary time are identical in classical physics, there 
is no way at this stage to decide the issue. The Lorentz force, as it turns out, 
is an ordinary force — later on I'll explain why this is so, and show you how 
to construct the electromagnetic Minkowski force. 


Example 12.11. The typical trajectory of a charged particle in a uniform
magnetic field is cyclotron motion (Fig. 12.31). The magnetic force
pointing toward the center,


$$F = QuB,$$


provides the centripetal acceleration necessary to sustain circular
motion. Beware, however: in special relativity the centripetal force
is not $mu^2/R$, as in classical mechanics. Rather, as you can see from
Fig. 12.32, $dp = p\,d\theta$, so

$$F = \frac{dp}{dt} = p\,\frac{d\theta}{dt} = p\,\frac{u}{R}.$$

[Fig. 12.31: cyclotron motion of a charge in a uniform magnetic field. Fig. 12.32: momentum vectors p and p + dp, separated by the angle $d\theta$.]

(Classically, of course, p = mu, so $F = mu^2/R$.) Thus,


$$QuB = p\,\frac{u}{R},$$

or

$$p = QBR. \tag{12.72}$$


In this form, the relativistic cyclotron formula is identical to the
nonrelativistic one, Eq. 5.3; the only difference is that p is now the
relativistic momentum.
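
To see how much the relativistic momentum matters, the Python sketch below computes the orbit radius R = p/(QB) for an electron at several speeds and compares it with the classical estimate mu/(QB). The constants are assumed SI values, and the field strength, the speeds, and the function name `cyclotron_radius` are arbitrary choices for illustration.

```python
import math

C = 299_792_458.0      # speed of light, m/s
Q_E = 1.602176634e-19  # elementary charge, C (assumed SI value)
M_E = 9.1093837e-31    # electron mass, kg (assumed value)


def cyclotron_radius(u, B, m=M_E, q=Q_E):
    """R = p / (Q B), with p the relativistic momentum (Eqs. 12.47 and 12.72)."""
    p = m * u / math.sqrt(1.0 - (u / C) ** 2)
    return p / (q * B)


B = 0.1  # tesla, an arbitrary illustrative field strength
for frac in (0.01, 0.5, 0.9, 0.99):
    u = frac * C
    r_rel = cyclotron_radius(u, B)
    r_classical = M_E * u / (Q_E * B)
    print(f"u = {frac:4.2f} c: R = {r_rel:.4e} m (classical estimate {r_classical:.4e} m)")
```
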


In classical mechanics, the total momentum (P) of a collection of
interacting particles can be expressed as the total mass (M) times the
velocity of the center of mass:


$$\mathbf{P} = M\,\frac{d\mathbf{R}_m}{dt}.$$

In relativity the center of mass ($\mathbf{R}_m = \frac{1}{M}\sum m_i\mathbf{r}_i$) is replaced by the center
of energy ($\mathbf{R}_e = \frac{1}{E}\sum E_i\mathbf{r}_i$, where E is the total energy), and M by $E/c^2$:


$$\mathbf{P} = \frac{E}{c^2}\,\frac{d\mathbf{R}_e}{dt}. \tag{12.73}$$

The quantity P now includes all forms of momentum, and E all forms of
energy: not just mechanical, but also whatever may be stored in the fields.²¹
In particular, if the center of energy is at rest, the total momentum of the
system must be zero,²² as you would certainly expect.


Example 12.12. In Ex. 8.3 we found that the momentum stored in the 
fields of a coaxial cable is not zero, even though the cable itself is at 
rest. At the time, this seemed paradoxical. However, energy is being 
transported, from the battery to the resistor, and hence the center of 
energy is in motion. Indeed, if the battery is at z = 0, so the resistor 
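is at z = l, then $\mathbf{R}_e = (E_0\mathbf{R}_0 + E_R\,l\,\hat{\mathbf{z}})/E$, where $E_R$ is the energy in the
resistor, $E_0$ is any energy not contained in the battery or the resistor,
and $\mathbf{R}_0$ is the center of energy of $E_0$, so


$$\frac{d\mathbf{R}_e}{dt} = \frac{(dE_R/dt)\,l}{E}\,\hat{\mathbf{z}} = \frac{IVl}{E}\,\hat{\mathbf{z}}.$$

According to Eq. 12.73, then, the total momentum should be

$$\mathbf{P} = \frac{IVl}{c^2}\,\hat{\mathbf{z}},$$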

which is exactly the momentum in the fields, as calculated in Ex. 8.3. 
If this still seems strange to you, imagine a shoebox, with a marble
inside that we cannot see. The box is at rest, but the marble is rolling
from one end to the other. Is there momentum in this system? Yes,
of course there is; even though the box is stationary, there’s still the
momentum of the marble. In the case of the coaxial cable, no actual
object is in motion (well, the electrons are, but there are just as many of
them going one way as the other, and their net momentum is zero), but
energy is flowing from one end to the other, and in relativity all forms
of energy in motion, not just rest energy (mass), constitute momentum.
The “marble” (in this analogy) is the electromagnetic field, which
transports energy, and therefore contributes momentum... even
though the fields themselves are perfectly static.²³

21 The proof of Eq. 12.73 is not trivial. See S. Coleman and J. H. Van Vleck, Phys. Rev. 171,
1370 (1968) or M. G. Calkin, Am. J. Phys. 39, 513 (1971).
22 This is sometimes called the center-of-energy theorem.


In the following example, the center of energy is at rest, so the total
momentum must be zero (Eq. 12.73). But the (static) electromagnetic fields
do carry momentum, and the problem is to locate the compensating
mechanical momentum.


Example 12.13. As a model for a magnetic dipole m, consider a rectangular
loop of wire carrying a steady current I. Picture the current
as a stream of noninteracting positive charges that move freely within
the wire. When a uniform electric field E is applied (Fig. 12.33), the
charges accelerate in the left segment and decelerate in the right one.²⁴
Find the total momentum of all the charges in the loop.


Solution. The momenta of the left and right segments cancel, so
we need only consider the top and the bottom. Say there are $N_+$
charges in the top segment, going at speed $u_+$ to the right, and
$N_-$ charges in the lower segment, going at (slower) speed $u_-$ to
the left. The current ($I = \lambda u$) is the same in all four segments
(or else charge would be piling up somewhere); in particular,

$$I = \frac{QN_+}{l}u_+ = \frac{QN_-}{l}u_-, \quad\text{so}\quad N_\pm u_\pm = \frac{Il}{Q},$$

where Q is the charge of each particle, and l is the length of the
rectangle. Classically, the momentum of a single particle is p = Mu
(where M is its mass), and the total momentum (to the right) is

$$p_{\text{classical}} = MN_+u_+ - MN_-u_- = M\frac{Il}{Q} - M\frac{Il}{Q} = 0,$$

as one would certainly expect (after all, the loop as a whole is not
moving). But relativistically, $p = \gamma Mu$, and we get

$$p = \gamma_+MN_+u_+ - \gamma_-MN_-u_- = \frac{MIl}{Q}(\gamma_+ - \gamma_-),$$

which is not zero, because the particles in the upper segment are moving
faster.

[Fig. 12.33: rectangular loop carrying current I in a uniform electric field E.]

23 This problem was incorrectly analyzed in the third edition; see T. H. Boyer, Am. J. Phys.
76, 190 (2008).
24 This is not a very realistic model for a current-carrying wire, obviously, but other models
lead to exactly the same result. See V. Hnizdo, Am. J. Phys. 65, 92 (1997).

In fact, the gain in energy ($\gamma Mc^2$), as a particle ascends the left
segment, is equal to the work done by the electric force, QEw, where w
is the height of the rectangle, so

$$\gamma_+ - \gamma_- = \frac{QEw}{Mc^2},$$

and hence

$$p = \frac{IlEw}{c^2}.$$

But Ilw is the magnetic dipole moment of the loop; as vectors, m points
into the page and p is to the right, so

$$\mathbf{p} = \frac{1}{c^2}(\mathbf{m}\times\mathbf{E}). \tag{12.74}$$

Thus a magnetic dipole at rest in an electric field carries linear momentum,
even though it is not moving! This so-called hidden momentum
is strictly relativistic, and purely mechanical; it precisely cancels the
electromagnetic momentum stored in the fields (Eq. 8.45).²⁵
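
The bookkeeping in Ex. 12.13 can be checked numerically. The sketch below sums $\gamma Mu$ over the top and bottom segments directly and compares the result with $IlEw/c^2$, the magnitude of $(\mathbf{m}\times\mathbf{E})/c^2$ for this geometry. The function names and all numerical values (electron-like carriers, a 1 A loop, a 1 MV/m field) are assumptions chosen only for illustration.

```python
import math

C = 299_792_458.0  # speed of light (m/s)

def hidden_momentum_direct(M, Q, I, l, w, E, u_minus):
    """Sum gamma*M*u over the top and bottom segments of the loop in Ex. 12.13."""
    gamma_minus = 1.0 / math.sqrt(1.0 - (u_minus / C) ** 2)
    # Work done by the field up the left segment raises gamma by QEw/(Mc^2).
    gamma_plus = gamma_minus + Q * E * w / (M * C**2)
    u_plus = C * math.sqrt(1.0 - 1.0 / gamma_plus**2)
    # Same current in both segments: N u = I l / Q.
    n_plus = I * l / (Q * u_plus)
    n_minus = I * l / (Q * u_minus)
    return gamma_plus * M * n_plus * u_plus - gamma_minus * M * n_minus * u_minus

def hidden_momentum_formula(I, l, w, E):
    """Magnitude of (1/c^2) m x E for m = I l w perpendicular to E (Eq. 12.74)."""
    return I * l * w * E / C**2

# Illustrative (assumed) values.
args = dict(M=9.109e-31, Q=1.602e-19, I=1.0, l=0.2, w=0.1, E=1.0e6)
p_direct = hidden_momentum_direct(u_minus=0.5 * C, **args)
p_formula = hidden_momentum_formula(args["I"], args["l"], args["w"], args["E"])
print(p_direct, p_formula)
```

The agreement does not depend on the carrier mass, charge, or starting speed, which is the point of the example: only I, l, w, and E survive.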


Problem 12.37. In classical mechanics, Newton's law can be written in the more familiar 
form F = ma. The relativistic equation, F = dp/dt, cannot be so simply expressed. Show, 
rather, that 


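$$\mathbf{F} = \frac{m}{\sqrt{1-u^2/c^2}}\left[\mathbf{a} + \frac{\mathbf{u}(\mathbf{u}\cdot\mathbf{a})}{c^2-u^2}\right], \tag{12.75}$$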


where a = du/dt is the ordinary acceleration. 


Problem 12.38. Show that it is possible to outrun a light ray, if you're given a sufficient head 
start, and your feet generate a constant force. 


25 This model is due to P. Penfield, Jr. and H. A. Haus, Electrodynamics of Moving Media, 
MIT Press, Cambridge, MA (1967), pp. 214-216. For more on hidden momentum, look 
again at Prob. 8.8, and the reference cited there.

Problem 12.39. Define proper acceleration in the obvious way:

$$\alpha^\mu \equiv \frac{d\eta^\mu}{d\tau} = \frac{d^2x^\mu}{d\tau^2}. \tag{12.76}$$

(a) Find $\alpha^0$ and $\boldsymbol{\alpha}$ in terms of u and a (the ordinary acceleration).
(b) Express $\alpha_\mu\alpha^\mu$ in terms of u and a.
(c) Show that $\eta^\mu\alpha_\mu = 0$.
(d) Write the Minkowski version of Newton's second law, Eq. 12.69, in terms of $\alpha^\mu$. Evaluate the
invariant product $K^\mu\eta_\mu$.


Problem 12.40. Show that 
$$K_\mu K^\mu = \frac{1-(u^2/c^2)\cos^2\theta}{1-u^2/c^2}\,F^2,$$

where θ is the angle between u and F.


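Problem 12.41. Show that the (ordinary) acceleration of a particle of mass m and charge q,
moving at velocity u under the influence of electromagnetic fields E and B, is given by

$$\mathbf{a} = \frac{q}{m}\sqrt{1-u^2/c^2}\left[\mathbf{E} + \mathbf{u}\times\mathbf{B} - \frac{1}{c^2}\mathbf{u}(\mathbf{u}\cdot\mathbf{E})\right].$$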
[Hint: Use Eq. 12.75.] 


Problem 12.42. Generalize the strategy of Ex. 12.13 to a stationary current loop of arbitrary 
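shape in a static electric field that is not necessarily uniform, to show that the hidden momentum
can be written as

$$\mathbf{p}_{\text{hid}} = -\frac{I}{c^2}\oint V\,d\mathbf{l},$$

where V is the scalar potential. For surface and volume currents this becomes

$$\mathbf{p}_{\text{hid}} = -\frac{1}{c^2}\int V\mathbf{K}\,da, \qquad \mathbf{p}_{\text{hid}} = -\frac{1}{c^2}\int V\mathbf{J}\,d\tau. \tag{12.77}$$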
Compare Eq. 8.46, and comment. 


Problem 12.43. In Prob. 8.26 the center of energy was at rest, but the (static) fields carried 
momentum. Use Eq. 12.77 to calculate the hidden momentum in this configuration, and con- 
firm that it exactly balances the field momentum. What does this do to the conservation of 
momentum, which we painstakingly confirmed in parts (c), (d), and (e) of that problem?


12.3.1 Magnetism as a Relativistic Phenomenon


Unlike Newtonian mechanics, classical electrodynamics is already consistent
with special relativity. Maxwell’s equations and the Lorentz force
law can be applied legitimately in any inertial system. Of course, what
one observer interprets as an electrical process, another may regard as 
magnetic, but the actual particle motions they predict will be identical. To 
the extent that this did not work out for Lorentz and others, who studied the 
question in the late nineteenth century, the fault lay with the nonrelativis- 
tic mechanics they used, not with the electrodynamics. Having corrected 
Newtonian mechanics, we are now in a position to develop a complete and 
consistent formulation of relativistic electrodynamics. I emphasize that we 
will not be changing the rules of electrodynamics in the slightest — rather, 
we will be expressing these rules in a notation that exposes and illuminates 
their relativistic character. As we go along, I shall pause now and then 
to rederive, using the Lorentz transformations, results obtained earlier by 
more laborious means. But the main purpose of this section is to provide 
you with a deeper understanding of the structure of electrodynamics — 
laws that had seemed arbitrary and unrelated before, take on a kind of 
coherence and inevitability when approached from the point of view of 
relativity. 

To begin with, I’d like to show you why there had to be such a thing
as magnetism, given electrostatics and relativity, and how, in particular,
you can calculate the magnetic force between a current-carrying wire and
a moving charge without ever invoking the laws of magnetism.²⁶ Suppose
you had a string of positive charges moving along to the right at speed v.
I’ll assume the charges are close enough together so that we may treat them
as a continuous line charge λ. Superimposed on this positive string is a
negative one, −λ, proceeding to the left at the same speed v. We have, then,
a net current to the right, of magnitude

$$I = 2\lambda v. \tag{12.78}$$

Meanwhile, a distance s away there is a point charge q traveling to the right
at speed u < v (Fig. 12.34a). Because the two line charges cancel, there is no
electrical force on q in this system (S).

However, let’s examine the same situation from the point of view of
system $\bar{S}$, which moves to the right with speed u (Fig. 12.34b). In this
reference frame, q is at rest. By the Einstein velocity addition rule, the
velocities of the positive and negative lines are now

$$v_\pm = \frac{v \mp u}{1 \mp vu/c^2}. \tag{12.79}$$

Because $v_-$ is greater than $v_+$, the Lorentz contraction of the spacing
between negative charges is more pronounced than that between positive
charges; in this frame, therefore, the wire carries a net negative charge!
In fact,

$$\lambda_\pm = \pm(\gamma_\pm)\lambda_0, \tag{12.80}$$

where

$$\gamma_\pm = \frac{1}{\sqrt{1 - v_\pm^2/c^2}}, \tag{12.81}$$

and $\lambda_0$ is the charge density of the positive line in its own rest system. That’s
not the same as λ, of course; in S they’re already moving at speed v, so

$$\lambda = \gamma\lambda_0, \tag{12.82}$$

where

$$\gamma = \frac{1}{\sqrt{1 - v^2/c^2}}. \tag{12.83}$$

It takes some algebra to put $\gamma_\pm$ into the simplest form:

$$\gamma_\pm = \frac{1}{\sqrt{1 - \frac{1}{c^2}(v \mp u)^2(1 \mp vu/c^2)^{-2}}} = \frac{c^2 \mp uv}{\sqrt{(c^2 \mp uv)^2 - c^2(v \mp u)^2}} = \frac{c^2 \mp uv}{\sqrt{(c^2 - v^2)(c^2 - u^2)}} = \gamma\,\frac{1 \mp uv/c^2}{\sqrt{1 - u^2/c^2}}. \tag{12.84}$$

The net line charge in $\bar{S}$, then, is

$$\lambda_{\text{tot}} = \lambda_+ + \lambda_- = \lambda_0(\gamma_+ - \gamma_-) = \frac{-2\lambda uv}{c^2\sqrt{1 - u^2/c^2}}. \tag{12.85}$$

26 This and several other arguments in this section are adapted from E. M. Purcell’s Electricity
and Magnetism, 2nd ed., McGraw-Hill, New York (1985).


Conclusion: As a result of unequal Lorentz contraction of the positive 
and negative lines, a current-carrying wire that is electrically neutral in one 
inertial system will be charged in another.

Now, a line charge $\lambda_{\text{tot}}$ sets up an electric field

$$E = \frac{\lambda_{\text{tot}}}{2\pi\epsilon_0 s},$$

so there is an electrical force on q in $\bar{S}$, to wit,

$$\bar{F} = qE = -\frac{\lambda v}{\pi\epsilon_0c^2s}\,\frac{qu}{\sqrt{1 - u^2/c^2}}. \tag{12.86}$$

But if there’s a force on q in $\bar{S}$, there must be one in S; in fact, we can
calculate it by using the transformation rules for forces. Since q is at rest in
$\bar{S}$, and $\bar{F}$ is perpendicular to u, the force in S is given by Eq. 12.68:

$$F = \sqrt{1 - u^2/c^2}\,\bar{F} = -\frac{\lambda v}{\pi\epsilon_0c^2}\,\frac{qu}{s}. \tag{12.87}$$

The charge is attracted toward the wire by a force that is purely electrical in
$\bar{S}$ (where the wire is charged, and q is at rest), but distinctly nonelectrical
in S (where the wire is neutral). Taken together, then, electrostatics and
relativity imply the existence of another force. This “other force” is, of
course, magnetic. In fact, we can cast Eq. 12.87 into more familiar form by
using $c^2 = (\epsilon_0\mu_0)^{-1}$ and expressing λv in terms of the current (Eq. 12.78):

$$F = -qu\left(\frac{\mu_0I}{2\pi s}\right). \tag{12.88}$$

The term in parentheses is the magnetic field of a long straight wire, and the
force is precisely what we would have obtained by using the Lorentz force
law in system S.
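
The chain of reasoning from Eq. 12.79 to Eq. 12.88 can be followed numerically. The sketch below computes the force the "electrostatic" way (velocity addition, unequal Lorentz contraction, Coulomb field of the net line charge, force transformation) and compares it with the Lorentz force law. The function names and the sample values are assumptions for illustration; `MU0` is defined from `EPS0` and `C` so that $c^2 = 1/\epsilon_0\mu_0$ holds exactly in floating point.

```python
import math

C = 299_792_458.0          # speed of light (m/s)
EPS0 = 8.8541878128e-12    # vacuum permittivity (F/m)
MU0 = 1.0 / (EPS0 * C**2)  # chosen so that c^2 = 1/(eps0*mu0)

def gamma(speed):
    return 1.0 / math.sqrt(1.0 - (speed / C) ** 2)

def force_from_electrostatics(q, u, lam, v, s):
    """Force on q in S, via the net line charge seen in the rest frame of q."""
    v_plus = (v - u) / (1.0 - v * u / C**2)    # Eq. 12.79, upper signs
    v_minus = (v + u) / (1.0 + v * u / C**2)   # Eq. 12.79, lower signs
    lam0 = lam / gamma(v)                      # Eq. 12.82
    lam_tot = lam0 * (gamma(v_plus) - gamma(v_minus))  # Eq. 12.85
    e_bar = lam_tot / (2.0 * math.pi * EPS0 * s)
    f_bar = q * e_bar                          # Eq. 12.86
    return f_bar / gamma(u)                    # Eq. 12.87

def force_from_lorentz_law(q, u, lam, v, s):
    current = 2.0 * lam * v                    # Eq. 12.78
    return -q * u * MU0 * current / (2.0 * math.pi * s)  # Eq. 12.88

# Illustrative (assumed) values; speeds are deliberately relativistic.
params = dict(q=1e-9, u=0.3 * C, lam=1e-9, v=0.6 * C, s=0.01)
f_rel = force_from_electrostatics(**params)
f_mag = force_from_lorentz_law(**params)
print(f_rel, f_mag)
```

Both routes give the same negative (attractive) force, even at speeds where the contraction factors are far from 1.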


12.3.2 How the Fields Transform 


We have learned, in various special cases, that one observer’s electric 
field is another’s magnetic field. It would be nice to know the general 
transformation rules for electromagnetic fields: Given the fields in S, what
are the fields in $\bar{S}$? Your first guess might be that E is the spatial part of
one 4-vector and B the spatial part of another. But your guess would be 
wrong — it’s more complicated than that. Let me begin by making explicit an 
assumption that has already been used implicitly in Section 12.3.1: charge 
is invariant. Like mass, but unlike energy, the charge of a particle is a fixed 
number, independent of how fast it happens to be moving. We shall assume 
also that the transformation rules are the same no matter how the fields 
were produced — electric fields associated with changing magnetic fields 
transform the same way as those set up by stationary charges. Were this 
not the case we'd have to abandon the field formulation altogether, for it is 
the essence of a field theory that the fields at a given point tell you all there 
is to know, electromagnetically, about that point; you do not have to append 
extra information regarding their source.

With this in mind, consider the simplest possible electric field: the
uniform field in the region between the plates of a large parallel-plate
capacitor (Fig. 12.35a). Say the capacitor is at rest in $S_0$ and carries surface
charges $\pm\sigma_0$. Then

$$\mathbf{E}_0 = \frac{\sigma_0}{\epsilon_0}\,\hat{\mathbf{y}}. \tag{12.89}$$

What if we examine this same capacitor from system S, moving to the right
at speed $v_0$ (Fig. 12.35b)? In this system the plates are moving to the left,
but the field still takes the form

$$\mathbf{E} = \frac{\sigma}{\epsilon_0}\,\hat{\mathbf{y}}; \tag{12.90}$$

the only difference is the value of the surface charge σ. [Wait a minute! Is
that the only difference? The formula $E = \sigma/\epsilon_0$ for a parallel-plate capacitor
came from Gauss’s law, and whereas Gauss’s law is perfectly valid for
moving charges, this application also relies on symmetry. Are we sure that
the field is still perpendicular to the plates? What if the field of a moving
plane tilts, say, along the direction of motion, as in Fig. 12.35c? Well, even
if it did (it doesn’t), the field between the plates, being the superposition
of the +σ field and the −σ field, would still run perpendicular to the plates
(changing the sign of the charge reverses the direction of the field, and the
vector sum kills off the parallel components).]

Now, the total charge on each plate is invariant, and the width (w) is
unchanged, but the length (l) is Lorentz-contracted by a factor of

$$\frac{1}{\gamma_0} = \sqrt{1 - v_0^2/c^2}, \tag{12.91}$$

so the charge per unit area is increased by a factor of $\gamma_0$:

$$\sigma = \gamma_0\sigma_0. \tag{12.92}$$

Accordingly,

$$\mathbf{E}^\perp = \gamma_0\mathbf{E}_0^\perp. \tag{12.93}$$

I have put in the superscript ⊥ to make it clear that this rule pertains to
components of E that are perpendicular to the direction of motion of S. To
get the rule for parallel components, consider the capacitor lined up with the
yz plane (Fig. 12.36). This time it is the plate separation (d) that is Lorentz-contracted,
whereas l and w (and hence also σ) are the same in both frames.
Since the field does not depend on d, it follows that

$$E^\parallel = E_0^\parallel. \tag{12.94}$$


Example 12.14. (Electric field of a point charge in uniform motion) A
point charge q is at rest at the origin in system $S_0$. Question: What is
the electric field of this same charge in system S, which moves to the
right at speed $v_0$ relative to $S_0$?

Solution. In $S_0$, the field is

$$\mathbf{E}_0 = \frac{1}{4\pi\epsilon_0}\frac{q}{r_0^2}\,\hat{\mathbf{r}}_0,$$

or

$$E_{x0} = \frac{1}{4\pi\epsilon_0}\frac{qx_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}, \quad E_{y0} = \frac{1}{4\pi\epsilon_0}\frac{qy_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}, \quad E_{z0} = \frac{1}{4\pi\epsilon_0}\frac{qz_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}.$$

From the transformation rules (Eqs. 12.93 and 12.94), we have

$$E_x = E_{x0} = \frac{1}{4\pi\epsilon_0}\frac{qx_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}, \quad E_y = \gamma_0E_{y0} = \frac{1}{4\pi\epsilon_0}\frac{\gamma_0qy_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}, \quad E_z = \gamma_0E_{z0} = \frac{1}{4\pi\epsilon_0}\frac{\gamma_0qz_0}{(x_0^2 + y_0^2 + z_0^2)^{3/2}}.$$

These are still expressed in terms of the $S_0$ coordinates $(x_0, y_0, z_0)$
of the field point (P); I’d prefer to write them in terms of the S
coordinates of P. From the Lorentz transformations (or, actually, the
inverse transformations),

$$x_0 = \gamma_0(x + v_0t) = \gamma_0R_x, \quad y_0 = y = R_y, \quad z_0 = z = R_z,$$

where R is the vector from q to P (Fig. 12.37). Thus

$$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{\gamma_0q\mathbf{R}}{(\gamma_0^2R^2\cos^2\theta + R^2\sin^2\theta)^{3/2}} = \frac{1}{4\pi\epsilon_0}\frac{q(1 - v_0^2/c^2)}{[1 - (v_0^2/c^2)\sin^2\theta]^{3/2}}\frac{\hat{\mathbf{R}}}{R^2}. \tag{12.95}$$

This, then, is the field of a charge in uniform motion; we got the
same result in Chapter 10 using the retarded potentials (Eq. 10.75).
The present derivation is far more efficient, and sheds some light on
the remarkable fact that the field points away from the instantaneous
(as opposed to the retarded) position of the charge: $E_x$ gets a factor
of $\gamma_0$ from the Lorentz transformation of the coordinates; $E_y$ and $E_z$
pick up theirs from the transformation of the field. It’s the balancing
of these two $\gamma_0$s that leaves E parallel to R.
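
The two routes to the field in Ex. 12.14 can be compared numerically. This sketch evaluates the field once by transforming the Coulomb field component by component, and once from the closed form, Eq. 12.95. The function names and sample values are assumptions for illustration; R is taken as the vector from the charge's present position to the field point, with the motion along x.

```python
import math

C = 299_792_458.0        # speed of light (m/s)
EPS0 = 8.8541878128e-12  # vacuum permittivity (F/m)
K = 1.0 / (4.0 * math.pi * EPS0)

def field_by_transformation(q, v0, R):
    """E in S: Coulomb field in S0, then Eqs. 12.93 and 12.94."""
    g0 = 1.0 / math.sqrt(1.0 - (v0 / C) ** 2)
    rx, ry, rz = R
    x0, y0, z0 = g0 * rx, ry, rz           # inverse Lorentz transformation
    r0_cubed = (x0**2 + y0**2 + z0**2) ** 1.5
    return (K * q * x0 / r0_cubed,          # parallel component unchanged
            g0 * K * q * y0 / r0_cubed,     # perpendicular components pick up gamma0
            g0 * K * q * z0 / r0_cubed)

def field_closed_form(q, v0, R):
    """Eq. 12.95, with theta the angle between R and the direction of motion."""
    rx, ry, rz = R
    r = math.sqrt(rx**2 + ry**2 + rz**2)
    beta2 = (v0 / C) ** 2
    sin2 = (ry**2 + rz**2) / r**2
    scale = K * q * (1.0 - beta2) / (1.0 - beta2 * sin2) ** 1.5 / r**3
    return (scale * rx, scale * ry, scale * rz)

# Illustrative (assumed) values.
R = (0.3, 0.4, -0.2)
e_transformed = field_by_transformation(1e-9, 0.8 * C, R)
e_closed = field_closed_form(1e-9, 0.8 * C, R)
print(e_transformed)
print(e_closed)
```

Because the closed form is a scalar multiple of R, agreement between the two confirms that the transformed field is parallel to R.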


But Eqs. 12.93 and 12.94 are not the most general transformation laws,
for we began with a system $S_0$ in which the charges were at rest and where,
consequently, there was no magnetic field. To derive the general rule, we
must start out in a system with both electric and magnetic fields. For this
purpose S itself will serve nicely. In addition to the electric field

$$E_y = \frac{\sigma}{\epsilon_0}, \tag{12.96}$$

there is a magnetic field due to the surface currents (Fig. 12.35b):

$$\mathbf{K}_\pm = \mp\sigma v_0\,\hat{\mathbf{x}}. \tag{12.97}$$

By the right-hand rule, this field points in the negative z direction; its
magnitude is given by Ampère’s law (Ex. 5.8):

$$B_z = -\mu_0\sigma v_0. \tag{12.98}$$

In a third system, $\bar{S}$, traveling to the right with speed v relative to S
(Fig. 12.38), the fields would be

$$\bar{E}_y = \frac{\bar\sigma}{\epsilon_0}, \quad \bar{B}_z = -\mu_0\bar\sigma\bar{v}, \tag{12.99}$$

where $\bar{v}$ is the velocity of $\bar{S}$ relative to $S_0$:

$$\bar{v} = \frac{v + v_0}{1 + vv_0/c^2}, \quad \bar\gamma = \frac{1}{\sqrt{1 - \bar{v}^2/c^2}}, \tag{12.100}$$

and

$$\bar\sigma = \bar\gamma\sigma_0. \tag{12.101}$$
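It remains only to express $\bar{\mathbf{E}}$ and $\bar{\mathbf{B}}$ (Eq. 12.99), in terms of E and B
(Eqs. 12.96 and 12.98). In view of Eqs. 12.92 and 12.101, we have

$$\bar{E}_y = \left(\frac{\bar\gamma}{\gamma_0}\right)\frac{\sigma}{\epsilon_0}, \quad \bar{B}_z = -\left(\frac{\bar\gamma}{\gamma_0}\right)\mu_0\sigma\bar{v}. \tag{12.102}$$

With a little algebra, you can show that

$$\frac{\bar\gamma}{\gamma_0} = \frac{\sqrt{1 - v_0^2/c^2}}{\sqrt{1 - \bar{v}^2/c^2}} = \frac{1 + vv_0/c^2}{\sqrt{1 - v^2/c^2}} = \gamma\left(1 + \frac{vv_0}{c^2}\right), \tag{12.103}$$

where

$$\gamma = \frac{1}{\sqrt{1 - v^2/c^2}}, \tag{12.104}$$

as always. Thus, writing $\bar{E}_y$ in terms of the components of E and B in S,

$$\bar{E}_y = \gamma\left(1 + \frac{vv_0}{c^2}\right)\frac{\sigma}{\epsilon_0} = \gamma\left(E_y - \frac{v}{c^2\epsilon_0\mu_0}B_z\right),$$

whereas

$$\bar{B}_z = -\gamma\left(1 + \frac{vv_0}{c^2}\right)\mu_0\sigma\left(\frac{v + v_0}{1 + vv_0/c^2}\right) = \gamma(B_z - \mu_0\epsilon_0vE_y).$$

Or, since $\mu_0\epsilon_0 = 1/c^2$,

$$\bar{E}_y = \gamma(E_y - vB_z), \quad \bar{B}_z = \gamma\left(B_z - \frac{v}{c^2}E_y\right). \tag{12.105}$$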
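
For readers who want the "little algebra" behind Eq. 12.103 spelled out, one way to organize it (a sketch using only the velocity addition formula for $\bar{v}$ in Eq. 12.100) is:

```latex
\begin{align*}
1 - \frac{\bar{v}^2}{c^2}
  &= \frac{(1 + vv_0/c^2)^2 - (v + v_0)^2/c^2}{(1 + vv_0/c^2)^2} \\
  &= \frac{1 + v^2v_0^2/c^4 - v^2/c^2 - v_0^2/c^2}{(1 + vv_0/c^2)^2}
   = \frac{(1 - v^2/c^2)(1 - v_0^2/c^2)}{(1 + vv_0/c^2)^2},
\end{align*}
so that
\[
\bar{\gamma} = \frac{1}{\sqrt{1 - \bar{v}^2/c^2}}
             = \gamma\,\gamma_0\left(1 + \frac{vv_0}{c^2}\right),
\qquad
\frac{\bar{\gamma}}{\gamma_0} = \gamma\left(1 + \frac{vv_0}{c^2}\right).
\]
```

The cross terms $\pm 2vv_0/c^2$ cancel in the numerator, which is what lets it factor.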


This tells us how $E_y$ and $B_z$ transform; to do $E_z$ and $B_y$, we simply
align the same capacitor parallel to the xy plane instead of the xz plane
(Fig. 12.39). The fields in S are then

$$E_z = \frac{\sigma}{\epsilon_0}, \quad B_y = \mu_0\sigma v_0.$$

(Use the right-hand rule to get the sign of $B_y$.) The rest of the argument is
identical: everywhere we had $E_y$ before, read $E_z$, and everywhere we had
$B_z$, read $-B_y$:

$$\bar{E}_z = \gamma(E_z + vB_y), \quad \bar{B}_y = \gamma\left(B_y + \frac{v}{c^2}E_z\right). \tag{12.106}$$

As for the x components, we have already seen (by orienting the capacitor
parallel to the yz plane) that

$$\bar{E}_x = E_x. \tag{12.107}$$

Since in this case there is no accompanying magnetic field, we cannot
deduce the transformation rule for $B_x$. But another configuration will do
the job: imagine a long solenoid aligned parallel to the x-axis (Fig. 12.40)
and at rest in S. The magnetic field within the coil is

$$B_x = \mu_0nI, \tag{12.108}$$

where n is the number of turns per unit length, and I is the current. In
system $\bar{S}$, the length contracts, so n increases:

$$\bar{n} = \gamma n. \tag{12.109}$$

On the other hand, time dilates: the S clock, which rides along with the
solenoid, runs slow, so the current (charge per unit time) in $\bar{S}$ is given by

$$\bar{I} = \frac{1}{\gamma}I. \tag{12.110}$$

The two factors of γ exactly cancel, and we conclude that

$$\bar{B}_x = B_x.$$

Like E, the component of B parallel to the motion is unchanged.
Here, then, is the complete set of transformation rules:

$$\begin{aligned} \bar{E}_x &= E_x, & \bar{E}_y &= \gamma(E_y - vB_z), & \bar{E}_z &= \gamma(E_z + vB_y), \\ \bar{B}_x &= B_x, & \bar{B}_y &= \gamma\left(B_y + \frac{v}{c^2}E_z\right), & \bar{B}_z &= \gamma\left(B_z - \frac{v}{c^2}E_y\right), \end{aligned} \tag{12.111}$$

or, in the notation of Eq. 12.46,

$$\begin{aligned} \bar{\mathbf{E}}_\parallel &= \mathbf{E}_\parallel, & \bar{\mathbf{E}}_\perp &= \gamma(\mathbf{E}_\perp + \mathbf{v}\times\mathbf{B}_\perp), \\ \bar{\mathbf{B}}_\parallel &= \mathbf{B}_\parallel, & \bar{\mathbf{B}}_\perp &= \gamma\left(\mathbf{B}_\perp - \frac{1}{c^2}\mathbf{v}\times\mathbf{E}_\perp\right). \end{aligned} \tag{12.112}$$

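Two special cases warrant particular attention:

(1) If B = 0 in S, then

$$\bar{\mathbf{B}} = \gamma\frac{v}{c^2}(E_z\hat{\mathbf{y}} - E_y\hat{\mathbf{z}}) = \frac{v}{c^2}(\bar{E}_z\hat{\mathbf{y}} - \bar{E}_y\hat{\mathbf{z}}),$$

or, since $\mathbf{v} = v\hat{\mathbf{x}}$,

$$\bar{\mathbf{B}} = -\frac{1}{c^2}(\mathbf{v}\times\bar{\mathbf{E}}). \tag{12.113}$$

(2) If E = 0 in S, then

$$\bar{\mathbf{E}} = -\gamma v(B_z\hat{\mathbf{y}} - B_y\hat{\mathbf{z}}) = -v(\bar{B}_z\hat{\mathbf{y}} - \bar{B}_y\hat{\mathbf{z}}),$$

or

$$\bar{\mathbf{E}} = \mathbf{v}\times\bar{\mathbf{B}}. \tag{12.114}$$

In other words, if either E or B is zero (at a particular point) in one system,
then in any other system the fields (at that point) are very simply related by
Eq. 12.113 or Eq. 12.114.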
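
The transformation rules and the two special cases lend themselves to a short numerical check. The following Python sketch implements Eq. 12.111 for a boost along x and verifies Eqs. 12.113 and 12.114, along with the fact that transforming back with −v recovers the original fields. The function names, the helper `close`, and the sample field values are assumptions for illustration.

```python
import math

C = 299_792_458.0  # speed of light (m/s)

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def transform_fields(E, B, v):
    """Fields in S-bar, which moves at speed v along +x relative to S (Eq. 12.111)."""
    g = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    ex, ey, ez = E
    bx, by, bz = B
    e_bar = (ex, g * (ey - v * bz), g * (ez + v * by))
    b_bar = (bx, g * (by + v * ez / C**2), g * (bz - v * ey / C**2))
    return e_bar, b_bar

def close(a, b, tol=1e-9):
    """Componentwise comparison, relative to the largest component."""
    scale = max(max(abs(x) for x in a), max(abs(x) for x in b), 1e-300)
    return all(abs(x - y) <= tol * scale for x, y in zip(a, b))

v = 0.8 * C
vel = (v, 0.0, 0.0)

# Case 1: B = 0 in S  ->  B-bar = -(1/c^2) v x E-bar  (Eq. 12.113)
E1 = (1.0e3, 2.0e3, -5.0e2)
e_bar, b_bar = transform_fields(E1, (0.0, 0.0, 0.0), v)
case1 = tuple(-x / C**2 for x in cross(vel, e_bar))

# Case 2: E = 0 in S  ->  E-bar = v x B-bar  (Eq. 12.114)
e_bar2, b_bar2 = transform_fields((0.0, 0.0, 0.0), (1.0e-3, -2.0e-3, 4.0e-3), v)
case2 = cross(vel, b_bar2)

# Round trip: transforming back with -v should recover the original fields.
e_back, b_back = transform_fields(e_bar, b_bar, -v)

print(close(b_bar, case1), close(e_bar2, case2), close(e_back, E1))
```

All three comparisons print `True` up to rounding; the round trip is a useful sanity check whenever the rules are retyped by hand.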


Example 12.15. (Magnetic field of a point charge in uniform motion) 
Find the magnetic field of a point charge q moving at constant 
velocity v. 


Solution. In the particle’s rest frame the magnetic field is zero (everywhere),
so in a system moving with velocity −v (in which the particle is
moving at velocity +v)²⁷

$$\mathbf{B} = \frac{1}{c^2}(\mathbf{v}\times\mathbf{E}).$$

We calculated the electric field in Ex. 12.14. The magnetic field, then,
is

$$\mathbf{B} = \frac{\mu_0}{4\pi}\frac{qv(1 - v^2/c^2)\sin\theta}{[1 - (v^2/c^2)\sin^2\theta]^{3/2}}\frac{\hat{\boldsymbol{\phi}}}{R^2}, \tag{12.115}$$

where $\hat{\boldsymbol{\phi}}$ aims counterclockwise as you face the oncoming charge.
Incidentally, in the nonrelativistic limit ($v^2 \ll c^2$), Eq. 12.115 reduces
to

$$\mathbf{B} \approx \frac{\mu_0}{4\pi}\,q\,\frac{\mathbf{v}\times\hat{\mathbf{R}}}{R^2},$$

which is exactly what you would get by naive application of the Biot-Savart
law to a point charge (Eq. 5.43).
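
To get a feel for how quickly the naive Biot-Savart expression fails, this sketch compares the magnitude in Eq. 12.115 with its nonrelativistic limit at several speeds. The function names and the sample charge and distance are assumptions for illustration; the value of `MU0` is the conventional $4\pi\times10^{-7}$ T·m/A.

```python
import math

C = 299_792_458.0       # speed of light (m/s)
MU0 = 4.0e-7 * math.pi  # vacuum permeability (T m/A), conventional value

def b_exact(q, v, r, theta):
    """Magnitude of B from Eq. 12.115."""
    beta2 = (v / C) ** 2
    s = math.sin(theta)
    return (MU0 / (4.0 * math.pi) * q * v * (1.0 - beta2) * s
            / (1.0 - beta2 * s**2) ** 1.5 / r**2)

def b_biot_savart(q, v, r, theta):
    """Nonrelativistic limit: (mu0/4pi) q |v x R-hat| / R^2."""
    return MU0 / (4.0 * math.pi) * q * v * math.sin(theta) / r**2

def ratio(beta, theta):
    """Exact over naive field magnitude for a charge moving at beta*c."""
    return b_exact(1e-9, beta * C, 0.05, theta) / b_biot_savart(1e-9, beta * C, 0.05, theta)

for beta in (1e-4, 0.1, 0.6, 0.9):
    print(beta, ratio(beta, math.pi / 2))
```

Broadside to the motion (θ = 90°) the ratio is exactly γ, so the naive formula is off by 25% at 0.6c.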


Problem 12.44. Why can't the electric field in Fig. 12.35b have a z component? After all, the 
magnetic field does. 


Problem 12.45. A point charge q is at rest at the origin in a uniform magnetic field $B_0\hat{\mathbf{y}}$. You
walk by in the −x direction, at speed v (so that from your perspective the charge moves at
velocity $v\hat{\mathbf{x}}$). Why isn’t it deflected by the magnetic field?


Problem 12.46. In system $S_0$, a static uniform line charge λ coincides with the z-axis.

(a) Write the electric field $\mathbf{E}_0$ in Cartesian coordinates, for the point $(x_0, y_0, z_0)$.
(b) Use Eq. 12.111 to find the electric field in S, which moves with speed v in the x direction
with respect to $S_0$. The field is still in terms of $(x_0, y_0, z_0)$; express it instead in terms
of the coordinates (x, y, z) in S. Finally, write E in terms of the vector S from the
present location of the wire and the angle θ between S and $\hat{\mathbf{x}}$. Does the field point away
from the instantaneous location of the wire, like the field of a uniformly moving point
charge?

27 Here v is the particle’s velocity; in Eq. 12.113 it was the velocity of the reference frame.


Problem 12.47. 


(a) Charge $q_A$ is at rest at the origin in system S; charge $q_B$ flies by at speed v on a trajectory
parallel to the x-axis, but at y = d. What is the electromagnetic force on $q_B$ as it crosses
the y-axis?

(b) Now study the same problem from system $\bar{S}$, which moves to the right with speed v. What
is the force on $q_B$ when $q_A$ passes the $\bar{y}$-axis? [Do it two ways: (i) by using your answer
to (a) and transforming the force; (ii) by computing the fields in $\bar{S}$ and using the Lorentz
force law.]


Problem 12.48. Two charges, ±q, are on parallel trajectories a distance d apart, moving with
equal speeds v in opposite directions. We're interested in the force on +q due to −q at the
instant they cross (Fig. 12.41). Fill in the following table, doing all the consistency checks you
can think of as you go along.

                        | System A     | System B     | System C
                        | (Fig. 12.41) | (+q at rest) | (−q at rest)
   E at +q due to −q    |              |              |
   B at +q due to −q    |              |              |
   F on +q due to −q    |              |              |

Fig. 12.41


Problem 12.49. 


(a) Show that (E · B) is relativistically invariant.

(b) Show that ($E^2 - c^2B^2$) is relativistically invariant.

(c) Suppose that in one inertial system B = 0 but E ≠ 0 (at some point P). Is it possible to
find another system in which the electric field is zero at P?




Problem 12.50. An electromagnetic plane wave of (angular) frequency ω is traveling in the
x direction through the vacuum. It is polarized in the y direction, and the amplitude of the
electric field is $E_0$.

(a) Write down the electric and magnetic fields, E(x, y, z, t) and B(x, y, z, t). [Be sure
to define any auxiliary quantities you introduce, in terms of ω, $E_0$, and the constants of
nature.]

(b) This same wave is observed from an inertial system $\bar{S}$ moving in the x direction with speed
v relative to the original system S. Find the electric and magnetic fields in $\bar{S}$, and express
them in terms of the $\bar{S}$ coordinates: $\bar{\mathbf{E}}(\bar{x}, \bar{y}, \bar{z}, \bar{t})$ and $\bar{\mathbf{B}}(\bar{x}, \bar{y}, \bar{z}, \bar{t})$. [Again, be sure to
define any auxiliary quantities you introduce.]

(c) What is the frequency $\bar\omega$ of the wave in $\bar{S}$? Interpret this result. What is the wavelength $\bar\lambda$
of the wave in $\bar{S}$? From $\bar\omega$ and $\bar\lambda$, determine the speed of the waves in $\bar{S}$. Is it what you
expected?

(d) What is the ratio of the intensity in $\bar{S}$ to the intensity in S? As a youth, Einstein wondered
what an electromagnetic wave would look like if you could run along beside it at the speed
of light. What can you tell him about the amplitude, frequency, and intensity of the wave,
as v approaches c?


Problem 12.51. Consider the following scenario: In the half-space x < 0 the electric and
magnetic fields are zero; in the half-space x > 0 there is a uniform static magnetic field

$$\mathbf{B} = -B_0\,\hat{\mathbf{z}} \quad (x > 0),$$

but still no electric field. A point charge q (mass m) comes in from x = −∞, traveling at
constant speed v along the x-axis. At time t = 0 it reaches the origin.

(a) Describe (and sketch) its subsequent motion.

(b) Analyze this same problem from the perspective of an inertial reference frame $\bar{S}$ moving in
the x direction at speed v. [Assume v is substantially less than c, so you can drop terms
in $v^2/c^2$ (but not terms linear in v); this just means that you can use nonrelativistic
mechanics.]


12.3.3 The Field Tensor 


As Eq. 12.111 indicates, E and B certainly do not transform like the spatial 
parts of two 4-vectors — in fact, the components of E and B are stirred 
together when you go from one inertial system to another. What sort of 
an object is this, which has six components and transforms according to 
Eq. 12.111? Answer: It’s an antisymmetric, second-rank tensor. 
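Remember that a 4-vector transforms by the rule

$$\bar{a}^\mu = \Lambda^\mu_\nu a^\nu \tag{12.116}$$

(summation over ν implied), where Λ is the Lorentz transformation matrix.
If $\bar{S}$ is moving in the x direction at speed v, Λ has the form

$$\Lambda = \begin{pmatrix} \gamma & -\gamma\beta & 0 & 0 \\ -\gamma\beta & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}, \tag{12.117}$$

and $\Lambda^\mu_\nu$ is the entry in row μ, column ν. A (second-rank) tensor is an object
with two indices, which transforms with two factors of Λ (one for each
index):

$$\bar{t}^{\mu\nu} = \Lambda^\mu_\lambda\Lambda^\nu_\sigma t^{\lambda\sigma}. \tag{12.118}$$

A tensor (in 4 dimensions) has 4 × 4 = 16 components, which we can display
as a 4 × 4 array:

$$t^{\mu\nu} = \begin{pmatrix} t^{00} & t^{01} & t^{02} & t^{03} \\ t^{10} & t^{11} & t^{12} & t^{13} \\ t^{20} & t^{21} & t^{22} & t^{23} \\ t^{30} & t^{31} & t^{32} & t^{33} \end{pmatrix}.$$

However, the 16 elements need not all be different. For instance, a symmetric
tensor has the property

$$t^{\mu\nu} = t^{\nu\mu} \quad \text{(symmetric tensor)}. \tag{12.119}$$

In this case there are 10 distinct components; 6 of the 16 are repeats
($t^{01} = t^{10}$, $t^{02} = t^{20}$, $t^{03} = t^{30}$, $t^{12} = t^{21}$, $t^{13} = t^{31}$, $t^{23} = t^{32}$). Similarly, an
antisymmetric tensor obeys

$$t^{\mu\nu} = -t^{\nu\mu} \quad \text{(antisymmetric tensor)}. \tag{12.120}$$

Such an object has just 6 distinct elements: of the original 16, 6 are repeats
(the same ones as before, only this time with a minus sign) and 4 are zero
($t^{00}$, $t^{11}$, $t^{22}$, and $t^{33}$). Thus, the general antisymmetric tensor has the form

$$t^{\mu\nu} = \begin{pmatrix} 0 & t^{01} & t^{02} & t^{03} \\ -t^{01} & 0 & t^{12} & t^{13} \\ -t^{02} & -t^{12} & 0 & t^{23} \\ -t^{03} & -t^{13} & -t^{23} & 0 \end{pmatrix}.$$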

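Let’s see how the transformation rule (Eq. 12.118) works, for the 6
distinct components of an antisymmetric tensor. Starting with $\bar{t}^{01}$, we have

$$\bar{t}^{01} = \Lambda^0_\lambda\Lambda^1_\sigma t^{\lambda\sigma},$$

but according to Eq. 12.117, $\Lambda^0_\lambda = 0$ unless λ = 0 or 1, and $\Lambda^1_\sigma = 0$ unless
σ = 0 or 1. So there are four terms in the sum:

$$\bar{t}^{01} = \Lambda^0_0\Lambda^1_0t^{00} + \Lambda^0_0\Lambda^1_1t^{01} + \Lambda^0_1\Lambda^1_0t^{10} + \Lambda^0_1\Lambda^1_1t^{11}.$$

On the other hand, $t^{00} = t^{11} = 0$, while $t^{01} = -t^{10}$, so

$$\bar{t}^{01} = (\Lambda^0_0\Lambda^1_1 - \Lambda^0_1\Lambda^1_0)t^{01} = (\gamma^2 - (\gamma\beta)^2)t^{01} = t^{01}.$$

I’ll let you work out the others; the complete set of transformation rules is

$$\begin{aligned} \bar{t}^{01} &= t^{01}, & \bar{t}^{02} &= \gamma(t^{02} - \beta t^{12}), & \bar{t}^{03} &= \gamma(t^{03} + \beta t^{31}), \\ \bar{t}^{23} &= t^{23}, & \bar{t}^{31} &= \gamma(t^{31} + \beta t^{03}), & \bar{t}^{12} &= \gamma(t^{12} - \beta t^{02}). \end{aligned} \tag{12.121}$$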


These are precisely the rules we obtained on physical grounds for the
electromagnetic fields (Eq. 12.111); in fact, we can construct the field tensor
$F^{\mu\nu}$ by direct comparison:²⁸

$$F^{01} \equiv \frac{E_x}{c}, \quad F^{02} \equiv \frac{E_y}{c}, \quad F^{03} \equiv \frac{E_z}{c}, \quad F^{12} \equiv B_z, \quad F^{31} \equiv B_y, \quad F^{23} \equiv B_x.$$

Written as an array,

$$F^{\mu\nu} = \begin{pmatrix} 0 & E_x/c & E_y/c & E_z/c \\ -E_x/c & 0 & B_z & -B_y \\ -E_y/c & -B_z & 0 & B_x \\ -E_z/c & B_y & -B_x & 0 \end{pmatrix}. \tag{12.122}$$

Thus relativity completes and perfects the job begun by Oersted, combining
the electric and magnetic fields into a single entity, $F^{\mu\nu}$.

If you followed that argument with exquisite care, you may have noticed 
that there was a different way of embedding E and B in an antisymmetric 
tensor: instead of comparing the first line of Eq. 12.111 with the first line 
of Eq. 12.121, and the second with the second, we could relate the first line 
of Eq. 12.111 to the second line of Eq. 12.121, and vice versa. This leads to
the dual tensor, $G^{\mu\nu}$:

$$G^{\mu\nu} = \begin{pmatrix} 0 & B_x & B_y & B_z \\ -B_x & 0 & -E_z/c & E_y/c \\ -B_y & E_z/c & 0 & -E_x/c \\ -B_z & -E_y/c & E_x/c & 0 \end{pmatrix}. \tag{12.123}$$

This $G^{\mu\nu}$ can be obtained directly from $F^{\mu\nu}$ by the substitution E/c → B,
B → −E/c. Notice that this operation leaves Eq. 12.111 unchanged; that’s
why both tensors generate the correct transformation rules for E and B.
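
The claim that both tensors reproduce Eq. 12.111 can be tested directly: build $F^{\mu\nu}$ and $G^{\mu\nu}$ from sample fields, apply two factors of Λ (Eq. 12.118), and compare with the tensors built from the transformed fields. The sketch below uses plain Python lists; the function names, the choice of units with c = 1, and the sample field values are assumptions for illustration.

```python
import math

def boost_x(beta):
    """Lorentz transformation matrix of Eq. 12.117."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return [[g, -g * beta, 0.0, 0.0],
            [-g * beta, g, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]

def field_tensor(E, B, c=1.0):
    """F^{mu nu} of Eq. 12.122."""
    ex, ey, ez = (x / c for x in E)
    bx, by, bz = B
    return [[0.0, ex, ey, ez],
            [-ex, 0.0, bz, -by],
            [-ey, -bz, 0.0, bx],
            [-ez, by, -bx, 0.0]]

def dual_tensor(E, B, c=1.0):
    """G^{mu nu} of Eq. 12.123."""
    ex, ey, ez = (x / c for x in E)
    bx, by, bz = B
    return [[0.0, bx, by, bz],
            [-bx, 0.0, -ez, ey],
            [-by, ez, 0.0, -ex],
            [-bz, -ey, ex, 0.0]]

def transform_tensor(lam, t):
    """Two factors of Lambda, one per index (Eq. 12.118)."""
    return [[sum(lam[m][l] * lam[n][s] * t[l][s]
                 for l in range(4) for s in range(4))
             for n in range(4)] for m in range(4)]

def transform_fields(E, B, v, c=1.0):
    """Eq. 12.111 for a boost at speed v along x."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    e_bar = (E[0], g * (E[1] - v * B[2]), g * (E[2] + v * B[1]))
    b_bar = (B[0], g * (B[1] + v * E[2] / c**2), g * (B[2] - v * E[1] / c**2))
    return e_bar, b_bar

def max_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(4) for j in range(4))

# Illustrative (assumed) fields, in units where c = 1.
E, B, beta = (1.0, 2.0, -0.5), (0.3, -0.7, 1.1), 0.6
lam = boost_x(beta)
e_bar, b_bar = transform_fields(E, B, beta)
diff_f = max_diff(transform_tensor(lam, field_tensor(E, B)), field_tensor(e_bar, b_bar))
diff_g = max_diff(transform_tensor(lam, dual_tensor(E, B)), dual_tensor(e_bar, b_bar))
print(diff_f, diff_g)
```

Both differences come out at the level of rounding error, which is the numerical counterpart of the statement that the substitution E/c → B, B → −E/c leaves Eq. 12.111 unchanged.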


Problem 12.52. Work out the remaining five parts to Eq. 12.121. 


Problem 12.53. Prove that the symmetry (or antisymmetry) of a tensor is preserved by Lorentz
transformation (that is, if $t^{\mu\nu}$ is symmetric, show that $\bar{t}^{\mu\nu}$ is also symmetric, and likewise for
antisymmetric).


Problem 12.54. Recall that a covariant 4-vector is obtained from a contravariant one by
changing the sign of the zeroth component. The same goes for tensors: when you “lower an
index” to make it covariant, you change the sign if that index is zero. Compute the tensor
invariants

$$F^{\mu\nu}F_{\mu\nu}, \quad G^{\mu\nu}G_{\mu\nu}, \quad \text{and} \quad F^{\mu\nu}G_{\mu\nu},$$

in terms of E and B. Compare Prob. 12.49.

28 Some authors prefer the convention $F^{01} \equiv E_x$, $F^{12} \equiv cB_z$, and so on, and some use the
opposite signs. Accordingly, most of the equations from here on will look a little different,
depending on the text.


Problem 12.55. A straight wire along the z-axis carries a charge density λ traveling in the
+z direction at speed v. Construct the field tensor and the dual tensor at the point (x, 0, 0).


12.3.4 Electrodynamics in Tensor Notation 


Now that we know how to represent the fields in relativistic notation, it is 
time to reformulate the laws of electrodynamics (Maxwell’s equations and 
the Lorentz force law) in that language. To begin with, we must determine 
how the sources of the fields, ρ and J, transform. Imagine a cloud of charge
drifting by; we concentrate on an infinitesimal volume V, which contains
charge Q moving at velocity u (Fig. 12.42). The charge density is

$$\rho = \frac{Q}{V},$$

and the current density²⁹ is

$$\mathbf{J} = \rho\mathbf{u}.$$

I would like to express these quantities in terms of the proper charge density
$\rho_0$, the density in the rest system of the charge:

$$\rho_0 = \frac{Q}{V_0},$$

where $V_0$ is the rest volume of the cloud. Because one dimension (the one
along the direction of motion) is Lorentz-contracted,

$$V = \sqrt{1 - u^2/c^2}\,V_0, \tag{12.124}$$

and hence

$$\rho = \rho_0\frac{1}{\sqrt{1 - u^2/c^2}}, \quad \mathbf{J} = \rho_0\frac{\mathbf{u}}{\sqrt{1 - u^2/c^2}}. \tag{12.125}$$

Comparing this with Eqs. 12.40 and 12.42, we recognize here the components
of proper velocity, multiplied by the invariant $\rho_0$. Evidently, charge
density and current density go together to make a 4-vector:

$$J^\mu = \rho_0\eta^\mu, \tag{12.126}$$

whose components are

$$J^\mu = (c\rho, J_x, J_y, J_z). \tag{12.127}$$

We’ll call it the current density 4-vector.

29 I’m assuming all the charge in V is of one sign, and it all goes at the same speed. If not,
you have to treat the constituents separately (see Chapter 5, footnote 8), but the result is
the same.

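The continuity equation (Eq. 5.29),

$$\nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t},$$

expressing the local conservation of charge, takes on a nice compact form
when written in terms of $J^\mu$. For

$$\nabla\cdot\mathbf{J} = \frac{\partial J_x}{\partial x} + \frac{\partial J_y}{\partial y} + \frac{\partial J_z}{\partial z} = \sum_{i=1}^{3}\frac{\partial J^i}{\partial x^i},$$

while

$$\frac{\partial\rho}{\partial t} = \frac{1}{c}\frac{\partial J^0}{\partial t} = \frac{\partial J^0}{\partial x^0}. \tag{12.128}$$

Thus, bringing ∂ρ/∂t over to the left-hand side (in the continuity equation),
we have

$$\frac{\partial J^\mu}{\partial x^\mu} = 0, \tag{12.129}$$

with summation over μ implied. Incidentally, $\partial J^\mu/\partial x^\mu$ is the four-dimensional
divergence of $J^\mu$, so the continuity equation states that the
current density 4-vector is divergenceless.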
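
That $J^\mu$ really transforms as a 4-vector can be illustrated with a one-dimensional example: a cloud with proper density $\rho_0$ moving at speed u along x in S, viewed from a frame moving at speed v. The sketch below boosts $J^\mu$ with Λ and compares the result with $\rho_0\eta^\mu$ built from the velocity given by Einstein's addition rule. The function names and sample values are assumptions for illustration.

```python
import math

C = 299_792_458.0  # speed of light (m/s)

def gamma(speed):
    return 1.0 / math.sqrt(1.0 - (speed / C) ** 2)

def four_current(rho0, u):
    """J^mu = rho0 * eta^mu for charge moving at speed u along x (Eqs. 12.126, 12.127)."""
    g = gamma(u)
    return [C * g * rho0, g * rho0 * u, 0.0, 0.0]

def boost_x(v):
    """Lorentz transformation matrix of Eq. 12.117, with beta = v/c."""
    g, beta = gamma(v), v / C
    return [[g, -g * beta, 0.0, 0.0],
            [-g * beta, g, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]

def apply(lam, a):
    return [sum(lam[m][n] * a[n] for n in range(4)) for m in range(4)]

# Illustrative (assumed) values.
rho0, u, v = 2.0e-6, 0.5 * C, 0.3 * C
j_bar = apply(boost_x(v), four_current(rho0, u))

# Independent route: find the cloud's velocity in the new frame first.
u_bar = (u - v) / (1.0 - u * v / C**2)
j_expected = four_current(rho0, u_bar)
print(j_bar)
print(j_expected)
```

The zeroth component divided by c is the charge density in the new frame, $\bar\rho = \bar\gamma\rho_0$, in agreement with Eq. 12.125.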

As for Maxwell’s equations, they can be written 


sgt “20, (12.130) 


with summation over v implied. Each of these stands for four equations — 
one for every value of u. If u = 0, the first equation reads 


OF” OF” OF! OF” OF 
= + + + 
Ox” 0x9 Ox! Ox? 0x3 
1,0E, OE, dE. 1 
= - = pe -E 
n a a ) 
= oJ’ = uocp, 
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or 


1 
V-E=—p. 
ar 


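This, of course, is Gauss’s law. If $\mu = 1$, we have

$$\begin{aligned}
\frac{\partial F^{1\nu}}{\partial x^\nu} &= \frac{\partial F^{10}}{\partial x^0} + \frac{\partial F^{11}}{\partial x^1} + \frac{\partial F^{12}}{\partial x^2} + \frac{\partial F^{13}}{\partial x^3} \\
&= -\frac{1}{c^2}\frac{\partial E_x}{\partial t} + \frac{\partial B_z}{\partial y} - \frac{\partial B_y}{\partial z} = \left(-\frac{1}{c^2}\frac{\partial\mathbf{E}}{\partial t} + \nabla\times\mathbf{B}\right)_x \\
&= \mu_0 J^1 = \mu_0 J_x.
\end{aligned}$$

Combining this with the corresponding results for $\mu = 2$ and $\mu = 3$ gives

$$\nabla\times\mathbf{B} = \mu_0\mathbf{J} + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t},$$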


which is Ampère’s law with Maxwell’s correction. 
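Meanwhile, the second equation in 12.130, with $\mu = 0$, becomes

$$\begin{aligned}
\frac{\partial G^{0\nu}}{\partial x^\nu} &= \frac{\partial G^{00}}{\partial x^0} + \frac{\partial G^{01}}{\partial x^1} + \frac{\partial G^{02}}{\partial x^2} + \frac{\partial G^{03}}{\partial x^3} \\
&= \frac{\partial B_x}{\partial x} + \frac{\partial B_y}{\partial y} + \frac{\partial B_z}{\partial z} = \nabla\cdot\mathbf{B} = 0
\end{aligned}$$

(the third of Maxwell’s equations), whereas $\mu = 1$ yields

$$\begin{aligned}
\frac{\partial G^{1\nu}}{\partial x^\nu} &= \frac{\partial G^{10}}{\partial x^0} + \frac{\partial G^{11}}{\partial x^1} + \frac{\partial G^{12}}{\partial x^2} + \frac{\partial G^{13}}{\partial x^3} \\
&= -\frac{1}{c}\frac{\partial B_x}{\partial t} - \frac{1}{c}\frac{\partial E_z}{\partial y} + \frac{1}{c}\frac{\partial E_y}{\partial z} = -\frac{1}{c}\left(\frac{\partial\mathbf{B}}{\partial t} + \nabla\times\mathbf{E}\right)_x = 0.
\end{aligned}$$

So, combining this with the corresponding results for $\mu = 2$ and $\mu = 3$,

$$\nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t},$$

which is Faraday’s law. In relativistic notation, then, Maxwell’s four rather
cumbersome equations reduce to two delightfully simple ones.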
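In terms of $F^{\mu\nu}$ and the proper velocity $\eta^\mu$, the Minkowski force on a
charge $q$ is given by

$$K^\mu = q\,\eta_\nu F^{\mu\nu}. \tag{12.131}$$

For if $\mu = 1$, we get

$$\begin{aligned}
K^1 &= q\,\eta_\nu F^{1\nu} = q\left(-\eta^0 F^{10} + \eta^1 F^{11} + \eta^2 F^{12} + \eta^3 F^{13}\right) \\
&= q\left[\frac{-c}{\sqrt{1 - u^2/c^2}}\left(\frac{-E_x}{c}\right) + \frac{u_y}{\sqrt{1 - u^2/c^2}}\,(B_z) + \frac{u_z}{\sqrt{1 - u^2/c^2}}\,(-B_y)\right] \\
&= \frac{q}{\sqrt{1 - u^2/c^2}}\left[\mathbf{E} + (\mathbf{u}\times\mathbf{B})\right]_x,
\end{aligned}$$

with a similar formula for $\mu = 2$ and $\mu = 3$. Thus,

$$\mathbf{K} = \frac{q}{\sqrt{1 - u^2/c^2}}\left[\mathbf{E} + (\mathbf{u}\times\mathbf{B})\right], \tag{12.132}$$

and therefore, referring back to Eq. 12.70,

$$\mathbf{F} = q\left[\mathbf{E} + (\mathbf{u}\times\mathbf{B})\right],$$

which is the Lorentz force law. Equation 12.131, then, represents the
Lorentz force law in relativistic notation. I’ll leave for you the interpretation
of the zeroth component (see Prob. 12.58).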
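The index gymnastics in Eq. 12.131 can be checked numerically. The sketch below is a minimal illustration, assuming the component layout $F^{01} = E_x/c$, $F^{12} = B_z$, and so on, used in this chapter; the function names and the sample field values are hypothetical. It contracts the covariant proper velocity with the field tensor and compares the spatial components with $q[\mathbf{E} + (\mathbf{u}\times\mathbf{B})]/\sqrt{1 - u^2/c^2}$.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def field_tensor(E, B):
    """F^{mu nu} with F^{01} = E_x/c, F^{12} = B_z, etc. (antisymmetric)."""
    ex, ey, ez = (component / C for component in E)
    bx, by, bz = B
    return [
        [0.0,  ex,   ey,   ez],
        [-ex,  0.0,  bz,  -by],
        [-ey, -bz,   0.0,  bx],
        [-ez,  by,  -bx,   0.0],
    ]


def lorentz_factor(u):
    return 1.0 / math.sqrt(1.0 - sum(x * x for x in u) / C ** 2)


def minkowski_force(q, u, E, B):
    """K^mu = q * eta_nu * F^{mu nu}, summed over nu (Eq. 12.131)."""
    g = lorentz_factor(u)
    # Covariant proper velocity: the zeroth component changes sign.
    eta_lower = (-g * C, g * u[0], g * u[1], g * u[2])
    F = field_tensor(E, B)
    return [q * sum(eta_lower[nu] * F[mu][nu] for nu in range(4)) for mu in range(4)]


def lorentz_force(q, u, E, B):
    """Ordinary force q [E + (u x B)]."""
    cross = (u[1] * B[2] - u[2] * B[1],
             u[2] * B[0] - u[0] * B[2],
             u[0] * B[1] - u[1] * B[0])
    return [q * (E[i] + cross[i]) for i in range(3)]


q = 1.602e-19                      # charge in coulombs (illustrative)
u = (0.3 * C, -0.4 * C, 0.5 * C)   # ordinary velocity, |u| < c
E = (1.0e3, -2.0e3, 5.0e2)         # V/m
B = (0.01, 0.02, -0.03)            # T

K = minkowski_force(q, u, E, B)
F_ordinary = lorentz_force(q, u, E, B)
g = lorentz_factor(u)
for i in range(3):
    print(K[i + 1], g * F_ordinary[i])   # each pair should agree
```

The zeroth component `K[0]` is computed as well but deliberately left uninterpreted here.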


Problem 12.56. Obtain the continuity equation (Eq. 12.129) directly from Maxwell's equations 
(Eq. 12.130). 


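Problem 12.57. Show that the second equation in 12.130 can be expressed in terms of the field
tensor $F^{\mu\nu}$ as follows:

$$\frac{\partial F_{\mu\nu}}{\partial x^\lambda} + \frac{\partial F_{\nu\lambda}}{\partial x^\mu} + \frac{\partial F_{\lambda\mu}}{\partial x^\nu} = 0. \tag{12.133}$$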


Problem 12.58. Work out, and interpret physically, the $\mu = 0$ component of the electromagnetic
force law, Eq. 12.131.


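Problem 12.59. Maxwell’s equations are invariant under Lorentz transformations. That is, if
they hold in $S$,

$$\nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho, \quad \nabla\cdot\mathbf{B} = 0, \quad \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}, \quad \nabla\times\mathbf{B} = \mu_0\mathbf{J} + \mu_0\epsilon_0\frac{\partial\mathbf{E}}{\partial t},$$

then they automatically hold as well in $\bar{S}$:

$$\bar\nabla\cdot\bar{\mathbf{E}} = \frac{1}{\epsilon_0}\bar\rho, \quad \bar\nabla\cdot\bar{\mathbf{B}} = 0, \quad \bar\nabla\times\bar{\mathbf{E}} = -\frac{\partial\bar{\mathbf{B}}}{\partial\bar t}, \quad \bar\nabla\times\bar{\mathbf{B}} = \mu_0\bar{\mathbf{J}} + \mu_0\epsilon_0\frac{\partial\bar{\mathbf{E}}}{\partial\bar t}.$$

Prove it, using the transformation rules 12.111 and the fact that the current
$J^\mu = (c\rho, J_x, J_y, J_z)$ is a 4-vector.³⁰ [Hint: Note that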


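$$\frac{\partial}{\partial\bar t} = \frac{\partial t}{\partial\bar t}\frac{\partial}{\partial t} + \frac{\partial x}{\partial\bar t}\frac{\partial}{\partial x} + \frac{\partial y}{\partial\bar t}\frac{\partial}{\partial y} + \frac{\partial z}{\partial\bar t}\frac{\partial}{\partial z} = \gamma\left(\frac{\partial}{\partial t} + v\frac{\partial}{\partial x}\right),$$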


and similarly for the other derivatives.] 


12.3.5 Relativistic Potentials 


From Chapter 10, we know that the electric and magnetic fields can be 
expressed in terms of a scalar potential V and a vector potential A: 


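$$\mathbf{E} = -\nabla V - \frac{\partial\mathbf{A}}{\partial t}, \qquad \mathbf{B} = \nabla\times\mathbf{A}. \tag{12.134}$$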


30 Of course, you can do it much more efficiently using tensor notation — in fact, it is trivial 
given Eq. 12.130, since both sides transform the same way; part of the beauty of this 
language is that it makes consistency with relativity manifest visually. But the point of this 
problem is to see how it works out in the more familiar three-dimensional notation. 




As you might guess, V and A together constitute a 4-vector: 


$$A^\mu = (V/c,\, A_x,\, A_y,\, A_z). \tag{12.135}$$

In terms of this 4-vector potential, the field tensor can be written

$$F^{\mu\nu} = \frac{\partial A^\nu}{\partial x_\mu} - \frac{\partial A^\mu}{\partial x_\nu}. \tag{12.136}$$

(Observe that the differentiation is with respect to the covariant vectors $x_\mu$
and $x_\nu$; remember, that changes the sign of the zeroth component: $x_0 = -x^0$.
See Prob. 12.60.)
To check that Eq. 12.136 is equivalent to Eq. 12.134, let’s evaluate a few 
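terms explicitly. For $\mu = 0$, $\nu = 1$,

$$\begin{aligned}
F^{01} &= \frac{\partial A^1}{\partial x_0} - \frac{\partial A^0}{\partial x_1} = -\frac{\partial A_x}{\partial(ct)} - \frac{1}{c}\frac{\partial V}{\partial x} \\
&= -\frac{1}{c}\left(\frac{\partial\mathbf{A}}{\partial t} + \nabla V\right)_x = \frac{E_x}{c}.
\end{aligned}$$

That (and its companions with $\nu = 2$ and $\nu = 3$) is the first equation in
Eq. 12.134. For $\mu = 1$, $\nu = 2$, we get

$$F^{12} = \frac{\partial A^2}{\partial x_1} - \frac{\partial A^1}{\partial x_2} = \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} = (\nabla\times\mathbf{A})_z = B_z,$$

which (together with the corresponding results for $F^{23}$ and $F^{31}$) is the
second equation in Eq. 12.134.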

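The potential formulation automatically takes care of the homogeneous
Maxwell equation ($\partial G^{\mu\nu}/\partial x^\nu = 0$). As for the inhomogeneous equation
($\partial F^{\mu\nu}/\partial x^\nu = \mu_0 J^\mu$), that becomes

$$\frac{\partial}{\partial x_\mu}\left(\frac{\partial A^\nu}{\partial x^\nu}\right) - \frac{\partial}{\partial x_\nu}\left(\frac{\partial A^\mu}{\partial x^\nu}\right) = \mu_0 J^\mu. \tag{12.137}$$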


This is an intractable equation as it stands. However, you will recall that the
potentials are not uniquely determined by the fields; in fact, it’s clear from
Eq. 12.136 that you could add to $A^\mu$ the gradient of any scalar function $\lambda$:

$$A^\mu \longrightarrow A'^\mu = A^\mu + \frac{\partial\lambda}{\partial x_\mu}, \tag{12.138}$$

without changing $F^{\mu\nu}$. This is precisely the gauge invariance we noted in
Chapter 10; we can exploit it to simplify Eq. 12.137. In particular, the
Lorenz gauge condition (Eq. 10.12)


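$$\nabla\cdot\mathbf{A} = -\frac{1}{c^2}\frac{\partial V}{\partial t}$$

becomes, in relativistic notation,

$$\frac{\partial A^\mu}{\partial x^\mu} = 0. \tag{12.139}$$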




In the Lorenz gauge, therefore, Eq. 12.137 reduces to the inhomogeneous 
wave equation (10.16): 


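$$\Box^2 A^\mu = -\mu_0 J^\mu, \tag{12.140}$$

where $\Box^2$ is the d’Alembertian (10.15):

$$\Box^2 \equiv \frac{\partial}{\partial x_\nu}\frac{\partial}{\partial x^\nu} = \nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}. \tag{12.141}$$

Equation 12.140 combines our previous results into a single 4-vector equation;
it represents the most elegant formulation of Maxwell’s equations.³¹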
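For readers who want the two intermediate steps spelled out, here is a short worked check in LaTeX. It uses only Eqs. 12.136 through 12.139 and the fact that mixed partial derivatives commute; the primed tensor $F'^{\mu\nu}$ is notation introduced here for the field tensor built from the gauge-transformed potential.

```latex
% Gauge invariance: the extra terms from Eq. 12.138 cancel in Eq. 12.136.
F'^{\mu\nu}
  = \frac{\partial A'^{\nu}}{\partial x_\mu} - \frac{\partial A'^{\mu}}{\partial x_\nu}
  = F^{\mu\nu}
    + \frac{\partial^2 \lambda}{\partial x_\mu\,\partial x_\nu}
    - \frac{\partial^2 \lambda}{\partial x_\nu\,\partial x_\mu}
  = F^{\mu\nu}.

% Lorenz gauge: with \partial A^\nu / \partial x^\nu = 0 the first term of
% Eq. 12.137 drops out, leaving the wave equation 12.140.
-\frac{\partial}{\partial x_\nu}\left(\frac{\partial A^\mu}{\partial x^\nu}\right)
  = -\Box^2 A^\mu = \mu_0 J^\mu
  \quad\Longrightarrow\quad
  \Box^2 A^\mu = -\mu_0 J^\mu .
```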


Problem 12.60. You may have noticed that the four-dimensional gradient operator
$\partial/\partial x^\mu$ functions like a covariant 4-vector; in fact, it is often written $\partial_\mu$, for short. For
instance, the continuity equation, $\partial_\mu J^\mu = 0$, has the form of an invariant product of two
vectors. The corresponding contravariant gradient would be $\partial^\mu \equiv \partial/\partial x_\mu$. Prove that $\partial^\mu\phi$
is a (contravariant) 4-vector, if $\phi$ is a scalar function, by working out its transformation law,
using the chain rule.


Problem 12.61. Show that the potential representation (Eq. 12.136) automatically satisfies
$\partial G^{\mu\nu}/\partial x^\nu = 0$. [Suggestion: Use Prob. 12.57.]


Problem 12.62. Show that the Liénard–Wiechert potentials (Eqs. 10.46 and 10.47) can be
expressed in relativistic notation as

$$A^\mu = -\frac{q}{4\pi\epsilon_0 c}\,\frac{\eta^\mu}{(\eta^\nu \mathscr{r}_\nu)},$$

where $\mathscr{r}^\mu \equiv x^\mu - w^\mu(t_r)$.


More Problems on Chapter 12 


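Problem 12.63. Inertial system $\bar{S}$ moves at constant velocity $\mathbf{v} = \beta c(\cos\phi\,\hat{\mathbf{x}} + \sin\phi\,\hat{\mathbf{y}})$
with respect to $S$. Their axes are parallel to one another, and their origins coincide at $t = \bar t = 0$,
as usual. Find the Lorentz transformation matrix $\Lambda$ (Eq. 12.25).

Answer:

$$\Lambda = \begin{pmatrix}
\gamma & -\gamma\beta\cos\phi & -\gamma\beta\sin\phi & 0 \\
-\gamma\beta\cos\phi & (\gamma\cos^2\phi + \sin^2\phi) & (\gamma - 1)\sin\phi\cos\phi & 0 \\
-\gamma\beta\sin\phi & (\gamma - 1)\sin\phi\cos\phi & (\gamma\sin^2\phi + \cos^2\phi) & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}$$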


31 Incidentally, the Coulomb gauge is bad, from the point of view of relativity, because its
defining condition, $\nabla\cdot\mathbf{A} = 0$, is destroyed by Lorentz transformation. To restore this
condition, it is necessary to perform an appropriate gauge transformation every time you
go to a new inertial system, in addition to the Lorentz transformation itself. In this sense,
$A^\mu$ is not a true 4-vector, in the Coulomb gauge.
Problem 12.64. Calculate the threshold (minimum) momentum the pion must have in order
for the process $\pi + p \to K + \Sigma$ to occur. The proton $p$ is initially at rest. Use $m_\pi c^2 = 150$,
$m_K c^2 = 500$, $m_p c^2 = 900$, $m_\Sigma c^2 = 1200$ (all in MeV). [Hint: To formulate the threshold
condition, examine the collision in the center-of-momentum frame (Prob. 12.31).] [Answer:
1133 MeV/c.]


Problem 12.65. A particle of mass $m$ collides elastically with an identical particle at rest.
Classically, the outgoing trajectories always make an angle of 90°. Calculate this angle
relativistically, in terms of $\phi$, the scattering angle, and $v$, the speed, in the center-of-momentum
frame. [Answer: $\tan^{-1}(2c^2/v^2\gamma\sin\phi)$.]


Problem 12.66. Find $x$ as a function of $t$ for motion starting from rest at the origin under
the influence of a constant Minkowski force in the $x$ direction. Leave your answer in implicit
form ($t$ as a function of $x$). [Answer: $2Kt/mc = z\sqrt{1 + z^2} + \ln(z + \sqrt{1 + z^2})$, where
$z \equiv \sqrt{2Kx/mc^2}$.]


Problem 12.67. An electric dipole consists of two point charges ($\pm q$), each of mass $m$, fixed
to the ends of a (massless) rod of length $d$. (Do not assume $d$ is small.)


(a) Find the net self-force on the dipole when it undergoes hyperbolic motion (Eq. 12.62) along
a line perpendicular to its axis. [Hint: Start by appropriately modifying Eq. 11.90.]

(b) Notice that this self-force is constant ($t$ drops out), and points in the direction of motion,
just right to produce hyperbolic motion. Thus it is possible for the dipole to undergo
self-sustaining accelerated motion with no external force at all!³² (Where do you suppose
the energy comes from?) Determine the self-sustaining force, $F$, in terms of $m$, $q$, and $d$.

[Answer: $(2mc^2/d)\sqrt{(\mu_0 q^2/8\pi m d)^{2/3} - 1}$.]


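Problem 12.68. An ideal magnetic dipole $\mathbf{m}$ is located at the origin of an inertial system $\bar{S}$
that moves with speed $v$ in the $x$ direction with respect to inertial system $S$. In $\bar{S}$ the vector
potential is

$$\bar{\mathbf{A}} = \frac{\mu_0}{4\pi}\,\frac{\bar{\mathbf{m}}\times\hat{\bar{\mathbf{r}}}}{\bar r^2}$$

(Eq. 5.85), and the scalar potential $\bar V$ is zero.


(a) Find the scalar potential $V$ in $S$.

[Answer: $\dfrac{1}{4\pi\epsilon_0}\,\dfrac{\hat{\mathbf{R}}\cdot(\mathbf{v}\times\mathbf{m})}{c^2R^2}\,\dfrac{(1 - v^2/c^2)}{\left(1 - (v^2/c^2)\sin^2\theta\right)^{3/2}}$]

(b) In the nonrelativistic limit, show that the scalar potential in $S$ is that of an ideal electric
dipole of magnitude

$$\mathbf{p} = \frac{\mathbf{v}\times\mathbf{m}}{c^2}$$

located at $\bar{O}$.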


32 F. H. J. Cornish, Am. J. Phys. 54, 166 (1986).


Fig. 12.43 


Problem 12.69. A stationary magnetic dipole, $\mathbf{m} = m\,\hat{\mathbf{z}}$, is situated above an infinite uniform
surface current, $\mathbf{K} = K\,\hat{\mathbf{x}}$ (Fig. 12.43).


(a) Find the torque on the dipole, using Eq. 6.1.

(b) Suppose that the surface current consists of a uniform surface charge $\sigma$, moving at velocity
$\mathbf{v} = v\,\hat{\mathbf{x}}$, so that $\mathbf{K} = \sigma\mathbf{v}$, and the magnetic dipole consists of a uniform line charge $\lambda$,
circulating at speed $v$ (same $v$) around a square loop of side $l$, as shown, so that $m = \lambda v l^2$.
Examine the same configuration from the point of view of system $\bar{S}$, moving in the
$x$ direction at speed $v$. In $\bar{S}$, the surface charge is at rest, so it generates no magnetic field.
Show that in this frame the current loop carries an electric dipole moment, and calculate
the resulting torque, using Eq. 4.4.


Problem 12.70. In a certain inertial frame $S$, the electric field $\mathbf{E}$ and the magnetic field $\mathbf{B}$
are neither parallel nor perpendicular, at a particular space-time point. Show that in a different
inertial system $\bar{S}$, moving relative to $S$ with velocity $\mathbf{v}$ given by

$$\frac{\mathbf{v}}{1 + v^2/c^2} = \frac{\mathbf{E}\times\mathbf{B}}{B^2 + E^2/c^2},$$

the fields $\bar{\mathbf{E}}$ and $\bar{\mathbf{B}}$ are parallel at that point. Is there a frame in which the two are perpendicular?


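Problem 12.71. “Derive” the Lorentz force law, as follows: Let charge $q$ be at rest in $\bar{S}$, so
$\bar{\mathbf{F}} = q\bar{\mathbf{E}}$, and let $\bar{S}$ move with velocity $\mathbf{v} = v\,\hat{\mathbf{x}}$ with respect to $S$. Use the transformation
rules (Eqs. 12.68 and 12.111) to rewrite $\bar{\mathbf{F}}$ in terms of $\mathbf{F}$, and $\bar{\mathbf{E}}$ in terms of $\mathbf{E}$ and $\mathbf{B}$. From these,
deduce the formula for $\mathbf{F}$ in terms of $\mathbf{E}$ and $\mathbf{B}$.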


Problem 12.72. A charge $q$ is released from rest at the origin, in the presence of a uniform
electric field $\mathbf{E} = E_0\,\hat{\mathbf{z}}$ and a uniform magnetic field $\mathbf{B} = B_0\,\hat{\mathbf{x}}$. Determine the trajectory of the
particle by transforming to a system in which $\mathbf{E} = 0$, finding the path in that system and then
transforming back to the original system. Assume $E_0 < cB_0$. Compare your result with Ex. 5.2.


Problem 12.73. 


(a) Construct a tensor $D^{\mu\nu}$ (analogous to $F^{\mu\nu}$) out of $\mathbf{D}$ and $\mathbf{H}$. Use it to express Maxwell’s
equations inside matter in terms of the free current density $J_f^\mu$. [Answer: $D^{01} \equiv cD_x$,
$D^{12} \equiv H_z$, etc.; $\partial D^{\mu\nu}/\partial x^\nu = J_f^\mu$.]

(b) Construct the dual tensor $H^{\mu\nu}$ (analogous to $G^{\mu\nu}$). [Answer: $H^{01} \equiv H_x$, $H^{12} \equiv -cD_z$, etc.]

(c) Minkowski proposed the relativistic constitutive relations for linear media:

$$D^{\mu\nu}\eta_\nu = c^2\epsilon\,F^{\mu\nu}\eta_\nu \quad\text{and}\quad H^{\mu\nu}\eta_\nu = \frac{1}{\mu}\,G^{\mu\nu}\eta_\nu,$$

where $\epsilon$ is the proper³³ permittivity, $\mu$ is the proper permeability, and $\eta^\mu$ is the 4-velocity
of the material. Show that Minkowski’s formulas reproduce Eqs. 4.32 and 6.31, when the
material is at rest.

(d) Work out the formulas relating $\mathbf{D}$ and $\mathbf{H}$ to $\mathbf{E}$ and $\mathbf{B}$ for a medium moving with (ordinary)
velocity $\mathbf{u}$.


Problem 12.74. Use the Larmor formula (Eq. 11.70) and special relativity to derive the Liénard 
formula (Eq. 11.73). 


Problem 12.75. The natural relativistic generalization of the Abraham–Lorentz formula
(Eq. 11.80) would seem to be

$$K^\mu_{\text{rad}} = \frac{\mu_0 q^2}{6\pi c}\,\frac{d\alpha^\mu}{d\tau}.$$

This is certainly a 4-vector, and it reduces to the Abraham–Lorentz formula in the nonrelativistic
limit $v \ll c$.


(a) Show, nevertheless, that this is not a possible Minkowski force. [Hint: See Prob. 12.39d.] 

(b) Find a correction term that, when added to the right-hand side, removes the objection you 
raised in (a), without affecting the 4-vector character of the formula or its nonrelativistic 
limit. 


Problem 12.76. Generalize the laws of relativistic electrodynamics (Eqs. 12.130 and 12.131) to 
include magnetic charge. [Refer to Section 7.3.4.] 


33 As always, “proper” means “in the rest frame of the material.” 
34 For interesting commentary on the relativistic radiation reaction, see F. Rohrlich, Am. J.
Phys. 65, 1051 (1997).


Vector Calculus in Curvilinear Coordinates 


A.1 Introduction 


In this appendix I sketch proofs of the three fundamental theorems of 
vector calculus. My aim is to convey the essence of the argument, not to 
track down every epsilon and delta. A much more elegant, modern, and 
unified — but necessarily also much longer — treatment will be found in M. 
Spivak’s book, Calculus on Manifolds, Benjamin, New York (1965). 

For the sake of generality, I shall use arbitrary (orthogonal) curvilinear 
coordinates (u,v,w), developing formulas for the gradient, divergence, 
curl, and Laplacian in any such system. You can then specialize them to 
Cartesian, spherical, or cylindrical coordinates, or any other system you 
might wish to use. If the generality bothers you on a first reading, and you’d 
rather stick to Cartesian coordinates, just read (x, y, z) wherever you see 
(u, v, w), and make the associated simplifications as you go along. 


A.2 Notation 


We identify a point in space by its three coordinates, $u$, $v$, and $w$ (in the
Cartesian system, $(x, y, z)$; in the spherical system, $(r, \theta, \phi)$; and in the
cylindrical system, $(s, \phi, z)$). I shall assume the system is orthogonal, in the
sense that the three unit vectors, $\hat{\mathbf{u}}$, $\hat{\mathbf{v}}$, and $\hat{\mathbf{w}}$, pointing in the direction of
the increase of the corresponding coordinates, are mutually perpendicular.
Note that the unit vectors are functions of position, since their directions
(except in the Cartesian case) vary from point to point. Any vector can
be expressed in terms of $\hat{\mathbf{u}}$, $\hat{\mathbf{v}}$, and $\hat{\mathbf{w}}$; in particular, the infinitesimal
displacement vector from $(u, v, w)$ to $(u + du, v + dv, w + dw)$ can be written as

$$d\mathbf{l} = f\,du\,\hat{\mathbf{u}} + g\,dv\,\hat{\mathbf{v}} + h\,dw\,\hat{\mathbf{w}}, \tag{A.1}$$

where $f$, $g$, and $h$ are functions of position characteristic of the particular
coordinate system (in Cartesian coordinates $f = g = h = 1$; in spherical
coordinates $f = 1$, $g = r$, $h = r\sin\theta$; and in cylindrical coordinates $f = h = 1$,
$g = s$). As you’ll soon see, these three functions tell you everything you
need to know about a coordinate system.
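A.3 Gradient


If you move from point $(u, v, w)$ to point $(u + du, v + dv, w + dw)$, a scalar
function $t(u, v, w)$ changes by an amount

$$dt = \frac{\partial t}{\partial u}\,du + \frac{\partial t}{\partial v}\,dv + \frac{\partial t}{\partial w}\,dw; \tag{A.2}$$

this is a standard theorem on partial differentiation.¹ We can write it as a
dot product,

$$dt = \nabla t\cdot d\mathbf{l} = (\nabla t)_u\,f\,du + (\nabla t)_v\,g\,dv + (\nabla t)_w\,h\,dw, \tag{A.3}$$

provided we define

$$(\nabla t)_u \equiv \frac{1}{f}\frac{\partial t}{\partial u}, \qquad (\nabla t)_v \equiv \frac{1}{g}\frac{\partial t}{\partial v}, \qquad (\nabla t)_w \equiv \frac{1}{h}\frac{\partial t}{\partial w}.$$

The gradient of $t$, then, is

$$\nabla t \equiv \frac{1}{f}\frac{\partial t}{\partial u}\,\hat{\mathbf{u}} + \frac{1}{g}\frac{\partial t}{\partial v}\,\hat{\mathbf{v}} + \frac{1}{h}\frac{\partial t}{\partial w}\,\hat{\mathbf{w}}. \tag{A.4}$$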


If you now pick the appropriate expressions for $f$, $g$, and $h$ from Table A.1,
you can easily generate the formulas for $\nabla t$ in Cartesian, spherical, and
cylindrical coordinates, as they appear inside the front cover of the book.
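The recipe "differentiate with respect to each coordinate, then divide by the matching scale factor" is easy to try numerically. The following is a minimal sketch, assuming central finite differences and hypothetical function names; it evaluates the gradient of $t = r\cos\theta$ (which is just the Cartesian coordinate $z$) in spherical coordinates, where the answer should be $\hat{\mathbf{z}} = \cos\theta\,\hat{\mathbf{r}} - \sin\theta\,\hat{\boldsymbol{\theta}}$.

```python
import math


def spherical_scale_factors(r, theta, phi):
    """(f, g, h) for spherical coordinates: 1, r, r sin(theta)."""
    return 1.0, r, r * math.sin(theta)


def gradient(t, scale_factors, point, step=1e-6):
    """Components of grad t along (u-hat, v-hat, w-hat): (1/f) dt/du, etc."""
    components = []
    for axis, scale in enumerate(scale_factors(*point)):
        forward = list(point)
        backward = list(point)
        forward[axis] += step
        backward[axis] -= step
        derivative = (t(*forward) - t(*backward)) / (2.0 * step)  # central difference
        components.append(derivative / scale)
    return tuple(components)


def z_coordinate(r, theta, phi):
    """The scalar function t = r cos(theta)."""
    return r * math.cos(theta)


point = (2.0, 0.7, 1.1)   # (r, theta, phi), an arbitrary test point
grad = gradient(z_coordinate, spherical_scale_factors, point)
expected = (math.cos(0.7), -math.sin(0.7), 0.0)
print(grad)
print(expected)
```

Swapping in a different scale-factor function (for instance one returning `(1.0, s, 1.0)` for cylindrical coordinates) reuses the same `gradient` routine unchanged, which is the point of working with $f$, $g$, and $h$.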

From Eq. A.3 it follows that the total change in $t$, as you go from point
$\mathbf{a}$ to point $\mathbf{b}$ (Fig. A.1), is

$$t(\mathbf{b}) - t(\mathbf{a}) = \int_{\mathbf{a}}^{\mathbf{b}} dt = \int_{\mathbf{a}}^{\mathbf{b}} (\nabla t)\cdot d\mathbf{l}, \tag{A.5}$$

which is the fundamental theorem for gradients (not much to prove, really,
in this case). Notice that the integral is independent of the path taken from
$\mathbf{a}$ to $\mathbf{b}$.

Table A.1.
System        u    v    w    f    g    h
Cartesian     x    y    z    1    1    1
Spherical     r    θ    φ    1    r    r sin θ
Cylindrical   s    φ    z    1    s    1

[Fig. A.1: a path from point a to point b, with line element dl.]

1 M. Boas, Mathematical Methods in the Physical Sciences, 2nd ed., John Wiley, New York
(1983), Chap. 4, Sect. 3.


A.4 Divergence 


Suppose that we have a vector function,

$$\mathbf{A}(u, v, w) = A_u\,\hat{\mathbf{u}} + A_v\,\hat{\mathbf{v}} + A_w\,\hat{\mathbf{w}},$$

and we wish to evaluate the integral $\oint\mathbf{A}\cdot d\mathbf{a}$ over the surface of the infinitesimal
volume generated by starting at the point $(u, v, w)$ and increasing each
of the coordinates in succession by an infinitesimal amount (Fig. A.2).
Because the coordinates are orthogonal, this is (at least, in the infinitesimal
limit) a rectangular solid, whose sides have lengths $dl_u = f\,du$, $dl_v = g\,dv$,
and $dl_w = h\,dw$, and whose volume is therefore

$$d\tau = dl_u\,dl_v\,dl_w = (fgh)\,du\,dv\,dw. \tag{A.6}$$

(The sides are not just $du$, $dv$, $dw$; after all, $v$ might be an angle, in which
case $dv$ doesn’t even have the dimensions of length. The correct expressions
follow from Eq. A.1.)


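[Fig. A.2: the infinitesimal volume, with labeled corners (u + du, v, w), (u + du, v + dv, w), and (u + du, v + dv, w + dw).]

For the front surface,

$$d\mathbf{a} = -(gh)\,dv\,dw\,\hat{\mathbf{u}},$$

so that

$$\mathbf{A}\cdot d\mathbf{a} = -(ghA_u)\,dv\,dw.$$

The back surface is identical (except for the sign), only this time the quantity
$ghA_u$ is to be evaluated at $(u + du)$, instead of $u$. Since for any (differentiable)
function $F(u)$,

$$F(u + du) - F(u) = \frac{dF}{du}\,du$$

(in the limit), the front and back together amount to a contribution

$$\left[\frac{\partial}{\partial u}(ghA_u)\right]du\,dv\,dw = \frac{1}{fgh}\frac{\partial}{\partial u}(ghA_u)\,d\tau.$$

By the same token, the right- and left-hand sides yield

$$\frac{1}{fgh}\frac{\partial}{\partial v}(fhA_v)\,d\tau,$$

and the top and bottom give

$$\frac{1}{fgh}\frac{\partial}{\partial w}(fgA_w)\,d\tau.$$

All told, then,

$$\oint\mathbf{A}\cdot d\mathbf{a} = \frac{1}{fgh}\left[\frac{\partial}{\partial u}(ghA_u) + \frac{\partial}{\partial v}(fhA_v) + \frac{\partial}{\partial w}(fgA_w)\right]d\tau. \tag{A.7}$$

The coefficient of $d\tau$ serves to define the divergence of $\mathbf{A}$ in curvilinear
coordinates:

$$\nabla\cdot\mathbf{A} \equiv \frac{1}{fgh}\left[\frac{\partial}{\partial u}(ghA_u) + \frac{\partial}{\partial v}(fhA_v) + \frac{\partial}{\partial w}(fgA_w)\right], \tag{A.8}$$

and Eq. A.7 becomes

$$\oint\mathbf{A}\cdot d\mathbf{a} = (\nabla\cdot\mathbf{A})\,d\tau. \tag{A.9}$$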


Using Table A.1, you can now derive the formulas for the divergence in 
Cartesian, spherical, and cylindrical coordinates, which appear in the front 
cover of the book. 
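The general divergence formula can be exercised numerically in the same spirit as the derivation above: weight each component by the product of the other two scale factors, differentiate, and divide by $fgh$. The sketch below is illustrative only; it assumes central finite differences, and the function names and test fields are hypothetical. In spherical coordinates $\mathbf{A} = r\,\hat{\mathbf{r}}$ should give a divergence of 3, and $\mathbf{A} = \sin\theta\,\hat{\boldsymbol{\theta}}$ should give $2\cos\theta/r$.

```python
import math


def spherical_scale_factors(r, theta, phi):
    """(f, g, h) for spherical coordinates: 1, r, r sin(theta)."""
    return 1.0, r, r * math.sin(theta)


def divergence(A, scale_factors, point, step=1e-6):
    """Divergence of A = (A_u, A_v, A_w) from the scale factors f, g, h."""
    f, g, h = scale_factors(*point)
    total = 0.0
    for axis in range(3):
        def weighted_component(*p, axis=axis):
            factors = scale_factors(*p)
            # Product of the two *other* scale factors times this component,
            # e.g. g*h*A_u for axis 0.
            weight = factors[(axis + 1) % 3] * factors[(axis + 2) % 3]
            return weight * A(*p)[axis]

        forward = list(point)
        backward = list(point)
        forward[axis] += step
        backward[axis] -= step
        total += (weighted_component(*forward)
                  - weighted_component(*backward)) / (2.0 * step)
    return total / (f * g * h)


def radial_field(r, theta, phi):
    return (r, 0.0, 0.0)              # A = r r-hat


def polar_field(r, theta, phi):
    return (0.0, math.sin(theta), 0.0)  # A = sin(theta) theta-hat


point = (2.0, 0.7, 1.1)
div_radial = divergence(radial_field, spherical_scale_factors, point)
div_polar = divergence(polar_field, spherical_scale_factors, point)
print(div_radial)                          # should be close to 3
print(div_polar, 2.0 * math.cos(0.7) / 2.0)  # the two numbers should agree
```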

As it stands, Eq. A.9 does not prove the divergence theorem, for it per- 
tains only to infinitesimal volumes, and rather special infinitesimal volumes 
at that. Of course, a finite volume can be broken up into infinitesimal pieces, 
and Eq. A.9 can be applied to each one. The trouble is, when you then 
add up all the bits, the left-hand side is not just an integral over the outer 
surface, but over all those tiny internal surfaces as well. Luckily, however, 
these contributions cancel in pairs, for each internal surface occurs as the 
boundary of two adjacent infinitesimal volumes, and since da always points 
outward, $\mathbf{A}\cdot d\mathbf{a}$ has the opposite sign for the two members of each pair
(Fig. A.3). Only those surfaces that bound a single chunk (which is to say,
only those at the outer boundary) survive when everything is added up.
For finite regions, then,

$$\oint\mathbf{A}\cdot d\mathbf{a} = \int(\nabla\cdot\mathbf{A})\,d\tau, \tag{A.10}$$

and you need integrate only over the external surface.² This establishes the
divergence theorem.


A.5 Curl 


To obtain the curl in curvilinear coordinates, we calculate the line integral,

$$\oint\mathbf{A}\cdot d\mathbf{l},$$

around the infinitesimal loop generated by starting at $(u, v, w)$ and successively
increasing $u$ and $v$ by infinitesimal amounts, holding $w$ constant
(Fig. A.4). The surface is a rectangle (at least, in the infinitesimal limit), of
length $dl_u = f\,du$, width $dl_v = g\,dv$, and area

$$d\mathbf{a} = (fg)\,du\,dv\,\hat{\mathbf{w}}. \tag{A.11}$$

Assuming the coordinate system is right handed, $\hat{\mathbf{w}}$ points out of the page in
Fig. A.4. Having chosen this as the positive direction for $d\mathbf{a}$, we are obliged
by the right-hand rule to run the line integral counterclockwise, as shown.

[Fig. A.4: the infinitesimal loop, with corners (u, v, w), (u + du, v, w), (u + du, v + dv, w), and (u, v + dv, w).]

Along the bottom segment,

$$d\mathbf{l} = f\,du\,\hat{\mathbf{u}},$$


2 What about regions that cannot be fit perfectly by rectangular solids no matter how tiny they 
are — such as planes cut at an angle to the coordinate lines? It’s not hard to dispose of this 
case; try thinking it out for yourself, or look at H. M. Schey’s Div, Grad, Curl and All That, 
W. W. Norton, New York (1973), starting with Prob. II-15. 
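so

$$\mathbf{A}\cdot d\mathbf{l} = (fA_u)\,du.$$

Along the top leg, the sign is reversed, and $fA_u$ is evaluated at $(v + dv)$ rather
than $v$. Taken together, these two edges give

$$\left[-(fA_u)\big|_{v+dv} + (fA_u)\big|_{v}\right]du = -\left[\frac{\partial}{\partial v}(fA_u)\right]du\,dv.$$

Similarly, the right- and left-hand sides yield

$$\left[\frac{\partial}{\partial u}(gA_v)\right]du\,dv,$$

so the total is

$$\begin{aligned}
\oint\mathbf{A}\cdot d\mathbf{l} &= \left[\frac{\partial}{\partial u}(gA_v) - \frac{\partial}{\partial v}(fA_u)\right]du\,dv \\
&= \frac{1}{fg}\left[\frac{\partial}{\partial u}(gA_v) - \frac{\partial}{\partial v}(fA_u)\right]\hat{\mathbf{w}}\cdot d\mathbf{a}.
\end{aligned} \tag{A.12}$$

The coefficient of $d\mathbf{a}$ on the right serves to define the $w$ component of the
curl. Constructing the $u$ and $v$ components in the same way, we have

$$\begin{aligned}
\nabla\times\mathbf{A} \equiv{}& \frac{1}{gh}\left[\frac{\partial}{\partial v}(hA_w) - \frac{\partial}{\partial w}(gA_v)\right]\hat{\mathbf{u}} + \frac{1}{fh}\left[\frac{\partial}{\partial w}(fA_u) - \frac{\partial}{\partial u}(hA_w)\right]\hat{\mathbf{v}} \\
&+ \frac{1}{fg}\left[\frac{\partial}{\partial u}(gA_v) - \frac{\partial}{\partial v}(fA_u)\right]\hat{\mathbf{w}},
\end{aligned} \tag{A.13}$$

and Eq. A.12 generalizes to

$$\oint\mathbf{A}\cdot d\mathbf{l} = (\nabla\times\mathbf{A})\cdot d\mathbf{a}. \tag{A.14}$$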


Using Table A.1, you can now derive the formulas for the curl in Cartesian, 
spherical, and cylindrical coordinates. 
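A numerical spot check of the curl formula is straightforward. The following sketch assumes central finite differences and uses hypothetical function names; it works in cylindrical coordinates $(s, \phi, z)$ with the rigid-rotation field $\mathbf{A} = s\,\hat{\boldsymbol{\phi}}$, whose curl should be $2\,\hat{\mathbf{z}}$ everywhere.

```python
def cylindrical_scale_factors(s, phi, z):
    """(f, g, h) for cylindrical coordinates: 1, s, 1."""
    return 1.0, s, 1.0


def partial(fn, point, axis, step=1e-6):
    """Central-difference partial derivative of fn along one coordinate."""
    forward = list(point)
    backward = list(point)
    forward[axis] += step
    backward[axis] -= step
    return (fn(*forward) - fn(*backward)) / (2.0 * step)


def curl(A, scale_factors, point, step=1e-6):
    """(u, v, w) components of curl A, built from the scale factors f, g, h."""
    f, g, h = scale_factors(*point)

    def fA(*p):
        return scale_factors(*p)[0] * A(*p)[0]

    def gA(*p):
        return scale_factors(*p)[1] * A(*p)[1]

    def hA(*p):
        return scale_factors(*p)[2] * A(*p)[2]

    curl_u = (partial(hA, point, 1, step) - partial(gA, point, 2, step)) / (g * h)
    curl_v = (partial(fA, point, 2, step) - partial(hA, point, 0, step)) / (f * h)
    curl_w = (partial(gA, point, 0, step) - partial(fA, point, 1, step)) / (f * g)
    return curl_u, curl_v, curl_w


def rotation_field(s, phi, z):
    return (0.0, s, 0.0)   # A = s phi-hat


point = (1.5, 0.4, -2.0)   # (s, phi, z), an arbitrary test point
result = curl(rotation_field, cylindrical_scale_factors, point)
print(result)              # should be close to (0, 0, 2)
```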

Equation A.14 does not by itself prove Stokes’ theorem, however, 
because at this point it pertains only to very special infinitesimal surfaces. 
Again, we can chop any finite surface into infinitesimal pieces and apply 
Eq. A.14 to each one (Fig. A.5). When we add them up, though, we obtain 
(on the left) not only a line integral around the outer boundary, but a lot of 
tiny line integrals around the internal loops as well. Fortunately, as before, 
the internal contributions cancel in pairs, because every internal line is the 
edge of two adjacent loops running in opposite directions. Consequently, 
Eq. A.14 can be extended to finite surfaces, 


$$\oint\mathbf{A}\cdot d\mathbf{l} = \int(\nabla\times\mathbf{A})\cdot d\mathbf{a}, \tag{A.15}$$


and the line integral is to be taken over the external boundary only.³ This
establishes Stokes’ theorem. 


A.6 Laplacian 


Since the Laplacian of a scalar is by definition the divergence of the 
gradient, we can read off from Eqs. A.4 and A.8 the general formula 


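$$\nabla^2 t \equiv \frac{1}{fgh}\left[\frac{\partial}{\partial u}\left(\frac{gh}{f}\frac{\partial t}{\partial u}\right) + \frac{\partial}{\partial v}\left(\frac{fh}{g}\frac{\partial t}{\partial v}\right) + \frac{\partial}{\partial w}\left(\frac{fg}{h}\frac{\partial t}{\partial w}\right)\right]. \tag{A.16}$$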


Once again, you are invited to use Table A.1 to derive the Laplacian in 
Cartesian, spherical, and cylindrical coordinates, and thus to confirm the 
formulas inside the front cover. 
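Since the Laplacian is the divergence of the gradient, it can be sketched numerically by nesting two finite differences. The code below is a rough illustration with hypothetical function names; it assumes central differences with a fairly coarse step (nested differences amplify round-off) and checks two familiar results in spherical coordinates: $\nabla^2 r^2 = 6$ and, away from the origin, $\nabla^2(1/r) = 0$.

```python
import math


def spherical_scale_factors(r, theta, phi):
    """(f, g, h) for spherical coordinates: 1, r, r sin(theta)."""
    return 1.0, r, r * math.sin(theta)


def shifted(point, axis, delta):
    """Copy of point with one coordinate moved by delta."""
    moved = list(point)
    moved[axis] += delta
    return moved


def laplacian(t, scale_factors, point, step=1e-4):
    """Divergence of the gradient, using the scale factors f, g, h."""
    f, g, h = scale_factors(*point)
    total = 0.0
    for axis in range(3):
        def flux(*p, axis=axis):
            factors = scale_factors(*p)
            # (gh/f) dt/du for axis 0, and cyclic permutations for the others.
            weight = (factors[(axis + 1) % 3] * factors[(axis + 2) % 3]
                      / factors[axis])
            derivative = (t(*shifted(p, axis, step))
                          - t(*shifted(p, axis, -step))) / (2.0 * step)
            return weight * derivative

        total += (flux(*shifted(point, axis, step))
                  - flux(*shifted(point, axis, -step))) / (2.0 * step)
    return total / (f * g * h)


def r_squared(r, theta, phi):
    return r * r


def inverse_r(r, theta, phi):
    return 1.0 / r


point = (2.0, 0.7, 1.1)
lap_r_squared = laplacian(r_squared, spherical_scale_factors, point)
lap_inverse_r = laplacian(inverse_r, spherical_scale_factors, point)
print(lap_r_squared)   # should be close to 6
print(lap_inverse_r)   # should be close to 0
```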


3 What about surfaces that cannot be fit perfectly by tiny rectangles, no matter how small they 
are (such as triangles) or surfaces that do not correspond to holding one coordinate fixed? If 
such cases trouble you, and you cannot resolve them for yourself, look at Schey, footnote 2, 
Prob. III-2. 


The Helmholtz Theorem 


Suppose we are told that the divergence of a vector function F(r) is a 
specified scalar function D(r): 


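$$\nabla\cdot\mathbf{F} = D, \tag{B.1}$$

and the curl of $\mathbf{F}(\mathbf{r})$ is a specified vector function $\mathbf{C}(\mathbf{r})$:

$$\nabla\times\mathbf{F} = \mathbf{C}. \tag{B.2}$$

For consistency, $\mathbf{C}$ must be divergenceless,

$$\nabla\cdot\mathbf{C} = 0, \tag{B.3}$$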


because the divergence of a curl is always zero. Question: Can we, on the 
basis of this information, determine the function F? If D(r) and C(r) go 
to zero sufficiently rapidly at infinity, the answer is yes, as I will show by 
explicit construction. 

I claim that 


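$$\mathbf{F} = -\nabla U + \nabla\times\mathbf{W}, \tag{B.4}$$

where

$$U(\mathbf{r}) \equiv \frac{1}{4\pi}\int\frac{D(\mathbf{r}')}{\mathscr{r}}\,d\tau' \tag{B.5}$$

and

$$\mathbf{W}(\mathbf{r}) \equiv \frac{1}{4\pi}\int\frac{\mathbf{C}(\mathbf{r}')}{\mathscr{r}}\,d\tau'; \tag{B.6}$$

the integrals are over all of space, and, as always, $\mathscr{r} = |\mathbf{r} - \mathbf{r}'|$. For if $\mathbf{F}$ is
given by Eq. B.4, then its divergence (using Eq. 1.102) is

$$\nabla\cdot\mathbf{F} = -\nabla^2 U = -\frac{1}{4\pi}\int D\,\nabla^2\!\left(\frac{1}{\mathscr{r}}\right)d\tau' = \int D(\mathbf{r}')\,\delta^3(\mathbf{r} - \mathbf{r}')\,d\tau' = D(\mathbf{r}).$$

(Remember that the divergence of a curl is zero, so the $\mathbf{W}$ term drops out,
and note that the differentiation is with respect to $\mathbf{r}$, which is contained in $\mathscr{r}$.)
So the divergence is right; how about the curl?

$$\nabla\times\mathbf{F} = \nabla\times(\nabla\times\mathbf{W}) = -\nabla^2\mathbf{W} + \nabla(\nabla\cdot\mathbf{W}). \tag{B.7}$$

(Since the curl of a gradient is zero, the $U$ term drops out.) Now

$$-\nabla^2\mathbf{W} = -\frac{1}{4\pi}\int\mathbf{C}\,\nabla^2\!\left(\frac{1}{\mathscr{r}}\right)d\tau' = \int\mathbf{C}(\mathbf{r}')\,\delta^3(\mathbf{r} - \mathbf{r}')\,d\tau' = \mathbf{C}(\mathbf{r}),$$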
which is perfect. I’ll be done if I can just persuade you that the second
term on the right-hand side of Eq. B.7 vanishes. Using integration by parts
(Eq. 1.59), and noting that derivatives of $\mathscr{r}$ with respect to primed coordinates
differ by a sign from those with respect to unprimed coordinates,
we have

$$\begin{aligned}
4\pi\,\nabla\cdot\mathbf{W} &= \int\mathbf{C}\cdot\nabla\!\left(\frac{1}{\mathscr{r}}\right)d\tau' = -\int\mathbf{C}\cdot\nabla'\!\left(\frac{1}{\mathscr{r}}\right)d\tau' \\
&= \int\frac{1}{\mathscr{r}}\,\nabla'\cdot\mathbf{C}\,d\tau' - \oint\frac{1}{\mathscr{r}}\,\mathbf{C}\cdot d\mathbf{a}.
\end{aligned} \tag{B.8}$$

But the divergence of $\mathbf{C}$ is zero, by assumption (Eq. B.3), and the surface
integral (way out at infinity) will vanish, as long as $\mathbf{C}$ goes to zero sufficiently
rapidly.

Of course, that proof tacitly assumes that the integrals in Eqs. B.5 and
B.6 converge; otherwise $U$ and $\mathbf{W}$ don’t exist at all. At the large $r'$ limit,
where $\mathscr{r} \approx r'$, the integrals have the form

$$\int^{\infty}\frac{X(r')}{r'}\,r'^2\,dr' = \int^{\infty} r'\,X(r')\,dr'. \tag{B.9}$$

(Here $X$ stands for $D$ or $\mathbf{C}$, as the case may be.) Obviously, $X(r')$ must go
to zero at large $r'$, but that’s not enough: if $X \sim 1/r'$, the integrand is
constant, so the integral blows up, and even if $X \sim 1/r'^2$, the integral is
a logarithm, which is still no good at $r' \to \infty$. Evidently, the divergence
and curl of $\mathbf{F}$ must go to zero more rapidly than $1/r^2$ for the proof to hold.
(Incidentally, this is more than enough to ensure that the surface integral in
Eq. B.8 vanishes.)
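The convergence criterion can be made explicit with a one-line power-law test. The following LaTeX sketch assumes $X(r') \sim k/r'^{\,n}$ at large $r'$, with $k$ a constant and $R$ some large but finite lower limit; both symbols are introduced here purely for illustration.

```latex
% Tail of the integral in Eq. B.9 for a power-law fall-off X ~ k / r'^n:
\int_R^{\infty} r'\,\frac{k}{r'^{\,n}}\,dr'
  = k\int_R^{\infty} r'^{\,1-n}\,dr'
  = \begin{cases}
      \dfrac{k\,R^{\,2-n}}{n-2}, & n > 2 \quad\text{(converges)},\\[2ex]
      k\,\ln r'\,\Big|_R^{\infty} = \infty, & n = 2 \quad\text{(logarithmic divergence)},\\[2ex]
      \infty, & n < 2 .
    \end{cases}
```

So a fall-off strictly faster than $1/r'^2$ is exactly what the tail of the integral requires.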

Now, assuming these conditions on D(r) and C(r) are met, is the solution 
in Eq. B.4 unique? The answer is clearly no, for we can add to F any vector 
function whose divergence and curl both vanish, and the result still has 
divergence D and curl C. However, it so happens that there is no function 
that has zero divergence and zero curl everywhere and goes to zero at infinity 
(see Section 3.1.5). So if we include a requirement that $\mathbf{F}(\mathbf{r})$ goes to zero as
$r \to \infty$, then solution B.4 is unique.¹

Now that all the cards are on the table, I can state the Helmholtz theorem 
more rigorously: 


If the divergence $D(\mathbf{r})$ and the curl $\mathbf{C}(\mathbf{r})$ of a vector function $\mathbf{F}(\mathbf{r})$
are specified, and if they both go to zero faster than $1/r^2$ as $r \to \infty$,
and if $\mathbf{F}(\mathbf{r})$ goes to zero as $r \to \infty$, then $\mathbf{F}$ is given uniquely by
Eq. B.4.


1 Typically, we do expect the electric and magnetic fields to go to zero at large distances from 
the charges and currents that produce them, so this is not an unreasonable stipulation. 
Occasionally, one encounters artificial problems in which the charge or current distribution 
itself extends to infinity — infinite wires, for instance, or infinite planes. In such cases, other 
means must be found to establish the existence and uniqueness of solutions to Maxwell’s 
equations. 
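The Helmholtz theorem has an interesting corollary:


Any (differentiable) vector function $\mathbf{F}(\mathbf{r})$ that goes to zero faster than
$1/r$ as $r \to \infty$ can be expressed as the gradient of a scalar plus the curl
of a vector:²

$$\mathbf{F}(\mathbf{r}) = -\nabla\left(\frac{1}{4\pi}\int\frac{\nabla'\cdot\mathbf{F}(\mathbf{r}')}{\mathscr{r}}\,d\tau'\right) + \nabla\times\left(\frac{1}{4\pi}\int\frac{\nabla'\times\mathbf{F}(\mathbf{r}')}{\mathscr{r}}\,d\tau'\right). \tag{B.10}$$

For example, in electrostatics, $\nabla\cdot\mathbf{E} = \rho/\epsilon_0$ and $\nabla\times\mathbf{E} = \mathbf{0}$, so

$$\mathbf{E}(\mathbf{r}) = -\nabla\left(\frac{1}{4\pi\epsilon_0}\int\frac{\rho(\mathbf{r}')}{\mathscr{r}}\,d\tau'\right) = -\nabla V \tag{B.11}$$

(where $V$ is the scalar potential), while in magnetostatics $\nabla\cdot\mathbf{B} = 0$ and
$\nabla\times\mathbf{B} = \mu_0\mathbf{J}$, so

$$\mathbf{B}(\mathbf{r}) = \nabla\times\left(\frac{\mu_0}{4\pi}\int\frac{\mathbf{J}(\mathbf{r}')}{\mathscr{r}}\,d\tau'\right) = \nabla\times\mathbf{A} \tag{B.12}$$

(where $\mathbf{A}$ is the vector potential).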


2 As a matter of fact, any differentiable vector function whatever (regardless of its behavior at
infinity) can be written as a gradient plus a curl, but this more general result does not follow
directly from the Helmholtz theorem, nor does Eq. B.10 supply the explicit construction,
since the integrals, in general, diverge.

One way to prove it relies on the existence theorem for solutions to Poisson’s equation.
Given $\mathbf{F}$, let $f \equiv \nabla\cdot\mathbf{F}$; then there exists a function $h$ whose Laplacian is $f$:

$$\nabla^2 h = f.$$

It follows that

$$\nabla\cdot(\mathbf{F} - \nabla h) = 0.$$

But any divergenceless vector can be written as the curl of a vector (call it $\mathbf{H}$):

$$\mathbf{F} - \nabla h = \nabla\times\mathbf{H}, \quad\text{or}\quad \mathbf{F} = \nabla h + \nabla\times\mathbf{H}.$$


In our units (the Système International), Coulomb’s law reads

$$\mathbf{F} = \frac{1}{4\pi\epsilon_0}\,\frac{q_1 q_2}{\mathscr{r}^2}\,\hat{\mathscr{r}} \quad\text{(SI)}. \tag{C.1}$$

Mechanical quantities are measured in meters, kilograms, seconds, and
charge is in coulombs (Table C.1). In the Gaussian system, the constant in
front is, in effect, absorbed into the unit of charge, so that

$$\mathbf{F} = \frac{q_1 q_2}{\mathscr{r}^2}\,\hat{\mathscr{r}} \quad\text{(Gaussian)}. \tag{C.2}$$

Mechanical quantities are measured in centimeters, grams, seconds, and
charge is in electrostatic units (or esu). For what it’s worth, an esu is a
(dyne)$^{1/2}$-centimeter. Converting electrostatic equations from SI to Gaussian
units is not difficult: just set

$$\epsilon_0 \to \frac{1}{4\pi}.$$

For example, the energy stored in an electric field (Eq. 2.45),

$$U = \frac{\epsilon_0}{2}\int E^2\,d\tau \quad\text{(SI)},$$

becomes

$$U = \frac{1}{8\pi}\int E^2\,d\tau \quad\text{(Gaussian)}.$$

(Formulas pertaining to fields inside dielectrics are not so easy to translate,
because of differing definitions of displacement, susceptibility, and so on;
see Table C.2.)

The Biot-Savart law, which for us reads

$$\mathbf{B} = \frac{\mu_0}{4\pi}\,I\int\frac{d\mathbf{l}\times\hat{\mathscr{r}}}{\mathscr{r}^2} \quad\text{(SI)}, \tag{C.3}$$

becomes, in the Gaussian system,

$$\mathbf{B} = \frac{I}{c}\int\frac{d\mathbf{l}\times\hat{\mathscr{r}}}{\mathscr{r}^2} \quad\text{(Gaussian)}, \tag{C.4}$$

where $c$ is the speed of light, and current is measured in esu/s. The Gaussian
unit of magnetic field (the gauss) is the one quantity from this system in
everyday use: people speak of volts, amperes, henries, and so on (all SI
units), but for some reason they tend to measure magnetic fields in gauss
(the Gaussian unit); the correct SI unit is the tesla ($10^4$ gauss).
Table C.1. Conversion factors. [Note: Except in exponents, every “3” is short for
$\alpha \equiv 2.99792458$ (the numerical value of the speed of light), “9” means $\alpha^2$, and “12” is $4\alpha$.]

Quantity          SI                   Factor                     Gaussian
Length            meter (m)            $10^{2}$                   centimeter
Mass              kilogram (kg)        $10^{3}$                   gram
Time              second (s)           $1$                        second
Force             newton (N)           $10^{5}$                   dyne
Energy            joule (J)            $10^{7}$                   erg
Power             watt (W)             $10^{7}$                   erg/second
Charge            coulomb (C)          $3\times 10^{9}$           esu (statcoulomb)
Current           ampere (A)           $3\times 10^{9}$           esu/second (statampere)
Electric field    volt/meter           $(1/3)\times 10^{-4}$      statvolt/centimeter
Potential         volt (V)             $1/300$                    statvolt
Displacement      coulomb/meter²       $12\pi\times 10^{5}$       statcoulomb/centimeter²
Resistance        ohm (Ω)              $(1/9)\times 10^{-11}$     second/centimeter
Capacitance       farad (F)            $9\times 10^{11}$          centimeter
Magnetic field    tesla (T)            $10^{4}$                   gauss
Magnetic flux     weber (Wb)           $10^{8}$                   maxwell
H                 ampere/meter         $4\pi\times 10^{-3}$       oersted
Inductance        henry (H)            $(1/9)\times 10^{-11}$     second²/centimeter
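To see the conversion factors at work, the sketch below evaluates Coulomb’s law for two 1 C charges 1 m apart in both systems and confirms that the answers agree once the force is converted from newtons to dynes. It is a minimal illustration: the dictionary and function names are hypothetical, only a few rows of Table C.1 are included, and the SI constant $1/4\pi\epsilon_0$ is taken as $10^{-7}c^2$ (the conventional value with $\mu_0 = 4\pi\times 10^{-7}$ N/A²), which is an assumption of this example.

```python
import math

ALPHA = 2.99792458          # numerical value of the speed of light (the "3" in Table C.1)
C_SI = ALPHA * 1.0e8        # speed of light in m/s

# Multiply an SI value by the factor to get the Gaussian value (subset of Table C.1).
SI_TO_GAUSSIAN = {
    "length": 1.0e2,              # meter -> centimeter
    "force": 1.0e5,               # newton -> dyne
    "charge": ALPHA * 1.0e9,      # coulomb -> esu
    "magnetic field": 1.0e4,      # tesla -> gauss
}


def to_gaussian(value, quantity):
    """Convert an SI value of the named quantity to Gaussian units."""
    return value * SI_TO_GAUSSIAN[quantity]


def coulomb_force_si(q1, q2, r):
    """Magnitude of F = (1/4 pi eps0) q1 q2 / r^2, with 1/(4 pi eps0) = 1e-7 c^2 (assumed)."""
    k = 1.0e-7 * C_SI ** 2
    return k * q1 * q2 / r ** 2


def coulomb_force_gaussian(q1, q2, r):
    """Magnitude of F = q1 q2 / r^2 (charges in esu, distance in cm, force in dynes)."""
    return q1 * q2 / r ** 2


force_si_in_dynes = to_gaussian(coulomb_force_si(1.0, 1.0, 1.0), "force")
force_gaussian = coulomb_force_gaussian(
    to_gaussian(1.0, "charge"), to_gaussian(1.0, "charge"), to_gaussian(1.0, "length")
)
print(force_si_in_dynes, force_gaussian)   # the two numbers should agree
print(to_gaussian(1.0, "magnetic field"))  # one tesla expressed in gauss
```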


One major virtue of the Gaussian system is that electric and magnetic 
fields have the same dimensions (in principle, one could measure the electric 
fields in gauss too, though no one uses the term in this context). Thus the 
Lorentz force law, which we have written 


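$$\mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B}) \quad\text{(SI)}, \tag{C.5}$$

(indicating that $E/B$ has the dimensions of velocity), takes the form

$$\mathbf{F} = q\left(\mathbf{E} + \frac{\mathbf{v}}{c}\times\mathbf{B}\right) \quad\text{(Gaussian)}. \tag{C.6}$$

In effect, the magnetic field is “scaled up” by a factor of $c$. This reveals more
starkly the parallel structure of electricity and magnetism. For instance, the
total energy stored in electromagnetic fields is

$$U = \frac{1}{8\pi}\int(E^2 + B^2)\,d\tau \quad\text{(Gaussian)}, \tag{C.7}$$

eliminating the $\epsilon_0$ and $\mu_0$ that spoil the symmetry in the SI formula,

$$U = \frac{1}{2}\int\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right)d\tau \quad\text{(SI)}. \tag{C.8}$$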
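Table C.2. Fundamental equations in SI and Gaussian units.

                      SI                                                                                   Gaussian
Maxwell’s equations
  In general:         $\nabla\cdot\mathbf{E} = \frac{1}{\epsilon_0}\rho$                                   $\nabla\cdot\mathbf{E} = 4\pi\rho$
                      $\nabla\times\mathbf{E} = -\partial\mathbf{B}/\partial t$                            $\nabla\times\mathbf{E} = -\frac{1}{c}\,\partial\mathbf{B}/\partial t$
                      $\nabla\cdot\mathbf{B} = 0$                                                          $\nabla\cdot\mathbf{B} = 0$
                      $\nabla\times\mathbf{B} = \mu_0\mathbf{J} + \mu_0\epsilon_0\,\partial\mathbf{E}/\partial t$   $\nabla\times\mathbf{B} = \frac{4\pi}{c}\mathbf{J} + \frac{1}{c}\,\partial\mathbf{E}/\partial t$
  In matter:          $\nabla\cdot\mathbf{D} = \rho_f$                                                     $\nabla\cdot\mathbf{D} = 4\pi\rho_f$
                      $\nabla\times\mathbf{E} = -\partial\mathbf{B}/\partial t$                            $\nabla\times\mathbf{E} = -\frac{1}{c}\,\partial\mathbf{B}/\partial t$
                      $\nabla\cdot\mathbf{B} = 0$                                                          $\nabla\cdot\mathbf{B} = 0$
                      $\nabla\times\mathbf{H} = \mathbf{J}_f + \partial\mathbf{D}/\partial t$              $\nabla\times\mathbf{H} = \frac{4\pi}{c}\mathbf{J}_f + \frac{1}{c}\,\partial\mathbf{D}/\partial t$
D and H
  Definitions:        $\mathbf{D} = \epsilon_0\mathbf{E} + \mathbf{P}$                                     $\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P}$
                      $\mathbf{H} = \frac{1}{\mu_0}\mathbf{B} - \mathbf{M}$                                $\mathbf{H} = \mathbf{B} - 4\pi\mathbf{M}$
  Linear media:       $\mathbf{P} = \epsilon_0\chi_e\mathbf{E}$, $\mathbf{D} = \epsilon\mathbf{E}$         $\mathbf{P} = \chi_e\mathbf{E}$, $\mathbf{D} = \epsilon\mathbf{E}$
                      $\mathbf{M} = \chi_m\mathbf{H}$, $\mathbf{H} = \frac{1}{\mu}\mathbf{B}$              $\mathbf{M} = \chi_m\mathbf{H}$, $\mathbf{H} = \frac{1}{\mu}\mathbf{B}$
Lorentz force law     $\mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B})$                            $\mathbf{F} = q\left(\mathbf{E} + \frac{\mathbf{v}}{c}\times\mathbf{B}\right)$
Energy and power
  Energy:             $U = \frac{1}{2}\int\left(\epsilon_0 E^2 + \frac{1}{\mu_0}B^2\right)d\tau$           $U = \frac{1}{8\pi}\int(E^2 + B^2)\,d\tau$
  Poynting vector:    $\mathbf{S} = \frac{1}{\mu_0}(\mathbf{E}\times\mathbf{B})$                           $\mathbf{S} = \frac{c}{4\pi}(\mathbf{E}\times\mathbf{B})$
  Larmor formula:     $P = \frac{1}{4\pi\epsilon_0}\,\frac{2q^2a^2}{3c^3}$                                 $P = \frac{2q^2a^2}{3c^3}$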

Table C.2 lists some of the basic formulas of electrodynamics in both 
systems. For equations not found here, and for Heaviside—Lorentz units, 
I refer you to the appendix of J. D. Jackson, Classical Electrodynamics, 
3rd ed., John Wiley, New York (1999), where a more complete listing is 


to be found.! 


¹ For an interesting “primer” on electrical SI units, see N. M. Zimmerman, Am. J. Phys. 66,
324 (1998); the history is discussed in L. Kowalski, Phys. Teach. 24, 97 (1986).
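As one more illustration of how the two columns of Table C.2 fit together, here is a rough numerical comparison of the Larmor formula in SI and Gaussian units. Treat it as a sketch under stated assumptions: the function names are hypothetical, the charge and acceleration conversions (1 C = 10c esu with c in m/s, 1 m/s^2 = 100 cm/s^2) are assumed, and $\mu_0$ is taken to be $4\pi \times 10^{-7}$ N/A$^2$.

```python
import math

# Assumed constants, with mu_0 taken as exactly 4*pi*1e-7 N/A^2.
C_SI = 2.99792458e8                # speed of light, m/s
MU0 = 4.0 * math.pi * 1.0e-7       # permeability of free space
EPS0 = 1.0 / (MU0 * C_SI**2)       # permittivity of free space
C_CGS = 100.0 * C_SI               # speed of light, cm/s


def larmor_power_si(q, a):
    """SI Larmor formula: P = (1/(4 pi eps0)) (2/3) q^2 a^2 / c^3, in watts."""
    return (2.0 * q**2 * a**2) / (4.0 * math.pi * EPS0 * 3.0 * C_SI**3)


def larmor_power_gaussian(q_esu, a_cgs):
    """Gaussian Larmor formula: P = (2/3) q^2 a^2 / c^3, in erg/s."""
    return (2.0 * q_esu**2 * a_cgs**2) / (3.0 * C_CGS**3)


def charge_to_esu(q):
    """Convert a charge from coulombs to esu (assumed factor 10*c)."""
    return q * 10.0 * C_SI


def accel_to_cgs(a):
    """Convert an acceleration from m/s^2 to cm/s^2."""
    return 100.0 * a
```

For any charge and acceleration, `larmor_power_gaussian(charge_to_esu(q), accel_to_cgs(a))` should equal 10^7 times `larmor_power_si(q, a)`, since one watt is 10^7 erg/s.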


Abraham-—Lorentz formula, 493-494, 
498-501, 577 

Absorption, 418-423 
Absorption coefficient, 428 
Acausality, 447, 452, 495 
Acceleration 

ordinary, 553 

proper, 554 
Acceleration field, 465, 487 
Advanced potentials, 452 
Advanced time, 452 
Aharonov-Bohm effect, 247 
Alfvén’s theorem, 358-359 
Alternating current, 315 
Ampere (unit), 215, 223 
Ampère, A. M., xvi 
Ampère dipole, 271, 296 
Ampère’s law, 231-235, 242, 335-340,

571 

applications, 231—238 

in matter, 281-284 

symmetry for, 234 
Ampèrian loop, 231-234, 250
Amplitude, 393, 395 
Angle 

azimuthal, 37, 41 

of incidence, 414 

polar, 37 

of reflection, 414 

of refraction, 414 
Angular frequency, 394 
Angular momentum, 376-378, 384-386 
Angular momentum density, 377 
Anomalous dispersion, 428-429 
Antisymmetric tensor, 566-568 
Atomic polarizability, 167, 207-208 
Auxiliary fields 

D, 180-190, 283-285, 576 

H, 281-287, 576 
Average 

electric field, 118, 159-160 

magnetic field, 263-264 

potential, 117-118 
Azimuthal angle, 37, 41 
Azimuthal symmetry, 139 


BAC-CAB rule, 7-8 
Back emf, 328, 331 
Ball, defined, 49 


Bar electret, 175, 184 
Bar magnet, 278, 286 
Barn and ladder paradox, 522-524 
Betatron, 354 
Biot-Savart law, 221—226, 357, 564 
Bohr atom 
lifetime, 491 
polarizability, 168-169 
Bohr magneton, 263 
Bound charge, 172-178, 193, 343 
physical interpretation, 175-177 
Bound current, 276-279, 289, 343 
physical interpretation, 279-281 
Boundary conditions 
for dielectrics, 184, 189, 193-197, 205, 
345-346 
for electrodynamics, 51, 345-347 
for electromagnetic waves, 409, 413, 422 
for electrostatics, 85-88 
for Laplace’s equation, 118-123 
for magnetic materials, 286, 295, 345-347 
for magnetostatics, 249-251 
for Maxwell’s equations, 341, 345-353, 
586 
for waves on a string, 396-399 
Boundary value problems, 124-147, 193-197 
Bremsstrahlung, 491 
Brewster’s angle, 416 
Buckminsterfullerine, 158 


Canonical momentum, 449 
Capacitance, 102 
Capacitor, 102-105 
charging, 103-104, 338-339 
dielectric-filled, 191 
discharging, 303 
energy in, 103-104, 197 
parallel-plate, 72, 102, 191, 237, 558 
Cartesian coordinates, 4, 129-130, 578 
Cauchy’s formula, 429 
Causality, 446, 452, 494, 537 
Cavity 
in conductor, 97, 120 
in dielectric, 182-183 
in magnetic material, 284-285 
resonant, 440 
Center of energy, 551-552 
theorem, 551 
Center of momentum, 542 
Cgs units, xviii, 588-590
Charge, xvii-xviii
bound, 172-178, 193, 343 
conservation of, xviii, 221, see also 
Continuity equation 
local, 570 
electric, xvii-xviii, 57 
enclosed, 67 
free, 166, 180, 193, 418-419 
induced, 95-99 
magnetic, see Monopole 
quantization, xviii, 386 
uniformly moving, 466-469 
Charge density 
line, 61 
surface, 61, 99 
volume, 61 
Charge invariance, 557 
Child—Langmuir law, 107 
Chromodynamics, xv, xvii 
Circular polarization, 401 
Clausius—Mossotti equation, 207-208 
Coaxial cable, 74, 323, 333, 374, 436-438 
Colliding beam, 546-547 
Collision 
classical, 514 
elastic, 545 
relativistic, 545-548 
Completeness, 134 
Complex amplitude, 395 
Complex notation, 394, 408 
wave number, 428 
Complex permittivity, 427 
Complex susceptibility, 427 
Component, 4, 38 
Compton scattering, 545-546 
Compton wavelength, 546 
Conductivity, 298-299 
Conductor, 95-109, 166, 298 
“good” and “poor”, 419 
perfect, 298, 352, 358, 430 
surface charge on, 125-126, 128, 301 
Conservation laws, 362-388, see also Charge; 
Energy 
global, xviii, 362 
local, see Continuity equation 
relativistic, 541—547 
Conservative force, 24 
Constitutive relation, 187, 287, 344-345, 577 
Continuity equation, xviii, 220, 222, 340, 
362-363, 365, 373, 569 
Contravariant vector, 532, 574 
Convection current, 313 
Convective derivative, 448 
Coordinates 
Cartesian, 4, 578, 579 
curvilinear, 37, 578-584 
cylindrical, 42-43, 578, 581 
inversion of, 12 


rotation of, 10-11 
spherical, 37-41, 578, 579 
translation of, 12 
Cosines, law of, 3 
Coulomb (unit), 58, 588 
Coulomb field, generalized, 465 
Coulomb gauge, 446-447, 573 
Coulomb’s law, xviii, 58, 61-62 
magnetic, 342, 386 
Covariant vector, 532, 574 
Critical angle, 439 
Cross product, 3, 6 
Crystal ambiguity, 185-186 
Curie point, 293 
Curl, 16, 18-19, 582-584 
of A, 243, 442 
of B, 227-231 
in curvilinear coordinates, 582-583 
in cylindrical coordinates, 42 
of D, 183-184 
of E, 64, 74-75, 317 
of H, 282 
in spherical coordinates, 40 
Curl-less fields, 52, 75—77 
Current, 215-221 
alternating, 315 
bound, 276-281, 288 
convection, 313 
displacement, 336-337 
eddy, 311-314 
enclosed, 228, 231, 282, 335-336 
free, 282, 288 
induced, 318 
polarization, 342-344 
steady, 221 
Current density, 218-221 
four-vector, 569-570 
surface, 218 
volume, 219-220 
Curvilinear coordinates, 37, 578—583 
Cutoff frequency, 434-436 
Cyclotron frequency, 213 
Cycloid motion, 212-213, 550-551 
Cyclotron motion, 211, 550-551 
Cylindrical coordinates, 42-43, 578, 579 


D, see Displacement, electric 
D’Alembertian, 447, 574 
Del operator, 16 
Delta function 

Dirac, 43-50, 161 

Kronecker, 163, 369 
Demagnetization field, 284 
Density of field lines, 65, 240-242 
Derivative, 12 

normal, 88 
Diamagnetism, 268, 273-276, 352, 355 
Dielectric, 166 

linear, 186-193 
Dielectric constant, 187, 188
Diode, vacuum, 106
Dipole, electric, 65, 148, 150-156
energy of, in electric field, 171
energy of interaction of two, 171
field of
oscillating, 476
static, 65, 154-157
force on, 169-171
induced, 166-169
moment, 151
perfect, 152, 155
permanent, 169
physical, 152, 155
potential of
oscillating, 475
static, 148-149, 151
radiation, 473-478
torque on, 169-170
Dipole, magnetic, 252-255
of electron, 263
Ampère model, 272, 296
energy of, in magnetic field, 293
energy of interaction of two, 293
field of
oscillating, 481
static, 254, 263-264
force on, 269-272, 294-295
Gilbert model, 271, 297, 482
intrinsic, 384
moment, 252-255, 264
moving, 575-576
perfect, 254-255
physical, 253-254
potential of
oscillating, 481
static, 254
radiation, 478-482, 486
torque on, 269-272
Dipole, Thomson’s, 386
Dirac, P. A. M., 386
Dirac delta function, 43-50, 161
Dirichlet’s theorem, 132
Discharge of capacitor, 303
Discontinuity
in B, 250, 286
in E, 85-87
Dispersion, 424-429
anomalous, 428
Dispersion coefficient, 429
Displacement, electric, 181-184
Displacement current, 336-339, 345, 357
Displacement vector
finite, 1, 8
four-vector, 533
infinitesimal
Cartesian, 9
curvilinear, 578
cylindrical, 42
spherical, 39
Divergence, 16-17, 580-581
of A, 243
of B, 227-230
in Cartesian coordinates, 17
in curvilinear coordinates, 581
in cylindrical coordinates, 42
of E, 64, 68
four-dimensional, 570
of H, 283-285
in spherical coordinates, 40
Divergence theorem, 31, 581
Divergenceless field, 52, 248
Domain, 290-292
Dot product, 2, 5, 531
Drift velocity, 302
Drude, P. K. I., 302
Dual tensor, 568, 576
Duality, 359-360, 482
Dumbbell model, 498-501


Earnshaw’s theorem, 118, 205
Earth’s magnetic field, 223, 348
Eddy current, 311-314
Ehrenfest’s paradox, 524
Einstein, A., xiv, xvi, 318, 509-510, 535
Einstein summation convention, 532
Einstein velocity addition rule, 513-514,
528-530
Einstein’s postulates, 508-513
Elastic collision, 545
Electret, 175, 184
Electric field, 57, 59-60, see also
Charge; Current; Dipole, electric;
Displacement, electric; Energy;
Force, electric; Polarization (of a
medium); Potential; Susceptibility
average over a sphere, 160
computer, 109-111
in conductor, 95, 298
curl of, 64
divergence of, 64
of dynamic configurations
arbitrary charge distribution, 453,
484-485
oscillating electric dipole, 476
oscillating magnetic dipole, 481
parallel-plate capacitor, moving,
557-559, 564-565
point charge, arbitrary motion,
461-464
point charge, constant velocity,
465-466, 559-560
point charge moving in straight line,
466
rotating charged sphere, 349
rotating electric dipole, 479
induced, 317, 320-324
Electric field (cont.)
macroscopic, 179-180, 199 
microscopic, 179-180 
of static configurations 
bar electret, 175, 184 
conducting sphere in dielectric medium, 
206-207 
conducting sphere in external field, 144 
continuous charge distribution, 61 
dielectric cylinder in external field, 196 
dielectric sphere in external field, 
193-194 
dipole, 154-156, 160 
disk, 63 
finite line, 61—63 
infinite cylinder, 70-71 
infinite line, 63, 73 
infinite plane, 72 
line charge, 61 
overlapping spheres, 73, 177-178 
parallel-plate capacitor, 72-73 
point charge distribution, 59 
point charge near conducting plane, 124 
point charge near dielectric plane, 
194-196 
polarized object, 172-175 
ring, 63 
sphere, 63, 68-69 
spherical shell, 63, 73 
surface charge distribution, 61 
uniformly polarized cylinder, 178 
uniformly polarized object, 172-173, 
295 
uniformly polarized sphere, 174 
volume charge distribution, 61 
Electric potential, 76, 113, 442, 572 
Electromagnetic force between point 
charges, 465-466 
Electromagnetic induction, 316-335 
Electromagnetic mass, 500 
Electromagnetic radiation, xvii, 472 
Electromagnetic spectrum, 403 
Electromagnetic waves, see Waves 
Electromotance, 305 
Electromotive force (emf), 298-315, 328 
Electrons 
dipole moment, 262 
discovery of, 214 
spin, 263, 385 
Electrostatic pressure, 101 
Electrostatics, 57, 198, 221-222, 231, 
238-239 
Elliptic integral, 259, 266 
Emf (electromotive force), 298-315, 328 
Enclosed charge, 67 
Enclosed current, 228, 231, 282, 
335-336 
Energy 
of capacitor, 104 


of charge in static field, 88-89 
conservation of, 412, 541, see also 
Poynting’s theorem 
of continuous charge distribution, 91-93 
of dipole, 172, 293-294 
in electric field, 363—365 
of electromagnetic wave, 406—408 
of inductor, 331 
of linear dielectric, 197—201 
in magnetic field, 331-335, 363 
of point charge distribution, 89-91 
of point charge near 
conducting plane, 126 
of spherical shell, 92-94 
of static charge distribution, 88 
Energy, relativistic, 541 
kinetic, 541 
rest, 541 
Energy density 
electromagnetic, 365, 406 
of electromagnetic wave, 406—408 
electrostatic, 91-94 
in linear media, 286 
magnetostatic, 332 
Energy flux, 364 
Energy-momentum four-vector, 541 
Equipotential, 78, 96 
Equivalence principle, 507 
Ether, 60, 337, 510-512 
drag, 511 
wind, 510-512 
Euler’s formula, 394 
Evanescent wave, 439 
Events, 524 
Ewald—Oseen extinction theorem, 409 
Exclusion principle, 268 


Farad (unit), 102 
Faraday, M., xvi, 316, 347 
Faraday cage, 99 
Faraday disk, 311, 348 
Faraday dynamo, 311 
Faraday’s law, 316-324, 335, 404-405, 571 
Ferromagnetic domain, 289-293 
Ferromagnetism, 268, 289-294 
Feynman, R. P., xiv, 94 
Feynman disk paradox, 377-379 
Field, xvii, see also Electric field; Field 
theory; Magnetic field 
drawing, 109-110 
Field line, 64-65, 239-242, 347-350 
drawing, 110-111, 266 
Field point, 8, 59 
Field tensor, 566-569, 573 
Field theory, xvii, 51-53, 557-558 
Flux 
electric, 66-68 
magnetic, 307, 314 
Flux density, 284 
energy, 364
Flux integral, 24 
Flux rule, 309-311, 316-317, 509-510 
Flux rule paradox, 310 
Flux tube, 240-241 
Force 
conservative, 24 
electric 
on conductor, 100-101 
on dielectric, 201-203, 206-207 
on electric dipole, 169-171 
on point charge in field, 59, 211 
on point charge near conducting plane, 
125-126 
on point charge near dielectric plane, 
195-197 
between point charges, 58, 465-466 
on surface charge, 100-101 
electromagnetic, between point charges, 
465-466 
Lorentz, 211, 215, 550 
magnetic 
on current, 215-216, 219 
between current loops, 259 
on magnetic dipole, 270-272, 293 
on magnetized material, 275 
between monopoles, 342 
between parallel currents, 209-210, 224, 
227, 553-555 
between parallel planes, 237 
on point charge, 210 
Minkowski, 549, 554, 571, 575 
ordinary, 547, 549 
relativistic, 547 
Force density, 368 
Four vector, 530-533 
acceleration, 553 
charge/current, 569-570 
displacement, 533 
energy/momentum, 541 
gradient, 574 
Minkowski force, 549, 554, 571, 575 
position/time, 530-531 
potential, 572-575 
velocity, 538-539 
Fourier series, 132-133 
Fourier transform, 396, 437 
Fourier’s trick, 132, 142 
Free charge, 181, 185-186 
Frequency, 394 
cutoff, 434-436 
Fresnel equations, 416-418 
Fringing field, 201-202 
Fundamental theorem of calculus, 
28 
for curls, 33 
for divergences, 31, 582 
for gradients, 29, 580 
Future, 534-535 


Galilean transformation, 525, 532 
Galileo Galilei, 508 
principle of relativity, 508 
velocity addition rule, 513-514 
Gauge 
Coulomb, 446-447, 574 
Lorentz, 447 
Lorenz, 446-448, 453, 470, 573-574 
Poincaré, 261 
Gauge invariance, 445-447, 573 
Gauge transformation, 445-446 
Gauss (unit), 223, 590 
Gaussian “pillbox”, 72, 288-289 
Gaussian surface, 69-71 
Gaussian units, xix, 588-590 
Gauss’s law, 66-68, 238, 335, 571 
applications of, 69-73 
differential form, 68 
inside matter, 180-182 
integral form, 67—68 
symmetry for, 69-70 
Gauss’s theorem, 31 
Gedanken (thought) experiment, 
514 
Generating function, 150 
Generator, 307-315 
Gilbert dipole, 271, 297, 482 
Gradient, 13-16, 579 
in Cartesian coordinates, 12, 13 
in curvilinear coordinates, 579 
in cylindrical coordinates, 42 
four-dimensional, 574 
in spherical coordinates, 40 
theorem, 12, 580 
Green’s identity, 55, 124 
Green’s reciprocity theorem, 161 
Green’s theorem, 31, 55 
Ground, 121 
Group velocity, 424, 434 
Guided wave, 430-438 
Gyromagnetic ratio, 262 


H, 281-287 

Hall effect, 256 

Harmonic function, 114 
Heaviside, O., xvi, 211, 340, 466 
Heaviside fields, 466, 559-560 
Heaviside—Lorentz units, xix, 590 
Heaviside theta (step) function, 48 
Helical motion, 211 

Helmholtz coil, 259 

Helmholtz theorem, 51, 585-587 
Henry (unit), 327 

Hertz, H., xvi, 337 

Hertz (unit), 403 

Hidden momentum, 551—553 
Homogeneous medium, 189 
Horizon, 461 

Hund’s rule, 276 
Hyperbolic geometry, 536-537

Hyperbolic motion, 461, 469, 507, 540, 548, 
575 

Hyperfine splitting, 264 

Hysteresis, 292 


Images, method of, 124-129 
dipole and conducting plane, 171 
parallel cylinders, 130 
point charge and conducting plane, 
124-126, 505 
point charge and conducting sphere, 
126-129 
point charge and dielectric plane, 196 
Incidence 
angle of, 414 
plane of, 413-414 
Incident wave, 396, 411 
Induced charge, 95-99, 125-126, 129 
Induced current, 318 
Induced dipole, 166-169 
Induced electric field, 317, 319-324 
Induced emf, 317-318 
Inductance, 325-330 
mutual, 325-326, 334 
self, 327-330, 333 
Induction, 284, 316-334 
Inertial system, 508 
Inhomogeneous wave equation, 447 
Insulator, 95, 166 
Integration by parts, 35-36 
Intensity, 407 
Internal reflection, 439 
Internal resistance, 306 
Interval, space-time, 533-535 
lightlike, 533, 535-537 
spacelike, 533, 535-536 
timelike, 533, 535-536 
Intrinsic dipole, 268, 384 
Invariance 
of charge, 557 
of mass, 542 
time-reversal, 452 
Invariant, 532, 542, 565, 569 
Invariant interval, 532-534 
Invariant mass, 542 
Invariant product, 532 
Inversion, 12, 437 
Irrotational field, 52, 75-77 
Isotropic medium, 191 


Jefimenko’s equations, 454-456 
Joule heating law, 303 
Jumping ring, 319 


Kinetic energy, 541 
Kronecker delta, 163, 369 


Langevin equation, 208 
Laplace’s equation, 81, 113-118 
in one dimension, 114-115
in three dimensions, 116-118 
in two dimensions, 115-116 
Laplacian, 22 
in Cartesian coordinates, 21, 113 
in curvilinear coordinates, 583 
in cylindrical coordinates, 42 
of a scalar, 22 
in spherical coordinates, 40 
of V, 81, 85, 113 
of a vector, 22—23 
Larmor formula, 486-489 
LC circuit, 330 
Left-handed coordinates, 6 
Legendre polynomials, 140, 150 
Lenz’s law, 319 
Levi-Civita symbol, 294 
Levitation, 353 
Liénard formula, 489-491, 577 
Liénard—Wiechert potentials, 456-461, 
468-469 
Lifetime, 517, 519 
Light, 390-440 
speed of 
linear medium, 409-410 
universal, 512-513 
in vacuum, 402, 511-512 
Light cone, 535 
Lightlike interval, 533 
Line charge, 61 
Line current, 215-216 
Line element 
Cartesian, 9 
curvilinear, 578 
cylindrical, 42 
spherical, 39 
Line integral, 24 
Linear algebra, 11 
Linear combination, 132, 396 
Linear equation, 132, 392 
Linear medium, 409-410 
electric, 184-196 
magnetic, 286-289 
Linear polarization, 401 
Local conservation, see Continuity equation 
Longitudinal wave, 399-400 
Lorentz, H. A., xvi, 498, 512 
Lorentz contraction, 512, 520-524, 527 
Lorentz contraction paradox, 521-522 
Lorentz force law, 209-222, 238, 381, 383, 
448, 550, 572, 577 
in potential form, 448-449 
Lorentz gauge, 447 
Lorentz—Lorenz equation, 208 
Lorentz transformation, 524-531, 574 
Lorenz, L. V., 447 
Lorenz gauge, 446-448, 453, 470, 573-574 


Macroscopic field, 178-180, 199, 281 
Madelung constant, 91 
Magnet, 278, 286
Magnetic field, 209-210, 238-242, 284, 
554-557, see also Charge; Dipole; 
Energy; Field line; Flux; Force, 
magnetic; Magnetization; Potential; 
Susceptibility 
average over a sphere, 263 
curl of, 227-231 
divergence of, 227-229 
of dynamic configurations 
arbitrary charge distribution, 455, 
484-485 
charging capacitor, 338-339 
oscillating electric dipole, 476 
oscillating magnetic dipole, 481 
parallel-plate capacitor, moving, 
560-562 
point charge arbitrary motion, 462-465 
point charge constant velocity, 466, 564 
solenoid moving, 562-563 
of Earth, 223, 348 
macroscopic, 281 
microscopic, 281 
of rotating magnet, 347-350 
of static configurations 
bar magnet, 278, 286 
in cavity, 284-285 
circular loop, 225, 266 
dipole, 254, 262-264 
finite solenoid, 227 
finite straight line, 232 
infinite plane, 233 
infinite solenoid, 227, 233, 237, 258 
infinite straight line, 223-224, 227-228, 
233 
magnetized object, 276-277, 281 
solenoid filled with magnetic material, 
288 
sphere of linear material in external 
field, 289 
spinning sphere, 245-246, 249, 263-264 
toroidal coil, 235-236 
uniformly magnetized cylinder, 278 
uniformly magnetized object, 296, 301 
uniformly magnetized sphere, 277-278 
in superconductor, 339 
work done by, 214, 217-218, 379-384 
Magnetic induction, 284, 316-334 
Magnetic monopole, 239, 258, 271, 341-342 
Magnetic susceptibility, 286-288 
Magnetic velocity Verlet, 264-266 
Magnetization, 268-276, 343-345 
Magnetomechanical ratio, 262 
Magnetostatics, 221-222, 231, 238-239, 249, 
357, 358 


Mass 
electromagnetic, 501 
relativistic, 541 
rest, 541 

Mass renormalization, 501 


Massless particle, 544-546 
Matrix 
Lorentz transformation, 530-531 
rotation, 10 
Maxwell, J. C., xvi, 335-340, 402 
Maxwell stress tensor, 368-372 
Maxwell’s equations, 238, 335-342, 570, 574 
in Gaussian units, 590 
inside matter, 342-345 
with magnetic monopoles, 340-341 
tensor form, 570-571 
Meissner effect, 352-353 
Merzbacher’s puzzle, 351 
Method, see Images; Numerical; Relaxation; 
Separation of variables 
Method of assembly, 93 
Michelson—Morley experiment, 511-512 
Microscopic field, 179-180, 281 
Minkowski, H., 535, 577 
Minkowski constitutive relations, 577 
Minkowski diagram, 534 
Minkowski force, 549, 553, 571, 575 
Minkowski metric, 532 
Mks units, xix, 588—590 
Momentum 
angular, 376-378 
canonical, 449 
conservation of, 372-376, 542 
in electromagnetic field, 366-376 
in electromagnetic wave, 406-408 
four-vector, 541 
hidden, 551-553 
relativistic, 540-542 
Momentum density, 373-374, 406 
Monochromatic wave, 402—405 
Monopole 
electric, 149-151, 485 
magnetic, 238-239, 251-252, 258, 
341-342, 386 
Motional emf, 306-315, 509-510 
Multipole expansion 
of electrostatic potential, 147-154 
of magnetostatic potential, 251-252 
of radiation fields, 486 
Mutual inductance, 325-326, 334 


Neumann formula, 325-326 
Newton’s laws 
first law, 508 
second law, 500, 547 
third law, 259-260, 367-368, 469, 498, 
548 
Normal derivative, 88 
Normal incidence, 410-412 
Normal vector, 25, 86-88, 251 
Numerical methods 
fields, 109-110 
field lines, 110-111, 266 
particle trajectories, 111-112, 264-267 
relaxation, 164-165 


Oblique incidence, 412-417 
Observer, 515 

Octopole, 148, 152, 163, 487 
Oersted, C., xvi, 568 

Ohm (unit), 300 

Ohm’s law, 298-303 
Operator, 16 

Optical tweezers, 441 
Orthogonal coordinates, 578 
Orthogonal functions, 134, 142 
Orthogonality, 134-135, 142 


Pair annihilation, 547 
Paradoxes, see Barn and ladder paradox; 
Ehrenfest’s paradox; Feynman 
disk paradox; Lorentz contraction 
paradox; Merzbacher’s puzzle; Time 
dilation paradox; Twin paradox 
Parallel-plate capacitor, 72, 102-103, 190, 
237, 558-565 
Paramagnetism, 268-272, 275-276 
Past, 535 
Path independence, 24, 29, 51, 76-77 
Path integral, 24 
Pauli exclusion principle, 268 
Perfect conductor, 352 
Perfect dipole, 155-156, 253-254 
Permanent magnet, 278, 290-291 
Permeability, 223, 286-288, 290, 577 
of free space, 223, 287 
relative, 287 
Permittivity, 187, 423, 577 
complex, 427 
of free space, 58, 187 
relative, 187 
Phase, 393 
Phase constant, 393, 421 
Phase transition, 293 
Phase velocity, 424 
Photon, 535, 543-546 
Pillbox, 72, 288-289 
Pinch effect, 256 
Planck formula, 544 
Plane 
of incidence, 413-414 
of polarization, 412 
Plane wave, 402-405 
Plasma, 256 
Poincaré gauge, 261 
Point charge, see Electric field; Force; 
Magnetic field; Monopole; Potential 
Poisson’s equation, 81, 113, 243, 286 
for A, 243 
for V, 82, 85, 113 
Polar angle, 37 
Polar molecule, 169-170 
Polarizability 
atomic, 167 
tensor, 168
Polarization (of a medium), 167, 171-172 
current, 342-344 
electric, 167, 171, 342-345 
induced, 167 
magnetic, see Magnetization 
Polarization (of a wave), 399-401 
circular, 401 
linear, 401 
Polarization angle, 400 
Polarization vector, 400 
Pole (magnetic), 239, 271 
Position—time four-vector, 530-531 
Position vector, 8 
Potential, 51, 113, 442, 454, 587; see also 
Scalar; Vector 
advanced, 452 
electric, 76-81, 113 
in electrodynamics, 442—447 
four-vector, 573-575 
Liénard-Wiechert, 456-461, 469
magnetic scalar, 244, 248-250, 262 
magnetic vector, 243-244, 261 
relativistic, 572-574 
retarded, 450-453 
Potential energy, 78 
of a charge configuration, 90 
of a point charge, 89 
Power 
dissipated in resistor, 303, 365 
in electromagnetic wave, 406 
radiated 
by arbitrary source, 484-487 
by oscillating electric dipole, 476, 482 
by oscillating magnetic dipole, 481 
by point charge, 487-493 
Poynting vector, 364-365, 373, 406-409, 418 
Poynting’s theorem, 363-365 
Preacceleration, 495, 498, 506 
Present, 535 
Pressure 
electromagnetic, 370 
electrostatic, 101 
radiation, 407 
Principle of equivalence, 507 
Principle of relativity, 508, 512 
Product rules, 20-22 
Propagation vector, 405 
Proper, 538 
charge density, 569 
force, 549 
permeability, 577 
permittivity, 577 
time, 538 
velocity, 538—540 
Pseudoscalar, 12 
Pseudovector, 12, 211 
Pulsar, 504 
Quadrupole
electric, 149, 152, 163, 486 
magnetic, 251 
radiation, 485-486 
Quadrupole moment, 163 
Quantization, xviii 
Quark, xviii 
Quasistatic approximation, 322-323, 455, 
456 
Quotient rules, 21 


Radiation, 472-507 

by arbitrary source, 482-486 

by electric dipole, 473-478 

by electric quadrupole, 486 

electromagnetic, xvii, 472, 486 

by magnetic dipole, 479-482, 503-505 

by point charge, 487-493 

in hyperbolic motion, 507 

by rotating electric dipole, 478-479 

by surface current, 505 

synchrotron, 493 
Radiation damping, 495 
Radiation field, 465, 488 
Radiation pressure, 407 
Radiation reaction, 493-501, 507 
Radiation resistance, 478, 482 
Radiation zone, 475, 481, 484 
Rapidity, 533 
RC circuit, 303 
Reference point 

for electric dipole, 153-155 

for magnetic dipole, 253 

for potential, 77, 78, 80 
Reflection, 410-416 

angle of, 414 

at conducting surface, 422-423 

internal, 439 

law of, 414 

waves on a string, 396-399 
Reflection coefficient, 412, 417 
Refraction, 410-418 

angle of, 414 

coefficient of, 429 

index of, 409, 424, 428 

law of, 414 
Relativistic constitutive relations, 

577 

Relativistic dynamics, 547-553 
Relativistic electrodynamics, 554-574 
Relativistic energy, 540-542 
Relativistic kinematics, 542-547 
Relativistic mass, 541 
Relativistic mechanics, 538-553 
Relativistic momentum, 541-542 
Relativistic potential, 572-575 
Relativity 

principle of, 508-514 

of simultaneity, 515-522, 527 
special, xiv, 508-577
Relaxation, method of, 116, 164-165 
Renormalization 

of charge, 190-191 

of mass, 501 
Resistance, 300 
Resistivity, 298-299 
Resistor, 299 
Resonant cavity, 440 
Rest energy, 541 
Rest mass, 541 
Retarded position, 456-458 
Retarded potential, 450-453 
Retarded time, 450 
Reversion of series, 500 
Right-hand rule, 3, 210 
Right-handed coordinates, 6 
RL circuit, 334 
Rodrigues formula, 140, 146 
Rotation, 9 
Rotation matrix, 10 
Runaway motion, 495, 498 


Saturation, 291 
Scalar, 1 
Scalar potential, 51, 76, 442, 459 
dynamic configurations 
arbitrary charge distribution, 450, 
484 
oscillating electric dipole, 475 
oscillating magnetic dipole, 479 
point charge, arbitrary motion, 459 
point charge constant velocity, 
459-461 
magnetic, 244, 248-249, 262, 286 
static configurations 
average over a sphere, 117 
conducting sphere in external field, 144 
continuous charge distribution, 82-83 
disk, 84 
electric dipole, 150-151 
finite cylinder, 85 
infinite line, 84, 450 
multipole expansion, 147-154 
point charges, 82 
polarized matter, 172-174 
ring, 84 
specified charge on surface of sphere, 
144-145 
specified electric field, 77, 262 
specified potential on surface of sphere, 
141-142 
spherical shell, 80, 83, 146 
surface charge, 83 
uniformly charged object, 295 
uniformly charged sphere, 80, 85 
uniformly polarized sphere, 174, 
177-178 
volume charge, 83 
Scalar product, 3, 5, 7, 531-533

Second derivative, 22-23 

Second-rank tensor, 11, 566—567 

Self-force, 498-501 

Self-inductance, 327-330, 333 

Semiconductor, 299 

Separation of variables, 129-147 
Cartesian coordinates, 129-139 
cylindrical coordinates, 147 
spherical coordinates, 139-147 

Separation vector, xi, 8, 15, 58 

Shear, 370 

Shielding, 191 

SI units, xix, 588—590 

Simultaneity, 515-516, 527 

Sinusoidal waves, 393—396 

Skin depth, 420-421 

Sky, blueness of, 477 

Snell’s law, 414 

Solenoid, 227, 233-234 

Solenoidal field, 52, 248 

Source charge, 8, 57, 209 

Source point, 8, 59 


Sources, xii, 57-59, 62, 209, 239, 305-306, 


317, 337, 447 
Space charge, 106 
Space-time diagram, 533—537 
Space-time interval, 533-534 
Spacelike interval, 534 
Space-time structure, 530-537 
Special relativity, xiv, 508-577 
Spectrum, 403 
Speed 
of charges in wire, 238, 302 
of light in linear medium, 408-409 
of light in vacuum, 402, 511-512 
of waves on a string, 391 
Sphere 
defined, 49 
terminology for, 49 
Spherical coordinates, 37-41 
Spherical surface, 49 
Spherical volume, 49 
Spherical wave, 438 
Standing wave, 390, 393, 435 
Stationary charge, 58, 222 
Steady current, 222 
Step function, 48 
Stokes’ theorem, 33, 53, 54, 583—584 
Størmer’s problem, 267
Stress, 370 
Stress tensor, 368-372 
String, waves on, 390-401 
Summation convention, 532 
Sun, age of, 107—108 
Sunset, redness of, 477 
Superconductor, 352 
Superluminal velocity, 424, 516 
Superposition principle, 57, 79,
94, 158 

Surface charge, 61, 100-101, 301 
Surface current, 218-219 
Surface element, 25, 39 
Surface integral, 24, 25 
Susceptibility 

complex, 427 

electric, 186-187, 207 

magnetic, 286-288, 290 
Susceptibility tensor, 191 
Symmetric tensor, 567, 568 
Symmetry 

for Ampère’s law, 234-235

azimuthal, 139 

of E, B, D, and H, 296 

for Gauss’s law, 70 

of Maxwell’s equation, 341-342 
Synchronization, 515-516 
Synchrotron radiation, 493 


TE waves, 433-436 

TEM waves, 432 

Tensor, 11 
antisymmetric, 566-568 
contravariant, 568 
covariant, 568 
dual, 568, 576 
field, 566-569 
polarizability, 168 
second-rank, 11, 566—567 
stress, 368—372 
susceptibility, 191 
symmetric, 567-568 

Terminal velocity, 314-315 

Tesla (unit), 223, 588 

Test charge, 57, 209 

Theta function, 47 


Third law, 259-260, 366-368, 469, 498, 548 


Thompson—Lampard theorem, 164 
Thomson dipole, 386 
Thomson kink model, 467, 471 
Three-dimensional wave equation, 402 
Threshold, 575 
Time 

advanced, 452 

proper, 538-540 

retarded, 450 
Time constant, 304, 329, 419 
Time dilation, 516-519, 527-528 
Time dilation paradox, 518-519 
Time reversal, 452 
Timelike 4-vector, 533 
TM waves, 432 
Toroidal coil, 235-236, 334 
Torque 

on electric dipole, 169-170 

on magnetic dipole, 268-272 


Total internal reflection, 439 
Trajectory, 111-112, 264-267 
Transformation 
gauge, 445-446, 573-574 
of angles, 524, 530 
of charge and current density, 570 
duality, 359-360, 482 
of electromagnetic fields, 557-564 
of forces, 549 
Galilean, 525-526, 533 
gauge, 445-446, 573-574 
of lengths, 519-524, 527 
Lorentz, 524-531, 574 
of momentum and energy, 541 
of velocity, 539 
Transformer, 356 
Translation, 12 
Transmission coefficient, 412, 417 
Transmission line, 322, 358, 436 
Transparency, 409 
Transverse wave, 399-400, 404, 420 
Triangle diagram 
electrodynamics, 468 
electrostatics, 85-86 
magnetostatics, 249, 261-262 
Triple product, 7 
Tunneling, 439, 505 
Twin paradox, 519, 530 


Uniqueness theorems, 118-123, 206, 262 


Unit vectors, xi, 3-4, 8, 38, 41 
Cartesian, 4 
curvilinear, 38, 578 
cylindrical, 42 
normal, 86-88 
spherical, 37, 40 

Units, xviii-xix, 588-590 
ampere, 215, 223 
coulomb, 58, 588 
esu (electrostatic unit), 588 
farad, 102 
gauss, 223, 588 
Gaussian, xix, 588-590 
Heaviside—Lorentz, xix 
henry, 327 
ohm, 300-301 
SI, xix, 588-590 
tesla, 223, 588, 590 
volt, 79 

Universal speed of light, 512 


Vector area, 55, 252 
Vector operator, 16 


Vector potential, 52, 243-245, 442-469 


direction of, 247 
dynamic configurations 


arbitrary charge distribution, 452, 484 


oscillating electric dipole, 475 
oscillating magnetic dipole, 481


point charge, arbitrary motion, 459-460 
point charge, constant velocity, 459-460 


static configurations 

arbitrary current configuration, 
243-244 

finite line current, 249 
infinite line current, 247—248 
infinite plane current, 247-248 
infinite solenoid, 246 
magnetic dipole, 252-254 
magnetized material, 276-277 
multipole expansion, 251-254 
specified magnetic field, 262 
spinning sphere, 245, 263 
uniform magnetic field, 248 


Vector products, 3 


cross product, 3, 6 
dot product, 2, 5 
multiplication by scalar, 2, 5, 531-533 


Vector triple products, 7 
Vector, 1, 9-12 


addition, 1-2, 4 
component, 5, 38 
contravariant, 532 
covariant, 532 
displacement, 1, 8-9 
four, 530-533 
magnitude, 1
polarization, 400 
position, 8 


potential, 52, 243-249, 442, 459, 585-587 


propagation, 405 
pseudovector, 12, 211 
separation, xi, 8, 15, 58 
subtraction, 2 

unit, see Unit vectors 


Velocity, see also Speed 


4-velocity, 538-540 
drift, 238, 302 

group, 424 

ordinary, 538 

phase, 424 

proper, 538-540 

wave, 424 

of waves on a string, 391 


Velocity addition rules, 513-514, 528-530 
Velocity field, 465, 487 
Volt (unit), 79 
Voltmeter, 354 
Volume charge, 61 
Volume current, 219 
Volume element 


Cartesian, 27 
curvilinear, 580 
cylindrical, 42 
spherical, 39 


Volume integral, 24, 27 


Wave equation, 390-393, 401-402 
for A, 447-448 
for B, 401-402 
for E, 401-402 
general solution, 392-393 
homogeneous, 392, 402 
inhomogeneous, 447 
one-dimensional, 392 
three-dimensional, 402 
for V, 446-448 

Wave guide, 430, 433 

Wave number, 393 

Wave vector, 405 

Wavelength, 393 

Waves 
complex, 395 
in conductors, 418-423 
dispersive, 423-424 
electromagnetic, 390-440 
evanescent, 439 
in free space, 401-407 
guided, 430-438 
in linear media, 408-417 
longitudinal, 399-400 
monochromatic, 402 
plane, 402-405 
sinusoidal, 393-396 
spherical, 438 
standing, 393, 434-435 
on a string, 390-401 
transverse, 399-400, 403 
velocity, 392, 402, 424 
water, 430 
Work 
and emf, 307, 331 
and potential, 89 
relativistic, 548 
Work done, see also Energy 
against back emf, 331 
in charging a capacitor, 103-104 
by magnetic forces, 214, 217-218, 379-384 
in moving a charge, 88-89 
in moving a dielectric, 195-203 
in moving a wire loop, 306-310 
in polarizing a dielectric, 197-206 
in setting up a charge configuration, 88-91 
Work-energy theorem, 548-549 
World line, 534-535 


BASIC EQUATIONS OF ELECTRODYNAMICS 


Maxwell’s Equations 


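In general:

$$
\begin{aligned}
\nabla \cdot \mathbf{E} &= \frac{1}{\epsilon_0}\,\rho \\
\nabla \times \mathbf{E} &= -\frac{\partial \mathbf{B}}{\partial t} \\
\nabla \cdot \mathbf{B} &= 0 \\
\nabla \times \mathbf{B} &= \mu_0 \mathbf{J} + \mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}
\end{aligned}
$$

In matter:

$$
\begin{aligned}
\nabla \cdot \mathbf{D} &= \rho_f \\
\nabla \times \mathbf{E} &= -\frac{\partial \mathbf{B}}{\partial t} \\
\nabla \cdot \mathbf{B} &= 0 \\
\nabla \times \mathbf{H} &= \mathbf{J}_f + \frac{\partial \mathbf{D}}{\partial t}
\end{aligned}
$$
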
Auxiliary Fields 
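Definitions:

$$
\mathbf{D} = \epsilon_0 \mathbf{E} + \mathbf{P}, \qquad
\mathbf{H} = \frac{1}{\mu_0}\mathbf{B} - \mathbf{M}
$$

Linear media:

$$
\mathbf{P} = \epsilon_0 \chi_e \mathbf{E}, \quad \mathbf{D} = \epsilon \mathbf{E}, \qquad
\mathbf{M} = \chi_m \mathbf{H}, \quad \mathbf{H} = \frac{1}{\mu}\mathbf{B}
$$

Potentials 

$$
\mathbf{E} = -\nabla V - \frac{\partial \mathbf{A}}{\partial t}, \qquad
\mathbf{B} = \nabla \times \mathbf{A}
$$

Lorentz Force Law 

$$
\mathbf{F} = q(\mathbf{E} + \mathbf{v} \times \mathbf{B})
$$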
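
As a small numerical illustration of the Lorentz force law above, the following sketch evaluates F = q(E + v x B) for Cartesian components. The function names and the sample values (a unit charge moving along x through a field along z, in arbitrary consistent units) are assumptions made for this example only; they do not come from the text.

```python
def cross(a, b):
    """Cross product of two 3-component vectors given as (x, y, z) tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def lorentz_force(q, E, v, B):
    """Return F = q (E + v x B), component by component."""
    v_cross_B = cross(v, B)
    return tuple(q * (E_i + vxB_i) for E_i, vxB_i in zip(E, v_cross_B))


# Hypothetical sample values: no electric field, velocity along x, B along z.
force = lorentz_force(1.0, (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 0.0, 1.0))
print(force)
```

With E = 0 the force is perpendicular to the velocity, which is why magnetic forces do no work on the charge.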


Energy, Momentum, and Power 


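$$
\text{Energy:} \quad U = \frac{1}{2} \int \left( \epsilon_0 E^2 + \frac{1}{\mu_0} B^2 \right) d\tau
$$

$$
\text{Momentum:} \quad \mathbf{P} = \epsilon_0 \int (\mathbf{E} \times \mathbf{B}) \, d\tau
$$

$$
\text{Poynting vector:} \quad \mathbf{S} = \frac{1}{\mu_0} (\mathbf{E} \times \mathbf{B})
$$

$$
\text{Larmor formula:} \quad P = \frac{\mu_0}{6 \pi c} \, q^2 a^2
$$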


FUNDAMENTAL CONSTANTS 


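$\epsilon_0 = 8.85 \times 10^{-12}\ \mathrm{C^2/N\,m^2}$ (permittivity of free space) 
$\mu_0 = 4\pi \times 10^{-7}\ \mathrm{N/A^2}$ (permeability of free space) 
$c = 3.00 \times 10^{8}\ \mathrm{m/s}$ (speed of light) 
$e = 1.60 \times 10^{-19}\ \mathrm{C}$ (charge of the electron) 
$m = 9.11 \times 10^{-31}\ \mathrm{kg}$ (mass of the electron) 
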
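
The first three constants are not independent: the speed of light follows from the permittivity and permeability of free space through c = 1/sqrt(epsilon_0 mu_0). The sketch below checks that the rounded values in the table are mutually consistent. The variable and function names are assumptions chosen for this illustration.

```python
import math

# Rounded values as listed in the table of fundamental constants.
EPSILON_0 = 8.85e-12        # C^2/(N m^2), permittivity of free space
MU_0 = 4 * math.pi * 1e-7   # N/A^2, permeability of free space
C_TABLE = 3.00e8            # m/s, speed of light as tabulated


def speed_of_light(epsilon_0, mu_0):
    """Return c = 1 / sqrt(epsilon_0 * mu_0)."""
    return 1.0 / math.sqrt(epsilon_0 * mu_0)


c_computed = speed_of_light(EPSILON_0, MU_0)
relative_difference = abs(c_computed - C_TABLE) / C_TABLE
print(f"c from epsilon_0 and mu_0: {c_computed:.3e} m/s")
print(f"relative difference from tabulated value: {relative_difference:.1e}")
```

The small residual difference reflects the three-figure rounding of the tabulated values, not any inconsistency in the relation itself.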
SPHERICAL AND CYLINDRICAL COORDINATES 
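Spherical 

$$
\begin{aligned}
x &= r \sin\theta \cos\phi & \hat{\mathbf{x}} &= \sin\theta \cos\phi \, \hat{\mathbf{r}} + \cos\theta \cos\phi \, \hat{\boldsymbol{\theta}} - \sin\phi \, \hat{\boldsymbol{\phi}} \\
y &= r \sin\theta \sin\phi & \hat{\mathbf{y}} &= \sin\theta \sin\phi \, \hat{\mathbf{r}} + \cos\theta \sin\phi \, \hat{\boldsymbol{\theta}} + \cos\phi \, \hat{\boldsymbol{\phi}} \\
z &= r \cos\theta & \hat{\mathbf{z}} &= \cos\theta \, \hat{\mathbf{r}} - \sin\theta \, \hat{\boldsymbol{\theta}}
\end{aligned}
$$

$$
\begin{aligned}
r &= \sqrt{x^2 + y^2 + z^2} & \hat{\mathbf{r}} &= \sin\theta \cos\phi \, \hat{\mathbf{x}} + \sin\theta \sin\phi \, \hat{\mathbf{y}} + \cos\theta \, \hat{\mathbf{z}} \\
\theta &= \tan^{-1}\!\left( \sqrt{x^2 + y^2} / z \right) & \hat{\boldsymbol{\theta}} &= \cos\theta \cos\phi \, \hat{\mathbf{x}} + \cos\theta \sin\phi \, \hat{\mathbf{y}} - \sin\theta \, \hat{\mathbf{z}} \\
\phi &= \tan^{-1}(y/x) & \hat{\boldsymbol{\phi}} &= -\sin\phi \, \hat{\mathbf{x}} + \cos\phi \, \hat{\mathbf{y}}
\end{aligned}
$$

Cylindrical 

$$
\begin{aligned}
x &= s \cos\phi & \hat{\mathbf{x}} &= \cos\phi \, \hat{\mathbf{s}} - \sin\phi \, \hat{\boldsymbol{\phi}} \\
y &= s \sin\phi & \hat{\mathbf{y}} &= \sin\phi \, \hat{\mathbf{s}} + \cos\phi \, \hat{\boldsymbol{\phi}} \\
z &= z & \hat{\mathbf{z}} &= \hat{\mathbf{z}}
\end{aligned}
$$

$$
\begin{aligned}
s &= \sqrt{x^2 + y^2} & \hat{\mathbf{s}} &= \cos\phi \, \hat{\mathbf{x}} + \sin\phi \, \hat{\mathbf{y}} \\
\phi &= \tan^{-1}(y/x) & \hat{\boldsymbol{\phi}} &= -\sin\phi \, \hat{\mathbf{x}} + \cos\phi \, \hat{\mathbf{y}} \\
z &= z & \hat{\mathbf{z}} &= \hat{\mathbf{z}}
\end{aligned}
$$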
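
The coordinate conversions listed above can be sketched in code as follows. This is a minimal illustration, not part of the original formula sheet: the function names are hypothetical, and `math.atan2` is used in place of a bare inverse tangent (an assumption made here) so that the angles land in the correct quadrant when x or z is negative.

```python
import math


def cartesian_to_spherical(x, y, z):
    """Return (r, theta, phi), with theta the polar angle measured from +z."""
    r = math.sqrt(x * x + y * y + z * z)
    theta = math.atan2(math.hypot(x, y), z)  # lies between 0 and pi
    phi = math.atan2(y, x)                   # azimuthal angle
    return r, theta, phi


def spherical_to_cartesian(r, theta, phi):
    """Return (x, y, z) from spherical coordinates."""
    return (
        r * math.sin(theta) * math.cos(phi),
        r * math.sin(theta) * math.sin(phi),
        r * math.cos(theta),
    )


def cartesian_to_cylindrical(x, y, z):
    """Return (s, phi, z), with s the distance from the z axis."""
    return math.hypot(x, y), math.atan2(y, x), z


def cylindrical_to_cartesian(s, phi, z):
    """Return (x, y, z) from cylindrical coordinates."""
    return s * math.cos(phi), s * math.sin(phi), z


# Round trip for an arbitrary sample point.
point = (1.0, -2.0, 3.0)
print(spherical_to_cartesian(*cartesian_to_spherical(*point)))
print(cylindrical_to_cartesian(*cartesian_to_cylindrical(*point)))
```

Converting a point to either system and back should reproduce the original Cartesian components up to floating-point rounding.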


